Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4)

18
1 Farhan Mohamed Ali (W2- 1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Presentation 8 MAD MAC 525 22 nd March, 2006 Functional Block and Simulations W2 Project Objective: Design a crucial part of a GPU called the Multiply Accumulate Unit (MAC) which will revolutionize graphics. Design Manager: Zack Menegakis

description

Presentation 8 MAD MAC 525. Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4). W2. Design Manager: Zack Menegakis. 22 nd March, 2006 Functional Block and Simulations. Project Objective: - PowerPoint PPT Presentation

Transcript of Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4)

Page 1: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

1

Farhan Mohamed Ali (W2-1)Jigar Vora (W2-2)Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4)

Presentation 8

MAD MAC 525

22nd March, 2006Functional Block and Simulations

W2

Project Objective:Design a crucial part of a GPU called the Multiply Accumulate Unit (MAC) which will revolutionize graphics.

Design Manager: Zack Menegakis

Page 2: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

2

MAD MAC 525 Status:Project chosenSpecifications definedArchitecture

DesignBehavioral VerilogTestbenchesVerilog : Gate Level DesignFloor plan Schematics and Analog VerificationsLayout of basic gates and small blocks Large block layouts, extractions, LVS, simulations (in progress) Spring Break

To be doneFull chip layout and simulation

Page 3: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

3

RegArray A RegArray B RegArray C

Multiplier Exp Calc Align

Adder/SubtractorControlLogic

&Sign

Dtrmin

Normalize

Round

Ovf Checker

Leading 0 Anticipator

10 10 10

5

55

1435225

4

36

14

101

5

5

Input Input Input

Output

16 16 16

16RegY

15

1

1

1

Block Diagram

Page 4: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

4

Design Decisions• Pipelining Stages: Add another stage in

multiplier since adder is very fast. – Projected speed is at least 400 MHz – Exceeds the design goal of 300MHz

Page 5: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

5

Pipelining Stages

MultiplierAlign

C

Reg A

Reg

BExpCalc

Reg C

Pipeline Reg Pipeline Reg

AdderLd Zero

Pipeline Reg

NormalizeRound

Reg Y

Pipeline Reg

Overflow checker

Pipeline Reg

Pipeline Reg

Page 6: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

6

Timing DiagramPipeline stage 1

Pipeline stage 2

Pipeline stage 3

Pipeline stage 4

Pipeline stage 5

Multiplier lower 7 outputs

Multiplier mid 4 outputs

Multiplier top 11 outputs

Adder Normalize

Exponent calculator

Holds exponent calculator?

Align Zero Counter

Round

Holds exponent calculator bits

Holds exponent calculator bits

Overflow Checker

Page 7: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

7

New Floorplan

Page 8: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

8

Design Decisions• Pipelining Stages: Add another stage in

multiplier since adder is very fast. – Projected speed is at least 400 MHz – Exceeds the design goal of 300MHz

• Optimized adder design which implements carry look ahead architecture – Propagation delay of around 800ps (1250MHz)– Bit slicing the adder in the layout

Page 9: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

9

Adder Schematic

Page 10: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

10

Adder Schematic Simulation

Page 11: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

11

Adder Bit Slice Layout

Page 12: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

12

Transistor Count

Area in um2

Prop. Delay

Power in mW (350MHz)

Multiplier 3600 16560 4.64n 8.5Exponents 738 3800 942p 1.608Align 500 2990 637p 0.393Adder 3174 15870 1n 5.236Leading 0 364 1222 551p 0.857Normalize 942 3176 430p 2.291Round 462 2310 948p 0.631OvfCheck 100 500 475p 0.13Registers 1850 9200 120p -

Total 11730 55628 - -

Page 13: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

13

Optimized Round Schematic

Page 14: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

14

Normalize Layout

Page 15: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

15

Align Layout

Page 16: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

16

Align Simulation

Page 17: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

17

Exponents progress…

Page 18: Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3)  Avni Jhunjhunwala (W2-4)

18

Questions??