EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80...

15
1 EE241 Spring 2011 EE241 - Spring 2011 Advanced Digital Integrated Circuits Lecture 23: Wrap-up Announcements Homework #4 due today Quiz #4 today Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon 6 pages, double column Project presentations next Wednesday May 4 at 2pm in 2 Project presentations next Wednesday, May 4, at 2pm in BWRC 20min + 5 min Q&A

Transcript of EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80...

Page 1: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

1

EE241 Spring 2011EE241 - Spring 2011Advanced Digital Integrated Circuits

Lecture 23: Wrap-up

Announcements

Homework #4 due today

Quiz #4 today

Final exam on Wedensday!80 minutes, in class

Project reports due next Wednesday, May 4, noon6 pages, double column

Project presentations next Wednesday May 4 at 2pm in

2

Project presentations next Wednesday, May 4, at 2pm in BWRC

20min + 5 min Q&A

Page 2: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

2

Outline

Last lectureOther dynamic logic styles

Adders: Conditional sum and carry-lookahead

This lectureFinish adders

Perspective

Reading: Selected publications

3

g p

AddersAdders

Page 3: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

3

Carry Tree Considerations

Number of signals merging at each stage (radix)Uniform vs. non-uniform

Number of logic levels

Full vs. sparse trees

5

Tree Adders: Kogge-Stone

S0

S1

S2

S3

S4

S5

S6

S7

S8

S9

S10

S11

S12

S13

S14

S15

6

16-bit radix-2 Kogge-Stone Tree

(A0,

B0)

(A1,

B1)

(A2,

B2)

(A3,

B3)

(A4,

B4)

(A5,

B5)

(A6,

B6)

(A7,

B7)

(A8,

B8)

(A9,

B9)

(A1

0, B

10)

(A1

1, B

11)

(A1

2, B

12)

(A1

3, B

13)

(A1

4, B

14)

(A1

5, B

15)

Page 4: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

4

Tree Adders: Other Trees

Ladner-Fischer

S0

S1

S2

S3

S4

S5

S6

S7

S8

S9

S1

0

S1

1

S1

2

S1

3

S1

4

S1

5

S S S S S S S S S S S S S S S S

7

(A0, B

0)

(A1, B

1)

(A2, B

2)

(A3, B

3)

(A4, B

4)

(A5, B

5)

(A6, B

6)

(A7, B

7)

(A8, B

8)

(A9, B

9)

(A1

0, B

10)

(A1

1, B

11)

(A1

2, B

12)

(A1

3, B

13)

(A1

4, B

14)

(A1

5, B

15)

Tree Adders: Radix 4

S0

S1

S2

S3

S4

S5

S6

S7

S8

S9

S1

0

S1

1

S1

2

S1

3

S1

4

S1

5

8

(a0,

b0)

(a1,

b1)

(a2,

b2)

(a3,

b3)

(a4,

b4)

(a5,

b5)

(a6,

b6)

(a7,

b7)

(a8,

b8)

(a9,

b9)

(a1

0,

b1

0)

(a1

1,

b1

1)

(a1

2,

b1

2)

(a1

3,

b1

3)

(a1

4,

b1

4)

(a1

5,

b1

5)

16-bit radix-4 Kogge-Stone Tree

Page 5: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

5

Ling Adder

CLA Ling’s equations

:0 1:0

1:0

i i i

i i i

i i i i

i i i i

g a b

p a b

G g p G

S a b G

:0 1 1:0

:0 1 1:0

i i i

i i i

i i i i

i i i i i i

g a b

t a b

H g t H

S t H g t H

9Ling, IBM J. Res. Dev, 5/81

Ling Adder

G g p g p p g p p p g Conventional radix-4

3:0 3 3 2 3 2 1 3 2 1 0G g p g p p g p p p g

3:0 3 2 2 2 1 1 2 1 0 0

3 2 2 1 2 1 0

H g t g t t g t t t g

g g t g t t g

Ling’s radix-4

10

Reduces the stack height (or width)Reduces input loading

Page 6: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

6

Ling vs. CLA

Conventional G3Ling’s H3

C K

a3

b3

a3 b3

a2

b2

a2

a1

b2

a1 b1

G 3

CK

a3

b3

a2 a2

b2 a1

b1

b1

a0

H3

b2

a1

11

b1 a0

b0

b1 a0

b0

Ling vs. CLA: Sum Pre-Computation

Conventional CLA Ling’s

0

1

i i i

i i i

S a b

S a b

0

11 1

i i i

i i i i i

S a b

S a b a b

12

Page 7: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

7

Ling vs. CLA (64 bit)

44

49

Radix-2 Ling

0.5 FO4

1 1

19

24

29

34

39

En

erg

y [

pJ

]

Radix-4 Ling

Radix-2 CLA

Radix-4 CLA

0.5 FO4

1

3 2

3

4

13

4

9

14

7 9 11 13 15

Delay [FO4]

24

Ling vs. CLA

Tradeoff between the first carry and the sum circuit complexityLater carry stages are unchanged from conventional CLA

Reducing the input loading and smaller stack speed up the carryReducing the input loading and smaller stack speed up the carrySum gets more complex

With tight power constraints Ling is slower than CLA

14

Page 8: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

8

Sparse Trees

Not all the carries are calculatedOnly every 2nd or 4th

R d d i t itReduced input capacitanceMore complex sum

Sparseness of 2 doubles the sum select loading

Effectively shifting the fanout towards back

15

Sparse Trees

S1

S3

S5

S7

S9

S1

1

S1

3

S1

5

S0

S2

S4

S6

S8

S1

0

S1

2

S1

4

16

(a0,

b0)

(a1,

b1)

(a2,

b2)

(a3,

b3)

(a4,

b4)

(a5,

b5)

(a6,

b6)

(a7,

b7)

(a8,

b8)

(a9,

b9)

(a1

0,

b1

0)

(a1

1,

b1

1)

(a1

2,

b1

2)

(a1

3,

b1

3)

(a1

4,

b1

4)

(a1

5,

b1

5)

16-bit radix-2 sparse Kogge-Stone tree with sparseness of 2 (Han-Carlson)

Page 9: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

9

Sparse Trees

Precomputed sums for sparse Ling adder

Even bit sums unchanged

Odd bit lOdd bit sums complex

221111

1

110

iiiiiiiii

iiiii

babababaS

babaS

17

Sparse Trees

Ladner-Fischer

18

Page 10: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

10

Sparse TreesFull trees

19

24

29

pJ

]1-1-1-1-1-1

2-2-2-2-2-1

4-4-4-4-2-1

8-8-8-4-2-1

16-16-8-4-2-1

32-16-8-4-2-1

1

2

3

4

5

6

1 2 3

Lateral fanout

Sparse-2 trees

19

24

29

[pJ

]

1-1-1-1-1-1

2-2-2-2-2-1

4-4-4-4-2-1

8-8-8-4-2-1

16-16-8-4-2-1

32-16-8-4-2-1

1

2

3

4

5

6

1 2 3 4

5

4

9

14

19

7 9 11 13 15

Delay [FO4]

En

erg

y [p

456

4

9

14

7 9 11 13 15

Delay [FO4]

En

erg

y

Lateral fanout

6

Sparse-4 trees

29

1-1-1-1-1-1

2-2-2-2-2-1

4 4 4 4 2 1

1

2

31

194

9

14

19

24

7 9 11 13 15

Delay [FO4]

En

erg

y [

pJ

]

4-4-4-4-2-1

8-8-8-4-2-1

16-16-8-4-2-1

32-16-8-4-2-1

3

4

5

6

2

4

5

6

3

Lateral fanout

Zlatanovici, JSSC’09

Other Sparse Trees

20Mathew, VLSI’02

Page 11: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

11

Intel’s 65nm 32b ALU

WijeratneISSCC’06

21

Radix-2 carry tree generates every fourth carry 73% fewer carry-merge gates 80% reduction in wiring complexity

vs. dense parallel-prefix

adders

Grouping Gates

29

Radix-4 sparse-2 Radix-4 Ladner-Fischer sparse-4

14

19

24

29

En

erg

y [p

J]

Radix-2 full

Radix-4 full

Radix-2 Ladner-Fischer full

g = grouped sizing f = flat sizing

gg

g

f fg

f

22

4

9

7 8 9 10 11 12 13 14 15Delay [FO4]

g

f f

Page 12: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

12

Sparse Trees

1-1-1-1-1-1 trees

24

29Full

Sparse-2

Sparse-4

1

2

3

32-16-8-4-2-1 trees

24

29

Full

Sparse-2

Sparse-4

1

2

3

4

9

14

19

24

7 9 11 13 15

Delay [FO4]

En

erg

y [p

J] Sparseness

1

23

4

9

14

19

24

7 9 11 13 15

Delay [FO4]

En

erg

y [

pJ

]

Sparseness

1

2 3

23

What is the fastest 64-b adder?

32-Bit Adders

24Patil, ARITH’07

Page 13: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

13

Hybrid Adders

25Dobberpuhl, JSSC 11/92 DEC Aplha 21064

DEC Adder

Combination:8-bit tapered pre-discharged Manchester carry chains, with Cin = 0

d C 1and Cin = 1

32-bit LSB carry-lookahead

32-bit MSB conditional sum adder

Carry-select on most significant bits

Latch-based timing

26

Page 14: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

14

Another DEC Adder

27

Propagate-kill cell

Group propagate-kill Kowaleski, ISSCC’96

This Class

Tried to put design choices in perspective of technology

The design constraints have changed and will be changing

Cost, energy, (power, leakage, …), performance

Stressed on variability, power-performance tradeoffs

Did not cover multipliers, power regulation/distribution

28

p , p g(class projects), I/O

Page 15: EE241 - Spring 2011bwrcs.eecs.berkeley.edu/Classes/icdesign/...Finale...Final exam on Wedensday! 80 minutes, in class Project reports due next Wednesday, May 4, noon ... Intel’s

15

This Field

Moore’s law will end sometime during your (my?) career28nm in 2011 scales to 0.1nm by 2050 with 2-yr cycles (or to 1nm

ith 4 l )with 4-yr cycles)

Physics will stop CMOS somewhere around 5nmWe will see a different CMOS device beforehand

Economics will likely stop it earlierAnd the nodes will be stretched out

29

Don’t worry: There is plenty of problems that we don’t know how to solve today, and they will be around for a while!

Even filling 10B/100B/1 trillion transistor chips with SRAM is not trivial!

Technology Strategy / Roadmap 2000 2005 2010 2015 2020 2025 2030

Plan A: Extending Si CMOSPlan A: Extending Si CMOS

Plan B: Subsytem IntegrationPlan B: Subsytem Integration

R D

R D

30

Plan C: Post Si CMOS Options Plan C: Post Si CMOS Options

R R&D

Plan Q:Plan Q:

R D

Quantum ComputingQuantum Computing

T.C. Chen, Where Si-CMOS is going: Trendy Hype vs. Real Technology, ISSCC’06