Deep Challenges.
Transcript of Deep Challenges.
-
7/29/2019 Deep Challenges.
1/45
-
7/29/2019 Deep Challenges.
2/45
The Deep Sub-Micron ChallengeThe Deep Sub-Micron Challenge
Microscopic Problems Wiring Load Management
Noise, Crosstalk Reliability, Manufacturability
Complexity: LRC, ERC Accurate Power Prediction Accurate Delay Prediction
etc.
Everything Looks a Little Different
Macroscopic Issues Time-to-Market
Millions of Gates High-Level Abstractions Reuse & IP: Portability
Predictability
etc.
and Theres a Lot of Them!
DSM 1/DSM
?
-
7/29/2019 Deep Challenges.
3/45
Dens ity Acc e s s Time
(Gb its /c m2) (ns )
DRAM 8.5 10
DRAM (Lo g ic ) 2.5 10
S RAM (Cac he ) 0.3 1.5
Density Max. Ave. Powe Clock Rate
(Mgates/cm2) (W/cm2) (GHz)
Custom 25 54 3
Std. Cell 10 27 1.5
Gate Array 5 18 1
Single-Mask GA 2.5 12.5 0.7
FPGA 0.4 4.5 0.25
Design at a CrossroadDesign at a Crossroad
Silicon technology trackingSilicon technology tracking MooresMoores LawLaw
Die Area: 2.5x2.5 cmVoltage: 0.6 - 0.9 V
Technology: 0.07 m
15 times denser
than today
2.5 times power
density
5 times clock rate
Silicon in 2010Silicon in 2010
-
7/29/2019 Deep Challenges.
4/45
Design at a CrossroadDesign at a Crossroad
Silicon technology trackingSilicon technology tracking MooresMoores LawLaw
1970 1980 1990 2000 2010
Year
1 Gbits 0.15-0.2m
256 Mbits0.25-0.3m
4 Gbits0.15m
64 Mbits0.35-0.4m
16 Mbits 0.5-0.6m
1 Mbits1.0-1.2m
4 Mbits 0.7-0.8m
256 Kbits
1.6-2.4m
64 Kbits
1010
109
108
107
106
105
104
Numberofbitsperchip
MicroprocessorPower(MIPS)
1000.0
100.0
10.0
1.0
1980 1985 1990 1995 2000
Year
P7
P6
Pentium486
386
286
64 Gbits
0.08m*
Encyclopedia2 hrs CD Audio30 sec HDTV
Encyclopedia
2 hrs CD Audio30 sec HDTV
Human memoryHuman DNA
Human memoryHuman DNA
BookBook
PagePage
Courtesy of David Eaglesham, Lucent
-
7/29/2019 Deep Challenges.
5/45
Power DissipationPower Dissipation
0
10
20
30
40
50
60
70
Watts/cm
2
Surpassed Hot-Plate PowerSurpassed Hot-Plate Power
Density in 0.6Density in 0.6 m CMOSm CMOS
Courtesy Intel
Pentium
(R)486
386
PentiumPro (R)
0
1
10
100
1,000
10,000
1985 1990 1995 2000 2005 2010
Icc(amps)
100-2,000amps100-2,000amps
Due to 30%Vdd scaling
-
7/29/2019 Deep Challenges.
6/45
Challenges in Deep-Challenges in Deep-SubmicronSubmicron DesignDesign
qq Device scalingDevice scaling Scaling of the voltageScaling of the voltage
The leaky transistorThe leaky transistor
Short- and long-term reliabilityShort- and long-term reliabilityqq Interconnect scalingInterconnect scaling
CapacitanceCapacitance
ResistanceResistance InductanceInductance
-
7/29/2019 Deep Challenges.
7/45
Transistor ScalingTransistor Scaling
(velocity-saturated devices)(velocity-saturated devices)
-
7/29/2019 Deep Challenges.
8/45
DSM devices: Evolution ofDSM devices: Evolution of IIdsatdsat
Data taken from 16 papers (IBM,TI, Bell Labs, Motorola, Intel, AMD)
Demonstrates a relatively constant Idsat
from Ldrawn
of 0.25 to 0.09 m
[Sylvester, Keutzer, 98]
0.08 0.12 0.16 0.20 0.24200
300
400
500
600
700
800
200
300
400
500
600
700
800
NMOS
PMOS
Idsat
(A/m)
Drawn Channel Length (m)
-
7/29/2019 Deep Challenges.
9/45
Evolution of Power DensityEvolution of Power Density
Source: Sakurai97
-
7/29/2019 Deep Challenges.
10/45
Scaling the Supply VoltageScaling the Supply Voltage
0 0.05 0.1 0.15 0.20
0.05
0.1
0.15
0.2
Vin
(V)
Vout
(V)
Minimum Feature Size (micron)10
-11
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
SupplyV
oltage
(V)
Scaling forced by reliabilityScaling forced by reliability
and power considerationsand power considerations
Scaling limited by gainScaling limited by gain
and thermal noiseand thermal noise
-
7/29/2019 Deep Challenges.
11/45
Projected Evolution inProjected Evolution in IIoffoff
0
2
4
6
8
10
12
250 180 130 100 70 50
Technology Node
Off-curre
nt(25C)
0
2
4
6
8
10
12
250 180 130 100 70 50
Technology Node
Off-curr
ent(25C)
-
7/29/2019 Deep Challenges.
12/45
Power & Delay Dependence on VPower & Delay Dependence on VDDDD & V& VTHTH
Courtesy Sakurai97
-
7/29/2019 Deep Challenges.
13/45
Power-Delay vs Energy-Delay Product
-
7/29/2019 Deep Challenges.
14/45
Reduced VReduced VDDDD/V/VTT ratioratio
Reduces PredictabilityReduces Predictability
[Sakurai&Kuroda]
-
7/29/2019 Deep Challenges.
15/45
DSM Reduces PredictabilityDSM Reduces Predictability
Degradation of IV characteristics of NMOS transistorDegradation of IV characteristics of NMOS transistor
due to hot-electron effects [McGaughy88]due to hot-electron effects [McGaughy88]
-
7/29/2019 Deep Challenges.
16/45
Silicon-on-InsulatorSilicon-on-Insulator
ttSiSi < 50< 50 nmnmGate
Buried Oxide (BOX)
P Substrate
tBOX
tOX
OxideOxide tSi n+n+
qq Extension beyond Bulk CMOSExtension beyond Bulk CMOS
Scaling LimitScaling Limit
qq Performance ImprovementPerformance Improvement
Reduced Junction CapacitanceReduced Junction Capacitance
Absence of Reverse-Body EffectAbsence of Reverse-Body Effect
qq Soft Error Rate (SER) ImprovementSoft Error Rate (SER) Improvement
qq Reduced leakage (low-power)Reduced leakage (low-power)
qq Latch-Up EliminationLatch-Up Elimination
qq Ease of Device IsolationEase of Device Isolation
qq Potentially Reduced WaferPotentially Reduced Wafer
Fabrication CostFabrication Cost
-
7/29/2019 Deep Challenges.
17/45
IBMIBM 64b64b PowerPCPowerPCqq Bulk CMOS Base DesignBulk CMOS Base Design
0.12 m0.12 m LLeffeff, 6LM (Cu), 6LM (Cu)
450 MHz, 22 W @ 1.8 V450 MHz, 22 W @ 1.8 V
qq PD/SOI TechnologyPD/SOI Technology
0.12 m0.12 m LLeffeff , 6LM (Cu), 6LM (Cu)
550 MHz, 24 W @ 1.8 V550 MHz, 24 W @ 1.8 V
(D. H. Allen et al., ISSCC, 1999)
CMOS6S2 CMOS7S CMOS7S SOICore Clock Frequency 350 MHz 450 MHz 550 MHz
L1 Cache 64KB-I + 64KB-D 128KB-I + 128KB-D 128KB-I + 128KB-D
L2 Directory N/A 104 x 16 K 104 x 16 K
Supply Voltage 2.5 V 1.8 V 1.8 V
Transistors 12 M 34 M 34 M
Die Size 162 mm2 139 mm2 139 mm2
Power 34 W 22 W 24 W
Leff(nFET) 0.18 m 0.12 m 0.12 m
TOX 5.0 nm 3.5 nm 3.5 nm
Metalization 5 Layers AL 6 Layers Cu 6 Layers Cu
Contacted M2 M4
Pitch
1.26 m 0.81 m 0.81 m
-
7/29/2019 Deep Challenges.
18/45
Alternatives to planar CMOS:Alternatives to planar CMOS:
Vertical transistorsVertical transistors
qq Some early types of vertical structures discussed since 1980sSome early types of vertical structures discussed since 1980s
qq Separates performance-scaling from packing-scalingSeparates performance-scaling from packing-scaling
qq
RequireRequire
manufacturablemanufacturable
process with low parasiticsprocess with low parasitics
Gate Gate
Source
Drain
-
7/29/2019 Deep Challenges.
19/45
Dual-Gate BerkeleyDual-Gate Berkeley FinFETFinFET (1999)(1999)
SiO2
Closer to planar technology Proven operation for NMOS and PMOS down to 18 nm Can be scaled down to 10 nm Suppresses short-channel effects! [Huang, IEDM99]
-
7/29/2019 Deep Challenges.
20/45
Berkeley PMOS FINFET (Berkeley PMOS FINFET (LgLg = 45= 45 nmnm))
S = 69 mV/decade Highest reported PMOS Drive Current
[Huang, IEDM99]
-
7/29/2019 Deep Challenges.
21/45
-
7/29/2019 Deep Challenges.
22/45
The Interconnect ChallengeThe Interconnect Challengeqq With increases in performance and integrationWith increases in performance and integration
density, wire parasitics gain dominancedensity, wire parasitics gain dominance
qq The wire combines capacitance, resistance, andThe wire combines capacitance, resistance, and
inductanceinductance
qq Wire parasitics impact performance, energyWire parasitics impact performance, energy
dissipation and reliabilitydissipation and reliability
transmitters receivers
-
7/29/2019 Deep Challenges.
23/45
Interconnect DistributionInterconnect Distribution
10 100 1,000 10,000 100,000
Length (u)
Noofn
ets
(LogSc
ale)
Pentium Pro (R)
Pentium(R) IIPentium (MMX)
Pentium (R)
Pentium (R) II
Source
:Intel
-
7/29/2019 Deep Challenges.
24/45
The Ideal Wire Scaling ModelThe Ideal Wire Scaling Model
While transistor delay
scales as 1/S!
The RC DilemmaThe RC Dilemma
-
7/29/2019 Deep Challenges.
25/45
Constant Resistance ScalingConstant Resistance Scaling
Scaling would increase R (Scaling would increase R ( SS33))
historically aspect ratio has increased to compensatehistorically aspect ratio has increased to compensate
-
7/29/2019 Deep Challenges.
26/45
-
7/29/2019 Deep Challenges.
27/45
Will Interconnect Dominate Delay?Will Interconnect Dominate Delay?
qq # logic levels decreasing# logic levels decreasing
(architecture)(architecture)
qq Min. gate size shrinkingMin. gate size shrinking
qq ParasiticsParasitics increase due toincrease due to
scalingscaling
qq Increasing RC delay withIncreasing RC delay with
chip sizechip size
delay
Year
1
0.25
0.5
88 94 00
gate delay
delay due to
sizing and buffering
interconnect delay
From Aykut Dengi
1996 ICCAD tutorial
-
7/29/2019 Deep Challenges.
28/45
Or Will Its Impact Decrease?Or Will Its Impact Decrease?
0
10
20
30
40
5060
70
80
90
100110
120
0
10
20
30
40
5060
70
80
90
100110
120
2-input NAND, FO = 2, W/L = 16
0.250.180.130.1
Delay
(ps)
Process Generation (m)
Gate delayStage delay [Keutzer98]
Shorter local wire length, transistor sizing, and low-k dielectricsShorter local wire length, transistor sizing, and low-k dielectrics
-
7/29/2019 Deep Challenges.
29/45
Interconnect ProjectionsInterconnect Projections
Low-k dielectricsLow-k dielectrics
qq BothBoth delay and power are reduceddelay and power are reducedby droppingby dropping
interconnect capacitanceinterconnect capacitanceqq Types of low-k materials include: inorganic (SiOTypes of low-k materials include: inorganic (SiO22),),
organic (organic (PolyimidesPolyimides) and) and aerogelsaerogels (ultra low-k)(ultra low-k)
qq The numbers below are on theThe numbers below are on the
conservative side of the NRTS roadmapconservative side of the NRTS roadmap
Generation 0.25
m
0.18
m
0.13
m
0.1
m
0.07
m
0.05
m
Dielectric
Constant
3.3 2.7 2.3 2.0 1.8 1.5
F C i GNDF C it t GND t
-
7/29/2019 Deep Challenges.
30/45
From Capacitance-to-GND toFrom Capacitance-to-GND to
InterwireInterwire CapacitanceCapacitance
fringing parallel
-
7/29/2019 Deep Challenges.
31/45
CrosstalkCrosstalkW S
Ca CaCv
Ground Plane
T
H
Cc Cc
Neighboring wires switch,
coupling to a quiet line
Quiet line sees a undesired
voltage spike
Crosstalk can lead to:- Logic faults (especially in dynamic circuits)
- Voltage overshoot (stress, forward-bias PN junctions)
Voltage spike, Vx Cc / Ctotal Vx is a complex function of
- Driver strength
- Fan-out capacitance
- Wiring resistance
-
7/29/2019 Deep Challenges.
32/45
Delay DegradationDelay Degradation
Cc- Impact of neighboring signalactivity on switching delay
- When neighboring lines switch
in opposite direction of victim
line, delay increases
Miller EffectMiller Effect
- Both terminals of capacitor are switched in opposite directions
(0 Vdd, Vdd 0)
- Effective voltage is doubled and additional charge is needed
(from Q=CV)
-
7/29/2019 Deep Challenges.
33/45
Structured and Predictable InterconnectStructured and Predictable Interconnect
S
S SV V S
G
S
SV
G
V
Example: Dense Wire Fabric (DWF) [Khatri, DAC99]Trade-off:
Cross-coupling capacitance 40x lower, 2% delay variation
Increase in area and overall capacitance
-
7/29/2019 Deep Challenges.
34/45
The Impact ofThe Impact of ResistivityResistivity
CN-1 CNC2
R1 R2
C1
Tr
Vin
RN-1 RN
0 0.5 1 1.5 2 2.5 3 3 .5 4 4.5 50
0.5
1
1.5
2
2.5
t ime (n sec)
voltage(
V)
x= L/10
x = L/4
x = L/2
x= L
0 0.5 1 1.5 2 2.5 3 3 .5 4 4.5 5
0
0.5
1
1.5
2
2.5
t ime (nsec )
voltage(
V)
x= L/10
x = L/4
x = L/2
x= L
Diffused signalDiffused signal
propagationpropagation
Delay ~ LDelay ~ L22
The distributedThe distributed rcrc-line-line
U i C I t tU i C I t t
-
7/29/2019 Deep Challenges.
35/45
Using Copper as InterconnectUsing Copper as Interconnect
MaterialMaterial
With cladding and other effects,
Cu ~ 2.2 mW-cm vs. 3.5 for Al(Cu)40% reduction in resistance
Yields 12% performanceimprovement over an aluminum
process in a PowerPC design
Electromigration improvement;100X longer lifetime (IBM, IEDM97)
Electromigration is a limiting
factor beyond 0.18 mm if Al isused (HP, IEDM95)
Transistor SEM
-
7/29/2019 Deep Challenges.
36/45
The Global Wire ProblemThe Global Wire Problem( )outwwdoutdwwd CRCRCR693.0CR377.0T +++=
ChallengesChallenges
qq No further improvements to be expected after theNo further improvements to be expected after the
introduction of Copper (introduction of Copper (superconductingsuperconducting, optical?), optical?)
qq Design solutionsDesign solutions
Use of fat wiresUse of fat wires
Insert repeaters but might become prohibitive (power, area)Insert repeaters but might become prohibitive (power, area)
Efficient chipEfficient chip floorplanningfloorplanning
qq Towards Towards communication-basedcommunication-based design design
How to deal with latency?How to deal with latency?
Is synchronicity an absolute necessity?Is synchronicity an absolute necessity?
Architect re M st E ol e to FitArchitecture Must Evolve to Fit
-
7/29/2019 Deep Challenges.
37/45
Architecture Must Evolve to FitArchitecture Must Evolve to Fit
the Landscapethe Landscape
20 Clocks
90,000
tracks
Local, parallel operationsLocal, parallel operations
High bandwidthHigh bandwidth
Low latency &Low latency &
Low powerLow power
Global operationsGlobal operations
Low bandwidthLow bandwidth
High latency &High latency &High powerHigh power
Source: Bill Dally, Stanford
-
7/29/2019 Deep Challenges.
38/45
Interconnect: # of Wiring LayersInterconnect: # of Wiring Layers
# of metal layers is steadily increasing due to:
Increasing die size and device count: we need
more wires and longer wires to connecteverything
Rising need for a hierarchical wiring network;local wires with high density and global wires withlow RC
substrate
poly
M1
M2
M3
M4
M5
M6
Tins
H
WS
= 2.2
-cm
0.25 m wiring stack
Minimum Widths (Relative)
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
1.0 0.8 0.6 0.35 0.25
M5
M4
M3
M2
M1
Poly
Minimum Spacing (Relative)
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
4.0
1.0 0.8 0.6 0.35 0.25
M5M4
M3
M2
M1
Poly
Resistance and the PowerResistance and the Power
-
7/29/2019 Deep Challenges.
39/45
Resistance and the PowerResistance and the Power
Distribution ProblemDistribution Problem
Pentium
(R)486386
Pentium
Pro (R)
0
1
10
100
1,000
10,000
1985 1990 1995 2000 2005 2010
Icc(amps)
100-3,000amps
VDD
X
I
I
R
R
VDD - V
V
V
pre
RI dropRI drop
Resistance and the PowerResistance and the Power
-
7/29/2019 Deep Challenges.
40/45
Resistance and the PowerResistance and the Power
Distribution ProblemDistribution Problem
Source: Simplex
Requires fast and accurate peak current predictionRequires fast and accurate peak current prediction
Heavily influenced by packaging technologyHeavily influenced by packaging technology
BeforeBefore AfterAfter
-
7/29/2019 Deep Challenges.
41/45
InductanceInductance
qq
Transmission line effectsTransmission line effectscause overshooting andcause overshooting and non-non-
monotonic behaviormonotonic behavior
qq Wave propagation putsWave propagation puts
minimum boundminimum bound on delayon delay
and may require terminationand may require termination
qq Only to be considered whenOnly to be considered when
thethe rise and fall timesrise and fall times of theof the
signal are comparable to thesignal are comparable to the
time-of-flight of the line, andtime-of-flight of the line, andwhen thewhen the resistanceresistance of theof the
wire is small (< 5Zwire is small (< 5Z00))Clock signals in 400 MHz IBM Microprocessor
(measured using e-beam prober) [Restle98]
-
7/29/2019 Deep Challenges.
42/45
Dealing with InductanceDealing with Inductance
Bus linesV
dd
GND
Inductance hard to analyze accuratelyInductance hard to analyze accurately
Structural design approaches might be more appropriateStructural design approaches might be more appropriate
DEC approach in Alpha 21264 use entireDEC approach in Alpha 21264 use entire planes of metal asplanes of metal as
referencesreferences ((VVdddd and GND) to reduce inductance by controlling theand GND) to reduce inductance by controlling the
return pathreturn path
- Loss of routing density, added metal layers reduce yield &- Loss of routing density, added metal layers reduce yield &
raise costsraise costs
Another industry approach usesAnother industry approach uses shield wiresshield wires every ~ 3 signal linesevery ~ 3 signal lines
in a dense arrayin a dense array
-
7/29/2019 Deep Challenges.
43/45
Inductive Noise -Inductive Noise - LdiLdi//dtdt
PentiumPro
Pentium
486
3861.E+00
1.E+01
1.E+02
1.E+03
1.E+04
1.E+05
1.E+061.E+07
1.E+08
1.5 0.8 0.35 0.18 0.1
di/dt
inAU
di/dt noiseincreases
Source: Intel
-
7/29/2019 Deep Challenges.
44/45
Inductive Noise -Inductive Noise - LdiLdi//dtdt
CHIPSUPPLY
Bonding
Wire
Board
Wiring
Cd
Decoupling
Capacitor
+
-
qq DecouplingDecoupling
capacitance problemcapacitance problem
becoming extremebecoming extreme DEC 21164: 128DEC 21164: 128
nFnF of on-chipof on-chip
decouplingdecoupling
DEC 21264: addDEC 21264: addflip-chipflip-chip decouplingdecoupling
capacitor chipcapacitor chip
qq Mostly solvable byMostly solvable by
advances inadvances in
packagingpackagingtechnology and noveltechnology and novel
timing approachestiming approaches
-
7/29/2019 Deep Challenges.
45/45
SummarySummaryqq Deep-Deep-submicronsubmicron effects impact reliability,effects impact reliability,
performance, and power dissipationperformance, and power dissipation
qq The major device challenge: low-voltage, non-leakyThe major device challenge: low-voltage, non-leaky
designdesign
qq Interconnect starts playing a dominant roleInterconnect starts playing a dominant role
capacitivecapacitive: the increasing impact of: the increasing impact of interwireinterwire capacitancecapacitance resistive: global wire delay and power distributionresistive: global wire delay and power distribution
inductive: mostly supply noise, but transmission line effectsinductive: mostly supply noise, but transmission line effects
are emergingare emerging
qq Requires a new generation of fast and accurateRequires a new generation of fast and accurateanalysis toolsanalysis tools
qq But most of all But most of all novel design methodologies andnovel design methodologies and
concepts producing predictable or insensitive designconcepts producing predictable or insensitive design