Lec_03_SRAM

download Lec_03_SRAM

of 16

Transcript of Lec_03_SRAM

  • 8/8/2019 Lec_03_SRAM

    1/16

  • 8/8/2019 Lec_03_SRAM

    2/16

    2

    Implementing Registers in CMOS (cont.)

    Another example: D latch (register)Uses transmission gateWhen WR asserted, write operation will take placeStack D latch structures to get n-bit register

    D

    WRQ Q

    WR

    EE 5324 -VLSI Design II - Kia Bazargan 5

    WR

    WR

    Shift Registers: Idea

    Shift registers are used for iteratively shifting dataUsed in pipelining, bit-by-bit processing, etc.

    DD1

    D1 D2

    D2 D3

    D3

    D1 D2 D3 Problem?

    When clock goes high, the data will traverse all

    EE 5324 -VLSI Design II - Kia Bazargan 6

    the shift registers chain in one clock cycle!Solution: use non overlapping clocks1 and 2.1 used by odd gates,2 by even gates (usexmission gates after D1, D2, D3).

    Memory Architecture: the Big Picture

    Address: which one of the M words to access Data: the N bits of the word are read/written

    Storagecells

    Word 0Word 1

    Word M-2Word M-1

    . . .

    S0S1S

    2

    SM-2S

    ... Word 0Word 1

    . . .

    S0S1S2

    ...

    D e c o d

    ...

    A0A1

    Addressdecoder

    EE 5324 -VLSI Design II - Kia Bazargan

    7

    wordselectlines

    N bits

    -Word M-2Word M-1

    N bits

    M-2SM-1

    r k-1

    k = log 2 (M)

    Memory Access Timing: the Big Picture

    Timing:Send address on the address lines,wait for the word line to become stableRead/write data on the data lines

    READ

    WRITE

    Read Access Read Access

    Read Cycle

    Write Cycle

    EE 5324 -VLSI Design II - Kia Bazargan

    8

    [Prentice Hall]

    DATA

    Data Valid Data Written

    Write Access

  • 8/8/2019 Lec_03_SRAM

    3/16

  • 8/8/2019 Lec_03_SRAM

    4/16

    4

    Hierarchical Memory Structure

    Taking the idea one step furtherShorter wires within each block Enable only one block addr decoder power savings

    Blk EN

    RowAddress

    ColumnAddress

    Block

    EE 5324 -VLSI Design II - Kia Bazargan 13

    Blk ENBlk ENAddress

    Global Bus

    SAmp/Drv

    Global drivers/sense amplifiers

    [Rab96] p. 558

    Decreasing Word Line Delay

    Word line delay comes into play!We used to have long busses, made 2D arrayshorter bussesBut, longer word lines!

    How to decrease the delay on the word lines?Break the word line by inserting buffersPlace the decoder in the middlePolysilicon word line Polysilicon word line

    EE 5324 -VLSI Design II - Kia Bazargan 14

    [Prentice Hall]

    (a) Drive the word linefrom both sides

    Metal word line

    (b) Use metal bypass

    Metal bypass

    Decreasing Word Line Delay (cont.)

    Place the decoder in the middle Add buffers to outputs of decoder

    d

    ecode

    memorycell array

    memorycell array

    EE 5324 -VLSI Design II - Kia Bazargan 15

    [Hauck]

    r

    Address lines

    k

    Array-Structured Memory Architecture

    16

  • 8/8/2019 Lec_03_SRAM

    5/16

    5

    Semiconductor MemoryClassification

    Non-VolatileRead-Write Memory Read-Write

    Memory

    Read-Only Memory

    EPROM

    E2PROM

    FLASH

    RandomAccess

    Non-RandomAccess

    SRAM

    Mask-Programmed

    Programmable (PROM)

    FIFO

    17

    DRAMShift Register

    CAM

    LIFO

    Read-Write Memories (RWM) Basic storage elements of semiconductor

    memoryRAM

    SRAM DRAM

    18

    : ce as ga n , , , , og ccompatible, differential

    DRAM: cell has no gain , 1T, refresh, slow, DRAM process,single ended, DENSE

    Memory Scaling Trend

    19

    High density is the primary design goal for memories Low voltage operation is essential for low power

    Itoh, IBM R&D, 2003

    Memory Scaling Trend

    Long retention timelow Ioff

    High Vt is required Fast access time

    high Ion High Vgs-Vt is

    required

    20

    scaled downaggressively for lowpower consumption

    Itoh, IBM R&D, 2003

  • 8/8/2019 Lec_03_SRAM

    6/16

    6

    Why SRAMs are Important

    0.18 mCoreLogic

    Cache

    0.13 m

    0.09 m

    Cache

    CoreLogic

    Cache

    21

    Logic

    Memories have better power efficiency compared to logic ~9.9B out of 10B transistors will be used for SRAMs Company with better SRAM design will dominate

    Why SRAMs are Important

    1.2

    1.4

    e d I O

    N

    NMOSPMOS

    WL Areat V

    =

    Taur, Ning

    0.4

    0.6

    0.8

    1.0

    0.01 0.1 1 10 100

    Normalized I OFF

    N o r m a

    l i z

    100X

    2X

    150nm, 110 C

    22

    Area is the number one concern minimum sized devices Smaller devices have larger variation Delay variation, stability, leakage is a problem Central limit theorem doesnt hold ( / )

    Positive Feedback: Bi-StabilityV i1 V o2V o1 = V i2

    V o2 = V i1

    Vi1

    A

    Vo2

    o1 i2

    23

    C

    B

    Vi1 = Vo2

    i2 = o1

    Meta-Stability

    A V

    o 1A

    V o 1

    C

    B

    V i 2 =

    C

    B

    V i 2

    =

    24

    Gain should be larger than 1 in the transition region

    V i 1 = V o 2

    V i 1 = V o 2

  • 8/8/2019 Lec_03_SRAM

    7/16

    7

    SRAM Memory CellWL

    NMOS access transistors

    1

    BL BLB

    25

    margins One wordline to access cell Two bit lines (BL, BLB) to carry the data Almost minimum size transistors for small cell area

    SRAM Read Operation

    0

    WL1

    Both bit lines are precharged to Vdd

    Wordline is fired for one of the cells on bit line

    BL BLB11

    26

    Cell pulls down either BL or BLB Sense amp regenerates the differential signal Data should not flip after read access Driver TR must be stronger than access TR

    SRAM Read Operation

    27

    For high density, large number of cells share bitline andwordline Subarray organization for 32Kb: 128 WLs, 256BLs

    Murmann class notes

    SRAM Read Operation

    bitlinebitline V C

    WL

    BL

    cell I e ayne =

    C bitline is large due to large number of cells attached

    BLB

    SAout

    m

    28

    cell s sma ue o g ens y ce s Vbitline has to be minimized for high speed

    < 100mV bitline voltage difference generated by SRAM cell Let the sense amplifier finish the job Increased noise sensitivity, circuit complexity

  • 8/8/2019 Lec_03_SRAM

    8/16

    8

    SRAM Read Operation: Precharge

    29

    Vdd-VtnVdd

    SRAM Read Operation: Precharge Option (a)

    Similar to dynamic logic precharge Balance transistor to equalize bitline voltages Short wordline pulse required to limit bitline swing

    Option (b) Pseudo-NMOS type circuit Bitline voltage clamped during read

    Option (c) NMOS pullup instead of PMOS

    -

    30

    dd Cant operate at low V dd

    VddVdd

    Vdd

    SRAM Cell Read Margin

    Vdd 0 Vx

    When cell is not accessed (WL=0) Data is safely kept inside the cell

    31

    g no se marg n When cell is accessed (WL=V dd )

    Access transistor acts as a noise source Data 0 is pulled up to V x Cell data can flip if V x rises above V tn

    VDDVDD

    VDD

    Static Noise Margin

    0.2

    0.4

    0.6

    0.8

    1

    V ( Q B )

    VQVQB

    E. Seevinck,1987, JSSC

    0.6

    0.8

    1

    Q B )

    Good SNM0

    0 0.2 0.4 0.6 0.8 1V(Q)

    32

    Destructive read problemThe size of the largest squareenclosed in the butterfly curves= read static noise margin

    0

    0.2

    0.4

    0 0.2 0.4 0.6 0.8 1V(Q)

    V (

    Bad SNM

  • 8/8/2019 Lec_03_SRAM

    9/16

    9

    CMOS SRAM Analysis (Read)WL

    BL

    V DD M 4BL

    M 5

    M 6

    M 1V DD V DD V DD

    Q = 1Q = 0

    C bit C bit

    33

    Techniques to Improve Read Margin

    Cell beta ratio =

    (W/L) drv / (W/L) access

    J. Rabaey

    34

    margin But remember, area is the number one constraint in

    memory design Increasing cell size a not a good trade off

    Techniques to Improve Read Margin

    High V t transistors Internal node on low

    side needs to riseto V or more

    Virtually never happens when V t is

    larger than half V dd Cell is extremely

    stable at ultra-lowpower design point SNM low V t

    35

    relaxed smaller driver and larger access TR can beused for faster readand write

    SNM high V t

    Techniques to Improve Read Margin

    Boosted cell supply Supply voltage of

    Vdd +

    ce s g er than outside

    Makes driver stronger

    than access,suppressing the rise inthe low side

    Effectivel im roves

    VddVdd

    Vdd0

    36

    the beta ratio

    Driver NMOS can bedownsized, decreasingcell size

  • 8/8/2019 Lec_03_SRAM

    10/16

    10

    SRAM Write OperationWL

    1

    1

    BL BLB01

    37

    Launch the write data on BL and BLB Word line signal is fired Low bit line value flips cell data Access TR must be stronger than PMOS load

    CMOS SRAM Analysis (Write)

    M 4

    V DD

    WL

    ( )4 / LW PR =

    BL = 1 BL = 0

    Q = 0Q = 1

    M 1

    M 5

    M 6

    V DD

    6

    38

    SRAM Cell Write Margin

    0Vdd

    VddJ. Rabaey

    Vdd 0 0 Vdd

    = (W/L) pmos / (W/L) access

    39

    Access transistor must be stronger than PMOS to pull thebelow the trip point (typical pull-up ratio ~ 1.5)

    To avoid cell size increase, correct pull-up ratio achievedby controlling V tn and V tp

    Techniques to Improve Write Margin Sizing: access TR vs. PMOS in latch Higher WL voltage for access TR Virtual VDD

    Higher voltage

    Sizing

    40

  • 8/8/2019 Lec_03_SRAM

    11/16

    11

    6T-SRAM Layout Until 90nm

    V DD

    BL BLB

    GND

    41

    Compact cellBitlines: M2Wordline: strapped in M3

    6T-SRAM Layout From 65nm

    42

    6T versus 4T SRAM

    6T SRAM CellSupply current is limitedo e ea age curren o

    transistors in the stablestate

    4T SRAM Cell

    43

    g egree ocompactnessHigh power consumption

    RAM Variations Many variations to the basic 6T SRAM cell More functionality, smaller cells

    True multi-ported cell Content addressable memory (CAM)

    4T memory cell 3T memory cell 2T memory cell

    44

    1T DRAM cell

  • 8/8/2019 Lec_03_SRAM

    12/16

    12

    Dual Read or Single Write CellWL0WL1

    Two wordlines, one for each access transistor BL BLB

    45

    ma ncrease n ce s ze Can either

    read two different cells in one cycle or write to one cell

    Multi Port CellWL0

    BL0 BLB0WL1

    BL1 BLB1

    46

    ac por as separa e a ress Memory access bandwidth is twice (ideally) Write through: data written can be read by

    another port in the very same cycle

    Multi-Port RAM Cells Array 7 words deep,

    2 wide words,dual port mem

    SA0

    word j and write

    d1d0 to word k

    simultaneously:Set SA j=1, and allother SAs=0Set SB =1, and all

    SB0

    . . . . . .

    SA1

    SB1

    SA7

    EE 5324 -VLSI Design II - Kia Bazargan 47

    other SBs=0Sense the values onbus_A0 and bus_A1Write d 1d0 tobus_B0 and bus_B1

    b u s _B

    0

    b u s _A

    0

    b u s _A

    0

    b u s _B

    0

    b u s _B 1

    b u s _A 1

    b u s _A 1

    b u s _B 1

    SB7

    Content Addressable Memory (CAM) Instead of address, provide data find a match

    Applications: cache, physical particle collider

    Needs Encoder:Inverse function of decoderTake a one-hot collection of signals and encode them

    en

    m bits

    EE 5324 -VLSI Design II - Kia Bazargan 48

    [Hauck]

    coder

    n

    2n rows

    m

    addressablememory

    cell array

  • 8/8/2019 Lec_03_SRAM

    13/16

    13

    Content Addressable Memory Cell

    Read and write like normal 6T memory cell Match signal is precharged to 1,

    ulled to 0 if no matchSend data on bit and data on bit for matchingMatch remains 1 iff all bits in word match

    rowselect

    EE 5324 -VLSI Design II - Kia Bazargan 49

    [Hauck]

    match

    bit bit'

    Encoders

    enco

    contentaddressable

    memoryer

    ce array

    rowselect

    EE 5324 -VLSI Design II - Kia Bazargan 50

    match

    bit bit'

    Smaller RAM Cells

    Internal nodes dont go toVdd Cell wont work at low Vdd

    Need 2 wordlines, read WLand write WL

    Can have 1 or 2 bitlines

    51

    degraded

    Effective strength of NMOSdriver is reduced

    Refresh needed

    ea r e Not very small, since it has

    more wires

    1-T DRAM CellWL

    BL

    WLWrite 1 Read 1

    1

    C S

    C BL

    V DD 2 V T X

    sensing

    BL

    GND

    V DD

    V DD /2

    V DD /2

    -

    52

    Write: C S is charged or discharged by asserting WL and BL.Read: Charge redistribution takes places between bit line and storage capacitance

    Voltage swing is small; typically around 250 mV.

    V BL V PRE V BIT V PREC S

    C S C BL+------------= =V

  • 8/8/2019 Lec_03_SRAM

    14/16

    14

    DRAM Cell Observations1T DRAM requires a sense amplifier for each bit line,

    due to charge redistribution read-out.

    SRAM cells.The read-out of the 1T DRAM cell is destructive; read

    and refresh operations are necessary for correctoperation.

    When writing a 1 into a DRAM cell, a threshold

    53

    .bootstrapping the word lines to a higher value than V DD

    Sense Amp Operation

    V (1)V BL

    D V (1)V PRE

    54

    t Sense amp activatedWord line activated

    1-T DRAM Cell

    M1 word

    Capacitor

    line

    Diffusedbit line

    Polysilicongate

    Polysiliconplate

    Metal word line

    Poly

    SiO 2

    Field Oxiden + n +Inversion layerinduced byplate bias

    Poly

    55

    Uses Polysilicon-Diffusion CapacitanceExpensive in Area

    Cross-section Layout

    Dynamic RAM 1-Transistor Cell: Layout

    Spring 2006 EE 5324 - VLSI Design II - Kia Bazargan 56

    [Prentice Hall]

  • 8/8/2019 Lec_03_SRAM

    15/16

    15

    Advanced 1-T DRAM CellsCapacitor dielectric layerCell plate

    Word lineInsulating Layer

    Cell Plate Si

    Capacitor Insulator

    Storage Node Poly

    Refilling Poly

    Si Substrate

    IsolationTransfer gateStorage electrode

    57

    Trench Cell Stacked-capacitor Cell

    Good References on RAM K. Itoh, VLSI Memory Chip Design , Springer-Verlag

    New York, LLC . , . , . , . ,

    Review and future prospects of low-voltage RAM circuits , Vol. 47, No. 5/6, 2003, IBM J R&D

    R. W. Mann, W. W. Abadeer, M. J. Breitwisch, O. Bula,et al, Ultralow-power SRAM technology , Vol. 47, No. 5/6,2003, IBM J R&D

    58

    59 60

  • 8/8/2019 Lec_03_SRAM

    16/16