
    Section 6

    Memory


    Chapter 6 Objectives

Master the concepts of hierarchical memory organization.

Understand how each level of memory contributes to system performance, and how the performance is measured.

Master the concepts behind cache memory, virtual memory, memory segmentation, paging and address translation.


    6.1 Introduction

Memory lies at the heart of the stored-program computer.

In previous chapters, we studied the components from which memory is built and the ways in which memory is accessed by various ISAs. In this chapter, we focus on memory organization.

A clear understanding of these ideas is essential for the analysis of system performance.


    6.2 Types of Memory

There are two kinds of main memory: random access memory, RAM, and read-only memory, ROM.

There are two types of RAM: dynamic RAM (DRAM) and static RAM (SRAM).

DRAM consists of capacitors that slowly leak their charge over time. Thus, they must be refreshed every few milliseconds to prevent data loss.

DRAM is cheap memory owing to its simple design.


SRAM consists of circuits similar to the D flip-flop that we studied in Chapter 3.

SRAM is very fast memory, and it doesn't need to be refreshed like DRAM does. It is used to build cache memory, which we will discuss in detail later.

ROM does not need to be refreshed, either. In fact, it needs very little charge to retain its memory.

ROM is used to store permanent, or semi-permanent, data that persists even while the system is turned off.


    6.3 The Memory Hierarchy

Generally speaking, faster memory is more expensive than slower memory.

To provide the best performance at the lowest cost, memory is organized in a hierarchical fashion.

Small, fast storage elements are kept in the CPU; larger, slower main memory is accessed through the data bus.

Larger, (almost) permanent storage in the form of disk and tape drives is still further from the CPU.


    This storage organization can be thought of as a pyramid:


We are most interested in the memory hierarchy that involves registers, cache, main memory, and virtual memory.

Registers are storage locations available on the processor itself.

Virtual memory is typically implemented using a hard drive; it extends the address space from RAM to the hard drive.

Virtual memory provides more space; cache memory provides speed.


To access a particular piece of data, the CPU first sends a request to its nearest memory, usually cache.

If the data is not in cache, then main memory is queried. If the data is not in main memory, then the request goes to disk.

Once the data is located, the data and a number of its nearby data elements are fetched into cache memory.


    This leads us to some definitions.

A hit is when data is found at a given memory level; a miss is when it is not found.

The hit rate is the percentage of time data is found at a given memory level; the miss rate is the percentage of time it is not. Miss rate = 1 - hit rate.

The hit time is the time required to access data at a given memory level.

The miss penalty is the time required to process a miss, including the time that it takes to replace a block of memory plus the time it takes to deliver the data to the processor.


An entire block of data is copied into cache after a miss, because the principle of locality tells us that once a byte is accessed, it is likely that a nearby data element will be needed soon.

There are three forms of locality:

Temporal locality: recently accessed data elements tend to be accessed again.

Spatial locality: accesses tend to cluster.

Sequential locality: instructions tend to be accessed sequentially.


    6.4 Cache Memory

The purpose of cache memory is to speed up accesses by storing recently used data closer to the CPU, instead of storing it in main memory.

Although cache is much smaller than main memory, its access time is a fraction of that of main memory.

Unlike main memory, which is accessed by address, cache is typically accessed by content; hence, it is often called content addressable memory.

Because of this, a single large cache memory isn't always desirable: it takes longer to search.


The content that is addressed in content addressable cache memory is a subset of the bits of a main memory address called a field.

Many blocks of main memory map to a single block of cache. A tag field in the cache block distinguishes one cached memory block from another.

A valid bit indicates whether the cache block is being used.

An offset field points to the desired data in the block.


The simplest cache mapping scheme is direct mapped cache.

In a direct mapped cache consisting of N blocks of cache, block X of main memory maps to cache block Y = X mod N.

Thus, if we have 10 blocks of cache, block 7 of cache may hold blocks 7, 17, 27, 37, ... of main memory.

This mapping is illustrated by the sketch below.
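A minimal sketch of this modular mapping (plain Python; the 10-block cache matches the example above and is otherwise hypothetical):

```python
# Direct mapped cache: main memory block X maps to cache block X mod N.
N = 10  # number of cache blocks (hypothetical, matching the example above)

def cache_block(memory_block: int) -> int:
    """Return the cache block that a main memory block maps to."""
    return memory_block % N

# Memory blocks 7, 17, 27, and 37 all contend for cache block 7.
print([cache_block(x) for x in (7, 17, 27, 37)])  # [7, 7, 7, 7]
```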


EXAMPLE 6.1 Consider a word-addressable main memory consisting of four blocks, and a cache with two blocks, where each block is 4 words.

This means Blocks 0 and 2 of main memory map to Block 0 of cache, and Blocks 1 and 3 of main memory map to Block 1 of cache.

Using the tag, block, and offset fields, we can see how main memory maps to cache as follows.


First, we need to determine the address format for mapping.

Each block is 4 words, so the offset field must contain 2 bits; there are 2 blocks in cache, so the block field must contain 1 bit; this leaves 1 bit for the tag (a main memory address has 4 bits because there are a total of 2^4 = 16 words).
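A short sketch of this decomposition (plain Python; the field widths are those derived in Example 6.1):

```python
# Example 6.1: 4-bit word address = 1-bit tag | 1-bit block | 2-bit offset.
OFFSET_BITS = 2  # 4 words per block  -> 2^2
BLOCK_BITS = 1   # 2 cache blocks     -> 2^1

def split_address(addr: int):
    """Split a 4-bit main memory address into (tag, block, offset)."""
    offset = addr & 0b11
    block = (addr >> OFFSET_BITS) & 0b1
    tag = addr >> (OFFSET_BITS + BLOCK_BITS)
    return tag, block, offset

# Address 9 = 0b1001: tag 1, cache block 0, offset 1.
print(split_address(0b1001))  # (1, 0, 1)
```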


In summary, direct mapped cache maps main memory blocks in a modular fashion to cache blocks. The mapping depends on:

The number of bits in the main memory address (how many addresses exist in main memory)

The number of blocks in cache (which determines the size of the block field)

How many addresses (either bytes or words) are in a block (which determines the size of the offset field)

A generic field-width calculation is sketched below.
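A minimal sketch of that field-width calculation (plain Python; the helper name is my own, and the sizes are assumed to be powers of two):

```python
from math import log2

def direct_mapped_fields(address_bits: int, cache_blocks: int, block_size: int):
    """Return (tag, block, offset) field widths for a direct mapped cache."""
    offset_bits = int(log2(block_size))    # addresses per block
    block_bits = int(log2(cache_blocks))   # blocks in cache
    tag_bits = address_bits - block_bits - offset_bits
    return tag_bits, block_bits, offset_bits

# Example 6.1: 4-bit addresses, 2 cache blocks, 4 words per block.
print(direct_mapped_fields(4, 2, 4))  # (1, 1, 2)
```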


Suppose that instead of placing memory blocks in specific cache locations based on memory address, we could allow a block to go anywhere in cache.

In this way, cache would have to fill up before any blocks are evicted.

This is how fully associative cache works.

A memory address is partitioned into only two fields: the tag and the word.


Suppose, as before, we have 14-bit memory addresses and a cache with 16 blocks, each block of size 8. The field format of a memory reference is an 11-bit tag field followed by a 3-bit word field (since 2^3 = 8).

When the cache is searched, all tags are searched in parallel to retrieve the data quickly.

This requires special, costly hardware.


Set associative cache combines the ideas of direct mapped cache and fully associative cache.

An N-way set associative cache mapping is like direct mapped cache in that a memory reference maps to a particular location in cache.

Unlike direct mapped cache, a memory reference maps to a set of several cache blocks, similar to the way in which fully associative cache works.

Instead of mapping anywhere in the entire cache, a memory reference can map only to the subset of cache slots.


The number of cache blocks per set in set associative cache varies according to overall system design.

For example, a 2-way set associative cache can be conceptualized as shown in the schematic below. Each set contains two different memory blocks.

[Schematic: logical view and linear view of a 2-way set associative cache]


In set associative cache mapping, a memory reference is divided into three fields: tag, set, and offset.

As with direct-mapped cache, the offset field chooses the word within the cache block, and the tag field uniquely identifies the memory address.

The set field determines the set to which the memory block maps.


EXAMPLE 6.5 Suppose we are using 2-way set associative mapping with a word-addressable main memory of 2^14 words and a cache with 16 blocks, where each block contains 8 words.

Cache has a total of 16 blocks, and each set has 2 blocks, so there are 8 sets in cache.

Thus, the set field is 3 bits, the offset field is 3 bits, and the tag field is 8 bits, as the sketch below illustrates.
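A small sketch of this decomposition (plain Python; field widths from Example 6.5):

```python
# Example 6.5: 14-bit address = 8-bit tag | 3-bit set | 3-bit offset.
OFFSET_BITS = 3  # 8 words per block       -> 2^3
SET_BITS = 3     # 16 blocks / 2 ways = 8 sets -> 2^3

def split_address(addr: int):
    """Split a 14-bit address into (tag, set, offset) for the 2-way cache."""
    offset = addr & 0b111
    set_idx = (addr >> OFFSET_BITS) & 0b111
    tag = addr >> (OFFSET_BITS + SET_BITS)
    return tag, set_idx, offset

print(split_address(0b10001110101001))  # (142, 5, 1): tag 10001110, set 101, offset 001
```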


The replacement policy that we choose depends upon the locality that we are trying to optimize; usually, we are interested in temporal locality.

A least recently used (LRU) algorithm keeps track of the last time that a block was accessed and evicts the block that has been unused for the longest period of time.

The disadvantage of this approach is its complexity: LRU has to maintain an access history for each block, which ultimately slows down the cache. A small software model of LRU follows below.
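A minimal software model of one LRU-managed cache set (plain Python; real caches implement this in hardware, so this is only an illustration):

```python
from collections import OrderedDict

class LRUSet:
    """Model of one cache set with LRU replacement."""
    def __init__(self, ways: int):
        self.ways = ways
        self.blocks = OrderedDict()  # tag -> data, least recently used first

    def access(self, tag, data=None) -> bool:
        """Access a block; return True on a hit, False on a miss."""
        if tag in self.blocks:              # hit: mark most recently used
            self.blocks.move_to_end(tag)
            return True
        if len(self.blocks) >= self.ways:   # miss with a full set:
            self.blocks.popitem(last=False)  # evict the least recently used
        self.blocks[tag] = data             # fill with the new block
        return False

s = LRUSet(ways=2)
print([s.access(t) for t in (1, 2, 1, 3, 2)])  # [False, False, True, False, False]
```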


First-in, first-out (FIFO) is a popular cache replacement policy.

In FIFO, the block that has been in the cache the longest is evicted, regardless of when it was last used.

A random replacement policy does what its name implies: it picks a block at random and replaces it with a new block.

Random replacement can certainly evict a block that will be needed often or needed soon, but it never thrashes.


The performance of hierarchical memory is measured by its effective access time (EAT).

EAT is a weighted average that takes into account the hit ratio and the relative access times of successive levels of memory.

The EAT for a two-level memory is given by:

EAT = H × Access_C + (1 - H) × Access_MM

where H is the cache hit rate, and Access_C and Access_MM are the access times for cache and main memory, respectively.


For example, consider a system with a main memory access time of 200ns supported by a cache having a 10ns access time and a hit rate of 99%.

Suppose access to cache and main memory occurs concurrently. (The accesses overlap.)

The EAT is:

0.99(10ns) + 0.01(200ns) = 9.9ns + 2ns = 11.9ns.


For the same system, if the accesses do not overlap, the EAT is:

0.99(10ns) + 0.01(10ns + 200ns) = 9.9ns + 2.1ns = 12ns.

This equation for determining the effective access time can be extended to any number of memory levels, as we will see in later sections. Both calculations are checked in the short sketch below.
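A quick check of both calculations (plain Python; times in nanoseconds, and the helper name is my own):

```python
def eat(hit_rate: float, t_cache: float, t_mem: float, overlapped: bool = True) -> float:
    """Two-level effective access time, in the same units as the inputs."""
    miss_time = t_mem if overlapped else t_cache + t_mem
    return hit_rate * t_cache + (1 - hit_rate) * miss_time

print(eat(0.99, 10, 200, overlapped=True))   # ~11.9 ns (overlapped accesses)
print(eat(0.99, 10, 200, overlapped=False))  # ~12.0 ns (sequential accesses)
```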


Caching depends upon programs exhibiting good locality.

Some object-oriented programs have poor locality owing to their complex, dynamic structures.

Arrays stored in column-major rather than row-major order can be problematic for certain cache organizations.

With poor locality, caching can actually cause performance degradation rather than performance improvement.


Cache replacement policies must take into account dirty blocks, those blocks that have been updated while they were in the cache.

Dirty blocks must be written back to memory. A write policy determines how this will be done.

There are two types of write policies, write through and write back.

Write through updates cache and main memory simultaneously on every write.


Write back (also called copyback) updates memory only when the block is selected for replacement.

The disadvantage of write through is that memory must be updated with each cache write, which slows down the access time on updates. This slowdown is usually negligible, because the majority of accesses tend to be reads, not writes.

The advantage of write back is that memory traffic is minimized, but its disadvantage is that memory does not always agree with the value in cache, causing problems in systems with many concurrent users. The sketch below contrasts the two policies.
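A toy contrast of the two policies (plain Python; the `memory` list, `Block` class, and dirty-bit handling are hypothetical stand-ins, not a real cache controller):

```python
memory = [0] * 16  # stand-in for main memory

class Block:
    """One cached block with a dirty bit."""
    def __init__(self, addr: int):
        self.addr, self.data, self.dirty = addr, memory[addr], False

def write_through(block: Block, value: int) -> None:
    block.data = value
    memory[block.addr] = value  # memory is updated on every write

def write_back(block: Block, value: int) -> None:
    block.data = value
    block.dirty = True          # memory is updated only at eviction

def evict(block: Block) -> None:
    if block.dirty:             # copyback of a dirty block
        memory[block.addr] = block.data

b = Block(3)
write_back(b, 42)
print(memory[3])  # 0  (memory is stale until the block is evicted)
evict(b)
print(memory[3])  # 42
```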


The cache we have been discussing is called a unified or integrated cache, where both instructions and data are cached.

Many modern systems employ separate caches for data and instructions. This is called a Harvard cache.

The separation of data from instructions provides better locality, at the cost of greater complexity.

Simply making the cache larger provides about the same performance improvement without the complexity.


Cache performance can also be improved by adding a small associative cache to hold blocks that have been evicted recently. This is called a victim cache.

A trace cache is a variant of an instruction cache that holds decoded instructions for program branches, giving the illusion that noncontiguous instructions are really contiguous.


Most of today's small systems employ multilevel cache hierarchies.

The levels of cache form their own small memory hierarchy.

Level 1 cache (8KB to 64KB) is situated on the processor itself. Access time is typically about 4ns.

Level 2 cache (64KB to 2MB) may be on the motherboard, or on an expansion card. Access time is usually around 15-20ns.


If the cache system uses an inclusive cache, the same data may be present at multiple levels of cache.

Strictly inclusive caches guarantee that all data in a smaller cache also exists at the next higher level.

Exclusive caches permit only one copy of the data.

The tradeoffs in choosing one over the other involve weighing the variables of access time, memory size, and circuit complexity.


    6.5 Virtual Memory

Cache memory enhances performance by providing faster memory access speed.

Virtual memory enhances performance by providing greater memory capacity, without the expense of adding main memory.

Instead, a portion of a disk drive serves as an extension of main memory.

If a system uses paging, virtual memory partitions main memory into individually managed page frames that are written (or paged) to disk when they are not immediately needed.


A physical address is the actual memory address of physical memory.

Programs create virtual addresses that are mapped to physical addresses by the memory manager.

Page faults occur when a logical address requires that a page be brought in from disk.

Memory fragmentation occurs when the paging process results in the creation of small, unusable clusters of memory addresses.


Main memory and virtual memory are divided into equal-sized pages.

The entire address space required by a process need not be in memory at once. Some parts can be on disk, while others are in main memory.

Further, the pages allocated to a process do not need to be stored contiguously, either on disk or in memory.

In this way, only the needed pages are in memory at any time; the unnecessary pages are in slower disk storage.


Information concerning the location of each page, whether on disk or in memory, is maintained in a data structure called a page table (shown below). There is one page table for each active process.


When a process generates a virtual address, the operating system translates it into a physical memory address.

To accomplish this, the virtual address is divided into two fields: a page field and an offset field.

The page field determines the page location of the address, and the offset indicates the location of the address within the page.

The logical page number is translated into a physical page frame through a lookup in the page table.


If the valid bit is zero in the page table entry for the logical address, the page is not in memory and must be fetched from disk. This is a page fault.

If necessary, a page is evicted from memory and is replaced by the page retrieved from disk, and the valid bit is set to 1.

If the valid bit is 1, the virtual page number is replaced by the physical frame number.

The data is then accessed by adding the offset to the physical frame number.


Suppose we have the page table shown below. What happens when the CPU generates address 5459₁₀ = 1010101010011₂ = 1553₁₆?


The high-order 3 bits of the virtual address, 101 (5₁₀), provide the page number in the page table.


The address 1010101010011₂ is converted to physical address 010101010011₂ = 1363₁₀, because the page field 101 is replaced by frame number 01 through a lookup in the page table.
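A sketch of this translation in code (plain Python; the 3-bit page field and 10-bit offset follow from the example, while the one-entry page table here is a hypothetical stand-in):

```python
OFFSET_BITS = 10            # 1K words per page -> 10-bit offset
page_table = {0b101: 0b01}  # hypothetical: virtual page 5 -> frame 1

def translate(vaddr: int) -> int:
    """Translate a 13-bit virtual address into a physical address."""
    page = vaddr >> OFFSET_BITS
    offset = vaddr & ((1 << OFFSET_BITS) - 1)
    frame = page_table[page]  # a missing entry would mean a page fault
    return (frame << OFFSET_BITS) | offset

print(translate(5459))  # 1363
```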


What happens when the CPU generates address 1000000000100₂?


We said earlier that effective access time (EAT) takes all levels of memory into consideration.

Thus, virtual memory is also a factor in the calculation, and we also have to consider page table access time.

Suppose a main memory access takes 200ns, the page fault rate is 1%, and it takes 10ms to load a page from disk. We have:

EAT = 0.99(200ns + 200ns) + 0.01(10ms) = 100,396ns.


Even if we had no page faults, the EAT would be 400ns, because memory is always read twice: first to access the page table, and second to load the page from memory.

Because page tables are read constantly, it makes sense to keep them in a special cache called a translation look-aside buffer (TLB).

TLBs are a special associative cache that stores the mapping of virtual pages to physical pages. The address lookup steps when a TLB is involved are listed below.


TLB lookup process:

1. Extract the page number from the virtual address.

2. Extract the offset from the virtual address.

3. Search for the virtual page number in the TLB.

4. If the (virtual page #, page frame #) pair is found in the TLB, add the offset to the physical frame number and access the memory location.

5. If there is a TLB miss, go to the page table to get the necessary frame number. If the page is in memory, use the corresponding frame number and add the offset to yield the physical address.

6. If the page is not in main memory, generate a page fault and restart the access when the page fault is complete.

A software sketch of these steps follows below.
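A software sketch of these steps (plain Python; the TLB and page table dictionaries are hypothetical stand-ins for the hardware structures, reusing the field widths from the earlier example):

```python
OFFSET_BITS = 10
tlb = {}             # virtual page -> frame (a tiny associative cache)
page_table = {5: 1}  # virtual page -> frame, for pages resident in memory

def lookup(vaddr: int) -> int:
    page = vaddr >> OFFSET_BITS                # steps 1-2: split the address
    offset = vaddr & ((1 << OFFSET_BITS) - 1)
    frame = tlb.get(page)                      # step 3: search the TLB
    if frame is None:                          # step 5: TLB miss, use page table
        if page not in page_table:
            raise LookupError("page fault")    # step 6: fetch from disk, retry
        frame = page_table[page]
        tlb[page] = frame                      # cache the translation
    return (frame << OFFSET_BITS) | offset     # step 4: form the physical address

print(lookup(5459))  # 1363 (TLB miss, then page table hit)
print(lookup(5459))  # 1363 (TLB hit)
```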


[Figure: putting it all together: the TLB, page table, and main memory]


Another approach to virtual memory is the use of segmentation.

Instead of dividing memory into equal-sized pages, virtual address space is divided into variable-length segments, often under the control of the programmer.

A segment is located through its entry in a segment table, which contains the segment's memory location and a bounds limit that indicates its size.

After a fault, the operating system searches for a location in memory large enough to hold the segment that is retrieved from disk.


Both paging and segmentation can cause fragmentation.

Paging is subject to internal fragmentation, because a process may not need the entire range of addresses contained within the page. Thus, there may be many pages containing unused fragments of memory.

Segmentation is subject to external fragmentation, which occurs when contiguous chunks of memory become broken up as segments are allocated and deallocated over time.

The following examples illustrate internal and external fragmentation.


Consider a small computer having 32K of memory.

The 32K memory is divided into 8 page frames of 4K each.

A schematic of this configuration is shown at the right.

The numbers at the right are memory frame addresses.


Suppose there are four processes waiting to be loaded into the system, with memory requirements as shown in the table. We observe that these processes require 31K of memory.


When the first three processes are loaded, memory looks like this: all of the frames are occupied by three of the processes.


Despite the fact that there are enough free bytes in memory to load the fourth process, P4 has to wait for one of the other three to terminate, because there are no unallocated frames.

This is an example of internal fragmentation.


Eventually, Segment 2 of Process 1 is no longer needed, so it is unloaded, giving 11K of free memory.

But Segment 2 of Process 2 cannot be loaded, because the free memory is not contiguous.


Over time, the problem gets worse, resulting in small unusable blocks scattered throughout physical memory.

This is an example of external fragmentation.

Eventually, this memory is recovered through compaction, and the process starts over.


Large page tables are cumbersome and slow, but with their uniform memory mapping, page operations are fast. Segmentation allows fast access to the segment table, but segment loading is labor-intensive.

Paging and segmentation can be combined to take advantage of the best features of both by assigning fixed-size pages within variable-sized segments.

Each segment has a page table. This means that a memory address will have three fields: one for the segment, another for the page, and a third for the offset. A sketch of the three-field translation follows below.
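A minimal sketch of that three-field translation (plain Python; the field widths and the tables here are hypothetical, chosen only to illustrate the two-level lookup):

```python
# Hypothetical layout: | segment | 3-bit page | 10-bit offset |
PAGE_BITS, OFFSET_BITS = 3, 10
segment_table = {2: {5: 1}}  # segment -> its own page table (page -> frame)

def translate(vaddr: int) -> int:
    """Translate a segment/page/offset virtual address to a physical one."""
    offset = vaddr & ((1 << OFFSET_BITS) - 1)
    page = (vaddr >> OFFSET_BITS) & ((1 << PAGE_BITS) - 1)
    segment = vaddr >> (OFFSET_BITS + PAGE_BITS)
    frame = segment_table[segment][page]  # two lookups: segment, then page
    return (frame << OFFSET_BITS) | offset

# Segment 2, page 5, offset 339:
vaddr = (2 << 13) | (5 << 10) | 339
print(translate(vaddr))  # 1363
```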


    6.6 A Real-World Example

Chapter 6 Conclusion

Computer memory is organized in a hierarchy, with the smallest, fastest memory at the top and the largest, slowest memory at the bottom.

Cache memory gives faster access to main memory, while virtual memory uses disk storage to give the illusion of having a large main memory.

Cache maps blocks of main memory to blocks of cache memory. Virtual memory maps virtual pages to page frames.

There are three general types of cache: direct mapped, fully associative, and set associative.


With fully associative and set associative cache, as well as with virtual memory, replacement policies must be established.

Replacement policies include LRU, FIFO, and random.

These policies must also take into account what to do with dirty blocks.

All virtual memory must deal with fragmentation: internal for paged memory, external for segmented memory.
