Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint -...

27
Low Power Cache Design Angel Chen Joe Gambino

Transcript of Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint -...

Page 1: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Low Power Cache DesignAngel ChenJoe Gambino

Page 2: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Agenda➢ Why is “low power” important?➢ How does cache contribute to the power consumption of a

processor?➢ What are some design challenges for low power caches?➢ Existing solutions for low power cache design➢ Researched ultra-low-voltage cache design solution for

many-core systems ○ Detailed architectural overview of the design○ performance comparison with existing solutions

Page 3: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

BackgroundThe cost/performance ratio of computing systems have

seen a steady decline due to advances in:❏ Integrated circuit technology❏ Architectural improvements in CPU design

At the same time, more and more emphasis is given to theminimization of environmental footprint of computingsystems. In particular, energy efficiency is a key concernin future many-core (1000+ processors) system design.

Page 4: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Background (cont.)❏ Among all processor components, the cache

consumes about 40% of the total area and 20% ofpower.

❏ Supply voltage scaling can reduce system energyconsumption at the cost of performance and reliability.(Especially with the current trend toward many-coreprocessors and deep nanometer designs)

Page 5: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Existing Solutions❏ Cache designs that intrinsically consume less energy.

❏ Employ advanced fault tolerance techniques to maintainreliability under low supply voltage.

Page 6: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Existing Solutions (cont.)❏ Redundant Memory

Excessive area and power overheads when cell-failure probability increases

❏ Cache Line DisablingThis is only effective if the cell-failure probability is low

❏ ECC (error correcting codes)E.g. extended hamming codes, multi-bit clustered ECC

Page 7: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Cache Fault Analysis❏ Soft cache fault is generally caused by random noises

or particle strikes. It occurs transiently and is recoverablethrough error-detecting/correcting codes.

❏ Hard cache fault is persistent memory cell fault thatoccurs as a result of chip manufacturing defects ordevice aging. Error correction codes can also be usedto correct this failure but can only correct single-biterrors.

Page 8: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Cache Fault Analysis (cont.)

Page 9: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Cache Fault Analysis (cont.)

Page 10: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Researched Solution

Cache Line Disabling+

Double Error CorrectingTriple Error Detecting Codes

(1-bit error correction for hard errors and 1-bit error correction for soft errors)

Page 11: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Researched Solution (cont.)

Page 12: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Cache Line Disabling

❏ On ROM software called Built In Self Testchecks for errors during initialization ofprocessor

❏ Detects Hard Cache faults❏ Disables Cache lines with more than one fault❏ Only works if the number of faulty lines is low❏ Requires no additional hardware overhead

Page 13: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Cache Line Disabling (cont.)❏ SRAM - Modeled as 3 blocks

❏ Memory Cell Array, Address Decoder, and Read/WriteCircuit

❏ All faults can be detected by checking memory cells❏ Two general types of cache errors

❏ Single-Cell and Multi-Cell❏ Stuck-At 1/0, Stuck Open, State Transition, Data

Retention❏ Cell State Coupling, Multiple Address

Page 14: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Cache Line Disabling (cont.)❏ Any BIST with 100% correctness can be used❏ Most efficient is 9N with an added Data Retention test❏ 9N utilizes ‘Marching’

❏ For Word-Oriented SRAM, each march element is aword❏ Introduces Multi-Cell errors

❏ Writes one march element at a time to each wordaddress, reads it, compares them

❏ After marching, data retention test runs❏ This method performs all checks with the best

speed/ROM space ratio

Page 15: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

ECC Overview❏ Error Detection with Parity Codes

can detect all odd number faults, cannot correct error❏ Single-Error Correction Codes

- product codes (2-D parity check), parity checkmatrices

- such coding scheme is commonly referred to ashamming codes

- cannot simultaneously detect double-bit errors

Page 16: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

ECC Overview (cont.)

Page 17: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

ECC Overview (cont.)❏ Single-Error Correct Double-Error Detect Codes

- widely used to protect memory cells- extend SEC by adding an extra bit that represents

the parity over the seven bits of the SEC code word

Page 18: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

DECTEDDouble-Error Correcting Triple-Error Detecting Codes

Page 19: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

DECTED (cont.)

Page 20: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Simulation Comparison

Page 21: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Simulation Comparison (cont.)

Page 22: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Simulation Comparison (cont.)

Page 23: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Simulation Comparison (cont.)

Page 24: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Simulation Comparison (cont.)

Page 25: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Simulation Comparison (cont.)

Page 26: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

References[1] M. Zhang, V. Stojanovic and P. Ampadu, "Reliable ultra-low-voltage cache design for many-core systems," IEEE

Trans. Circuits Syst. II: Regular Papers,, vol. 59, no. 12, pp. 858-862, Dec. 2012.

[2] A. Aladmeldeen, I. Wagner, Z. Chishti, W. Wu, C. Wilkerson, and S.-L. Lu, “Energy-efficient cache design usingvariable-strength errorcorrecting codes,” in Proc. 38th Int. Symp. Comput. Architecture, 2011, pp. 461–472.

[3] Mukherjee, Shubu. Architecture Design for Soft Errors. Amsterdam: Morgan Kaufmann/Elsevier, 2008. Print.

[4] A. Agarwal, B. C. Paul, H. Mahmoodi, A. Datta, and K. Roy, “A processtolerant cache architecture for improved yieldin nanoscale technologies,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 13, no. 1, pp. 27–38, Jan.2005.

[5] M. H. Tehranipour, Z. Navabi, and S. M. Falkhrai, “An efficient BIST method for testing of embedded SRAMs,” inProc. IEEE Int. Symp. Circuits and Systems, vol. 5, May 2001, pp. 73–76.

[6] Kenyon, Samantha Rose, "Drowsy Cache Partitioning for Multithreaded Systems and High Level Caches" (2014). Thesis. Rochester Institute of Technology. Accessed from RIT Scholar Works Thesis/Dissertation Collections.

Page 27: Low Power Cache Designmeseec.ce.rit.edu/551-projects/spring2015/2-2.pdf · Microsoft PowerPoint - Comp Arch Presentation1.pptx Author: meseec Created Date: 5/12/2015 3:42:06 PM ...

Questions?