Lecture6

High Performance Computing

Jawwad ShamsiLecture #6

27th January 2010

• Cache Coherence• NUMA

Today’s topics

• Cache Coherence – Continuation• Vector Processing

Cache Coherence

• In SMP or NUMA, multiple copies of cache– Each copy may have a different value of data item– Maintain Coherency• How?

Cache Coherence: Two Approaches

• Write back: Update Main memory once cache is flushed.

• Write through: Write is updated to cache as well as to the main memory.

Implementations

• Software Solutions: – Compile time decision– Conservative– Inefficient cache utilization

• Hardware Solutions:– Runtime decision– More effective

Hardware based solution

• Directory Protocol• Snoopy Protocol

Directory

• Disadvantage– Centralized Controller– Bottleneck

• Advantage– Useful in large –scale system

Snoopy Protocol

• Update operation announced• All Cache controllers snoop• Bus architecture– Careful• Increased Bus Traffic

Snoopy Protocol

• Two approaches– Write Invalidate• One write• Multiple readers• Exclusive: Writer invalidates others entries

– Write Update• Multiple writers• All writes are updated

Write Invalidate

• The MESI Protocol : P4 processor– Data cache: Two status bits, 4 states• Modified• Exclusive• Shared• Invalid• See Table

4 Possibilities• Read Miss:

– EX to SH– SH to SH– MO to SH

• Read-Hit• Write-Miss

– RWITM– MO to IN– SH to IN

• Write Hit– SH to IN– EX – Mo

L1- L2 Cache Consistency

Parallel programming and Amdahl's Law

Suppose 1/N time for sequential codeAnd 1-1/N for the parallel

Amdahl's Law

• Speedup: speed gain of using parallel processor vs. single processor

• Speed= 1/(s+(p/N))– S=sequential code, p = parallel code, N= no. of

processors– S= T(1)/ T(j)• For j parallel processorsAs problem size increases, p may rise and s may decrease

Lecture6

Technology

Transcript of Lecture6

Tk3813 Lecture6

lecture6 - Pacharmonpacharmon.com/cpe4196/lecture6.pdf · lecture6 Author: Pacharmon Fuhry Created Date: 3/17/2020 4:04:41 AM ...

NMR Lecture6

Lecture6: 123.101

Cs101 lecture6

Lecture6 Dynamics

compiler lecture6

Lecture6 Wavelet

TNM Lecture6

Slides Lecture6

KNL4343 lecture6

Lecture6: 123.312

Lecture6 radiometricdating

Lecture6 MKT

BDACA1516s2 - Lecture6

Math101 lecture6

lecture6 (Phontics2)

ICS2208 lecture6

Lecture6 Handouts

Lecture6 - WaveEqnAndVibMeasurement