Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering...

29
Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun * , Roman Lysecky * , Karthik Shankar * , Avinash Kodi , Ahmed Louri * and Janet M. Wang* * ECE Department, University of Arizona, Tucson, AZ 85721, USA EECS Department, Ohio University, Athens, OH 45701, UDA E-mail: {sunj, rlysecky, karthik1, louri, wml}@ece.arizona.edu , [email protected] Speaker : Janet M. Wang

Transcript of Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering...

Page 1: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Workload Capacity Considering NBTI Degradation

in Multi-core Systems

Jin Sun*, Roman Lysecky*, Karthik Shankar*, Avinash Kodi†, Ahmed Louri* and Janet M. Wang*

*ECE Department, University of Arizona, Tucson, AZ 85721, USA†EECS Department, Ohio University, Athens, OH 45701, UDA

E-mail: {sunj, rlysecky, karthik1, louri, wml}@ece.arizona.edu, [email protected]

Speaker : Janet M. Wang

Page 2: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Talk OutlineMotivationFractional NBTI device modelNBTI-Aware workload balancing

Dynamic Zoning (DZ)Dynamic Task Scheduling (DTS)Dynamic workload balancing

Experimental ResultsSystem performance evaluationDevice life-span evaluation

Conclusions

Page 3: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

MotivationPermanent fault such as Negative Bias Temperature Instability (NBTI) affects system life-span.

Very little attention has been paid to device stress and its impact.

A meaningful approach: relaxing cores when they are stressed long before complete wear-out.

how to assess a core is stressedhow to assign workload to relax stressed coreshow to avoid additional performance cost

Page 4: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

The General Flow

Page 5: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

NBTI Concept

The alternation of stress and recovery

Fast Recovery

Slow Recovery

A fast recovery and a slow recoveryRecovery and stress periods are fairly symmetric.

Page 6: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Fractional NBTI model (1/2)We consider temperature changes w.r.t time.The changes in Vth in turn affect core performance.Fractional NBTI device model:

Capacity Rate:

σ31 PCR Δ−=

Page 7: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Fractional NBTI model (2/2)

This model averages leakage and delay impact on core performance.

We have transferred from single PMOS model to core capacity rate interpretation.

The capacity rate will be regarded as an upper bound limit for core workload.

NBTI impact on Vth changes with regard to time, so as the capacity rate.

Page 8: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Dynamic Zoning (DZ)The process of one task flow is limited to a zone.

A zone: a group of cores physically adjacent to each other.

Three factors:size of the zone (number of cores)total Manhattan distancesZone capacity

Page 9: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Task GraphDirected Acyclic Graph G = (V, E)

: a set of tasks to be executed.

: precedence relationships.

wi : task weights

The number of tokens consumed (at input) or produced (at output).

},,,{ 21 nvvvV L=

)},{( jiE =

Page 10: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Estimated Workload

Estimated workload:

WCP: Worst-Case Path

the minimally required capacity rate for a zone to process it.

v10

v7v6

v11

v8 v9

v4 v5

v2 v3

v1

WCPvw

VvwWorkload

jj

j

ii

i

∈=∑∑

,

,

WC

P

Page 11: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

ki

ii

kji

ji

k

Zv

WorkloadCRts

Zvv

jidZfind

∈∀

≥∈∀

,..

,

,),(min ),(

Page 12: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Dynamic Task Scheduling (DTS)

Objective:maximizing system utilizationminimizing communication cost

Workload Constraints:Core utilization should not exceed its capacity rate.

Solution: Mixed Integer Programming (MIP)

Page 13: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Task-Core Mapping MatrixAssign tasks to particular cores (within a zone):

Task 1 Task 2 Task 3 Task 4 Task 5

Core 1 1 0 0 0 0

Core 2 0 0 1 0 1

Core 3 0 1 0 1 0

Task Graph

Binary Mapping Matrix

Task-Core

Mapping

Page 14: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Decision Variables (1/3)

A binary matrix M (mxn):m : the number of available cores in a zonen : the number of nodes from the task graphMij =1: task vi mapped onto j-th coreConstraint on Mij ’s:

njMm

iij ,,1,1

1L=∀=∑

=

t1 t2 t3 t4 t5

C1 1 0 0 0 0

C2 0 0 1 0 1

C3 0 1 0 1 0

Page 15: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Decision Variables (2/3)

Starting time for each task:

To keep precedence relationships:For each arc (i, j):

Ti : the execution time for task vi.Tcomm: the communication time between vi and vj.

{ }nSSS ,,, 21 L

jji

commii STTS ≤++ ),(

Page 16: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Decision Variables (3/3)M and {S1, S2, …, Sn} determine the schedule:

We need to identify M matrix and {S1, S2, …, Sn} for the optimal schedule.

S1 S1+T1 < S4 S4+T4 < S7

Page 17: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Workload Constraint

A stressed core will be assigned low workload.Assigned workload for each core

: the frequency scaling ratio Ts : the length of schedule

Core utilization must not exceed its capacity rate:

miCRWL ii ,,1, L=∀≤

s

ijj jiji T

MTWL

1, =∀=∑ α

ijα

Page 18: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Communication Cost (1/2)

Transmitting cost:Nc(i, j): the number of token transmitted on (i, j)c(i, j): time it takes to transmit one token on (i, j)

Buffering cost:Nb(i, j) : the number of token buffered on (i, j)b(i, j): time it takes to buffer one token on (i, j)

Page 19: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

The total communication cost:Tcomm = Ttran + Tbuff

Ttran : total transmitting cost

Tbuff : total buffering cost

Communication Cost (2/2)

),(),(),(

jicjinTji

ctran ∑=

),(),(),(

jibjinTji

bbuff ∑=

Page 20: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

A mixed-integer program:

The Optimization Model

( )

},,{,][var,,1,),(

),(,

,,1,1..

1min

1

),(1

1

nmxnij

ii

jji

commii

m

jij

comm

m

ii

SSSMMmiCRSMWLEjiSTTS

njMts

TU

L

L

L

===∀≤

∈∀≤++

=∀=

⋅+⎟⎠

⎞⎜⎝

⎛−⋅

=

=

βα

Page 21: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

NBTI-Aware Workload Balancing

Adaptive to the frequent update of capacity rates

Generate a new zone when a new flow comes in

Relax an existing flow when a flow is finished execution

Page 22: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Generation of A New Zone

Page 23: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Relaxation of An existing Zone

Maximally Contiguous Region

0.5 0.5 0.5 0.8

0.3 0 0.5 0.5

0.5

0.3

0.8

1.0

0.3 0.6

0 0.5

(a)

Page 24: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Results (1/5): Scheduling resultsStressed

Core IndexCapacity

RateAssigned Workload

9 0.7 0.6248

12 0.5 0.5000

28 0.5 0.3892

33 0.4 0.4000

37 0.4 0.4000

39 0.5 0.4473

50 0.8 0.6482

57 0.9 0.8079

At 90 seconds At 100 seconds

Stressed Core Index

Capacity Rate

Assigned Workload

14 0.4 0.351425 0.4 0.400031 0.5 0.500038 0.5 0.389241 0.7 0.586444 0.3 0.300052 0.6 0.553658 0.5 0.429359 0.2 0.2000

Assigned workload is well bounded by capacity rate.Stressed cores have been recovered.

Page 25: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Results (2/5): Performance Comparison

An insignificant increase in execution time with different offered load (<2% performance degradation)

Page 26: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Results (3/5): Performance Comparison

An insignificant performance drop with increasing number of stressed cores (~3% when there are 16 stressed cores).

Page 27: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Results (4/5): Yield Comparison

The differences in terms of yield becomes obvious after 2 years and starts to widen.

Page 28: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Results (5/5): MTTF Comparison

After about 3 years both cases observe decreases in MTTF. The new strategy shows about 30% less changes.

Page 29: Workload Capacity Considering NBTI Degradation in Multi ...€¦ · Workload Capacity Considering NBTI Degradation in Multi-core Systems Jin Sun*, Roman Lysecky*, Karthik Shankar*,

Thank you!Q&A