Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan,...

28
Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler, Randy Katz 1
  • date post

    21-Dec-2015
  • Category

    Documents

  • view

    217
  • download

    0

Transcript of Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan,...

Page 1: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

1

Design and Analysis of an Energy Agile Cluster Computing System

Andrew Krioukov, Prashanth Mohan, Stephen Dawson-Haggerty, Sara Alspaugh, David Culler, Randy Katz

Page 2: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

2

Grid Evolution

SUPPLIES

LOADS

mostly dispatchable

renewable, variable, intermittent, greatly non-dispatchable

oblivious, stochastic, mostly non-power proportional

reactive, mostly power proportional

TODAY IDEAL FUTURE

oblivious, flat

OLD GRID

non-renewable, reactive, dispatchable

Page 3: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

3

Grid Evolution

SUPPLIES

LOADS

mostly dispatchable

renewable, variable, intermittent, greatly non-dispatchable

oblivious, stochastic, mostly non-power proportional

reactive, mostly power proportional

TODAY IDEAL FUTURE

oblivious, flat

OLD GRID

non-renewable, reactive, dispatchable

Page 4: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

4

Grid Evolution

SUPPLIES

LOADS

mostly dispatchable

renewable, variable, intermittent, greatly non-dispatchable

oblivious, stochastic, mostly non-power proportional

power proportional, reactive, grid-aware

TODAY IDEAL FUTURE

oblivious, flat

OLD GRID

non-renewable, reactive, dispatchable

Page 5: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

5

Grid

Internet

SUPPLIES:provide power

communicate renewable availability, price

LOADS:adapt demand

communicate forecast

electricity

information

Pieces Needed

?

Page 6: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

6

Non-dispatchable, variable supply

Power proportional, grid-aware loads

NREL Western Wind and Solar Integration Study Datasethttp://wind.nrel.gov/Web_nrel/

Pacheco wind farm

Scientific computing cluster

Figure of merit: amount of wind used.How do we get here?

Renewable Integration

Page 7: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

7

PO

WER

TIME

oblivious, flat load

dispatchable supply

power proportionality

grid-awareness

Page 8: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

8

Data Center Loads

IT Equip-ment 61%

Cooling30%

Power Cond.8%

Light1%

data center consumption dominated by IT load

IT load driven by workload

need power proportionality

need load shaping mechanism

Server Idle:Peak

HP ProLiant DL160

63.5%

Apple XServe 3.1 51.8%

IBM System x3450

51.6%

Dell PowerEdge 2950

57.9%

Pelley, et. al, Understanding and Abstracting Total Data Center Power, 2009Barroso et. al. The Case for Energy-Proportional Computing, 2007

SPECpower Results http://www.spec.org/power_ssj2008/results/power_ssj2008.html

5,000 servers at Google

average 30% utilization

IT equipment is not power proportional

pow

er

(W)

utilization

Page 9: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

9

Power Proportionality

Spinning Reserve

Page 10: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

10

Architecture

Page 11: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

11

Outline

• Motivation• Enabling technology• Methodology• Algorithms• Evaluation

Page 12: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

12

Renewable Energy Component

Page 13: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

13

Formulation

We assume the wind farm is sized for the data center.

Option 1: grid blend (open system)

Wind

Other

Requires assuming load is negligible fraction of grid – not realistic

Option 2: dedicated wind farm (closed system)

Fit load to specific wind farm

http://www.greenhousedata.com/

Page 14: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

14

Wind

Wind power over 48 hours from a wind farm in Monterrey County, California.

Variation in wind power for month long intervals at multiple wind farms.

Page 15: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

15

Workload Component

Page 16: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

16

Workloads

Torque jobs

Num

Jobs

Batch: Less latency sensitive, longer jobse.g., analytics, scientific computing

Request

Rate

Wikipedia traffic

Interactive: Latency sensitive, generally short jobse.g., web app server, email server, etc.

Page 17: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

23

Slack

slack = max run time – job duration

Page 18: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

24

Cluster: NERSC FranklinAverage duration: 98 minAverage slack: 68 min

Cluster: EECS PSIAverage duration: 55 minAverage slack: 17 hours

Slack in Real Systems

Page 19: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

25

Grid-Aware Batch Scheduling

• example goal: shape load to match wind availability

• method: exploit temporal slack

Pacheco wind farm

Scientific computing cluster

Page 20: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

26

Greedy Algorithm

B(t) = power budget for next 10 min

Sort jobs by slack

Schedule all jobs with no remaining slack

Schedule other eligible jobs in least-remaining-slack order until B(t) is exceeded

Page 21: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

27

Run-immediately, grid-oblivious scheduler

Greedy, grid-aware scheduler

Grid-aware scheduling increases wind energy use.

Correspondingly, reduces grid dependence.

Page 22: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

29

Reduction in grid dependence is robust to choice of wind farm.

Page 23: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

30

Page 24: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

31

As slack increases, grid dependence diminishes.

PSI Franklin

Page 25: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

32

Grid-aware scheduling is equivalent to 5 hours worth of data center-sized batteries.

Page 26: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

33

Grid-aware scheduling is equivalent to 5 hours worth of data center-sized batteries.

4

Page 27: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

34

Summary

• Power proportionality and grid-aware scheduling

• Energy savings, renewable integration, grid stability

reduce grid dependence by halfequivalent to 5 hours of batteries

• Next stepsslack in other systems...?

Page 28: Design and Analysis of an Energy Agile Cluster Computing System Andrew Krioukov, Prashanth Mohan, Stephen Dawson- Haggerty, Sara Alspaugh, David Culler,

QUESTIONS?THE END

35