
Sparrow: Distributed Low-Latency Scheduling

Kay Ousterhout, Patrick Wendell, Matei Zaharia, Ion Stoica

Sparrow schedules tasks in clusters using a decentralized, randomized approach.

It supports constraints and fair sharing, and provides response times within 12% of ideal.

Scheduling Setting

[Figure: MapReduce/Spark/Dryad jobs, each consisting of many parallel tasks, submitted to the cluster]

Job Latencies Rapidly Decreasing

[Timeline of job latencies, from 10 min. down toward 1 ms:]

2004: MapReduce batch job
2009: Hive query
2010: Dremel query
2010: In-memory Spark query
2012: Impala query
2013: Spark Streaming

Scheduling challenges:

Millisecond Latency

Quality Placement

Fault Tolerant

High Throughput

[The same latency timeline, annotated with the scheduler throughput required for 1000 16-core machines:]

Scheduler throughput:
10 min. tasks: 26 decisions/second
10 sec. tasks: 1.6K decisions/second
100 ms tasks: 160K decisions/second
1 ms tasks: 16M decisions/second
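
These figures follow directly from the cluster size: 1000 machines x 16 cores gives 16,000 task slots, each needing a new scheduling decision once per task duration. A quick check of the arithmetic (not part of the talk):

    # Back-of-the-envelope: decisions/second a scheduler must make to keep
    # 1000 16-core machines (16,000 task slots) busy at each task duration.
    slots = 1000 * 16
    for label, seconds in [("10 min.", 600), ("10 sec.", 10), ("100 ms", 0.1), ("1 ms", 0.001)]:
        print(f"{label}: {slots / seconds:,.1f} decisions/second")
    # -> roughly 26, 1.6K, 160K, and 16M decisions/second, as on the slide.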

Millisecond Latency

Quality Placement

Fault Tolerant

High Throughput

Today: Completely Centralized → Less centralization → Sparrow: Completely Decentralized

[Figure: how each design fares against the four challenges above (✓, ✗, or ?)]

Sparrow: Decentralized approach

Existing randomized approaches
Batch Sampling
Late Binding
Analytical performance evaluation
Handling constraints
Fairness and policy enforcement
Within 12% of ideal on 100 machines

Scheduling with Sparrow

[Figure: many distributed schedulers, each placing jobs on a shared pool of workers]

Random: each task is placed on a randomly chosen worker

Simulated Results

100-task jobs in a 10,000-node cluster, exponentially distributed task durations
Omniscient: an infinitely fast centralized scheduler
[Plot: response time of random placement vs. the omniscient scheduler]

Per-task Sampling

Power of Two Choices

[Figure: for each task, the scheduler probes two randomly chosen workers and places the task on the less loaded one]

Simulated Results

100-task jobs in a 10,000-node cluster, exponentially distributed task durations, 70% cluster load
[Plot: response time of per-task sampling]

Response time grows with tasks/job!

Per-task Sampling

[Figure: a two-task job; the scheduler sends separate probes for Task 1 and Task 2 to randomly chosen workers]
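
A minimal sketch of per-task sampling with the power of two choices, assuming each probe simply returns the worker's queue length; the function and variable names are illustrative, not Sparrow's API:

    import random

    def per_task_sampling(num_tasks, queue_lengths, d=2):
        """Power of two choices: for each task, probe d random workers and
        place the task on the probed worker with the shortest queue."""
        placements = []
        for _ in range(num_tasks):
            probed = random.sample(range(len(queue_lengths)), d)   # d probes per task
            choice = min(probed, key=lambda i: queue_lengths[i])   # less loaded probe
            placements.append(choice)
            queue_lengths[choice] += 1                             # task joins that queue
        return placements

    # Example: a 2-task job (Task 1, Task 2) with d = 2 means 4 probes in total.
    queues = [random.randint(0, 5) for _ in range(100)]            # per-worker queue lengths
    print(per_task_sampling(2, queues, d=2))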

Batch Sampling

Place m tasks on the least loaded of d·m slaves

4 probes (d = 2)

[Figure: the job's probes are pooled across its tasks rather than used per task]
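
A companion sketch of batch sampling under the same queue-length assumption: the d probes per task are pooled, and the job's m tasks go to the m least loaded of the d·m probed workers (illustrative code, not Sparrow's implementation):

    import random

    def batch_sampling(num_tasks, queue_lengths, d=2):
        """Batch sampling: probe d*m workers for an m-task job and place the
        m tasks on the m least loaded of the probed workers."""
        probed = random.sample(range(len(queue_lengths)), d * num_tasks)
        probed.sort(key=lambda i: queue_lengths[i])                # least loaded first
        return probed[:num_tasks]                                  # one worker index per task

    # Same 2-task job: the 4 probes (d = 2) are shared across the whole job.
    queues = [random.randint(0, 5) for _ in range(100)]
    print(batch_sampling(2, queues, d=2))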

Per-task versus Batch Sampling

Simulated Results: 100-task jobs in a 10,000-node cluster, exponentially distributed task durations, 70% cluster load
[Plot: response time of per-task vs. batch sampling]

Queue length is a poor predictor of wait time

[Figure: two workers with queued tasks of 80 ms, 155 ms, and 530 ms; the shorter queue does not imply the shorter wait]

Poor performance on heterogeneous workloads
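
The slide's numbers make the point; the pairing below is only one plausible reading of the figure:

    # Two probed workers: the one with the shorter queue is not the shorter wait.
    queue_a = [80, 155]   # two queued tasks, 235 ms of work in total
    queue_b = [530]       # one queued task, 530 ms of work
    print(len(queue_a), sum(queue_a))   # 2 tasks, 235 ms
    print(len(queue_b), sum(queue_b))   # 1 task, 530 ms: "shorter" queue, longer wait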

Late Binding

Place m tasks on the least loaded of d·m slaves

4 probes (d = 2)

[Figure: probes wait in the workers' queues; when a worker is ready to run a task it requests one from the scheduler, which sends the job's tasks to the first workers that ask]

Simulated Results

100-task jobs in a 10,000-node cluster, exponentially distributed task durations
[Plot: response time with late binding]
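
A rough simulation of late binding under the same model: the scheduler still sends d·m probes, but workers pull tasks only when they are actually ready to run them, so placement is decided by which probed workers free up first (a sketch, not Sparrow's RPC protocol):

    import random

    def late_binding(tasks, work_remaining_ms, d=2):
        """Simulate late binding: send d*m probes, but hold the tasks at the
        scheduler; hand each task to the next probed worker that becomes idle
        (i.e., requests a task). The extra probes are simply never answered."""
        m = len(tasks)
        probed = random.sample(range(len(work_remaining_ms)), d * m)
        request_order = sorted(probed, key=lambda i: work_remaining_ms[i])
        return dict(zip(tasks, request_order))     # first m requesters get the tasks

    work = [random.uniform(0, 1000) for _ in range(100)]   # ms of queued work per worker
    print(late_binding(["task1", "task2"], work, d=2))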

What about constraints?

Job Constraints

[Figure: schedulers probe only the subset of workers eligible for the job]

Restrict probed machines to those that satisfy the constraint

Per-Task Constraints

[Figure: each task's probes go only to machines that satisfy that task's constraint]

Probe separately for each task
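
Both cases reduce to filtering the candidate workers before sampling: a job-level constraint restricts every task's probes to the same set, while per-task constraints give each task its own set. A hedged sketch reusing the earlier queue-length model:

    import random

    def constrained_per_task_sampling(task_candidates, queue_lengths, d=2):
        """task_candidates: for each task, the worker indices that satisfy its
        constraint (for a job-level constraint, every task shares the same set).
        Probes are drawn only from that set; placement is still least loaded."""
        placements = []
        for candidates in task_candidates:
            probed = random.sample(candidates, min(d, len(candidates)))
            placements.append(min(probed, key=lambda i: queue_lengths[i]))
        return placements

    # Example: task 0 must run on workers 3, 7, or 12 (e.g., where its input lives);
    # task 1 is unconstrained. The candidate sets here are made up for illustration.
    queues = [random.randint(0, 5) for _ in range(100)]
    print(constrained_per_task_sampling([[3, 7, 12], list(range(100))], queues, d=2))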

Technique Recap

Batch sampling
+ Late binding
+ Constraint handling

[Figure: decentralized schedulers placing tasks on a pool of workers]

How does Sparrow perform on a real cluster?

Spark on Sparrow

Query: DAG of Stages

[Figure: each stage of the query is submitted to a Sparrow scheduler as a job, and its tasks run on the workers; successive stages are submitted as the DAG executes]

How does Sparrow compare to Spark’s native scheduler?

100 16-core EC2 nodes, 10 tasks/job, 10 schedulers, 80% load

TPC-H Queries: Background

TPC-H: common benchmark for analytics workloads
Shark: SQL execution engine
Spark: distributed in-memory analytics framework
Sparrow: task scheduler

[Figure: the stack used in the evaluation: Shark on Spark on Sparrow]

TPC-H Queries

100 16-core EC2 nodes, 10 schedulers, 80% load

[Plot: query response times; markers show the 5th, 25th, 50th, 75th, and 95th percentiles]

Within 12% of ideal
Median queuing delay of 9 ms

Fault Tolerance

[Figure: Scheduler 1 fails (✗); Spark Client 1 fails over to Scheduler 2, while Spark Client 2 is unaffected]

Timeout: 100 ms
Failover: 5 ms
Re-launch queries: 15 ms
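
The slide describes client-side failover: a client that hears nothing from its scheduler within 100 ms switches to another scheduler and re-launches its outstanding queries. A generic sketch of that pattern with hypothetical scheduler names and a caller-supplied submit RPC (not the Spark client's actual code):

    import socket

    SCHEDULERS = ["scheduler-1", "scheduler-2"]   # hypothetical scheduler endpoints
    TIMEOUT_S = 0.100                             # the slide's 100 ms failure timeout

    def submit_with_failover(submit, job):
        """Try each scheduler in turn; on timeout, fail over and re-launch the job."""
        for addr in SCHEDULERS:
            try:
                return submit(addr, job, timeout=TIMEOUT_S)    # caller-supplied submit RPC
            except socket.timeout:
                continue                                       # failover to the next scheduler
        raise RuntimeError("no scheduler reachable")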

When does Sparrow not work as well?

High cluster load

Related Work

Centralized task schedulers: e.g., Quincy

Two level schedulers: e.g., YARN, Mesos

Coarse-grained cluster schedulers: e.g., Omega

Load balancing: single task

Batch sampling
+ Late binding
+ Constraint handling

www.github.com/radlab/sparrow

Sparrow provides near-ideal job response times without global visibility.

[Figure: decentralized schedulers placing tasks on a pool of workers]

Backup Slides

Can we do better without losing simplicity?

Policy Enforcement

[Figure: each slave keeps separate queues: high priority / low priority, or per-user queues for User A (75%) and User B (25%)]

Fair shares: serve queues using weighted fair queuing
Priorities: serve queues based on strict priorities
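
A small sketch of how a slave might serve these queues, assuming plain FIFO queues per user or priority level: strict priority always drains the higher queue first, while weighted fair sharing picks the queue that has received the least service relative to its share (a coarse stand-in for true weighted fair queuing):

    from collections import deque

    # One slave's task queues, keyed by user (fair shares) or by priority level.
    queues = {"A": deque(["a1", "a2"]), "B": deque(["b1"])}
    weights = {"A": 0.75, "B": 0.25}        # User A (75%), User B (25%)
    served = {"A": 1, "B": 1}               # tasks already served from each queue

    def next_task_strict_priority(order):
        """Strict priorities: serve the first non-empty queue in priority order."""
        for key in order:
            if queues[key]:
                return queues[key].popleft()
        return None

    def next_task_weighted_fair():
        """Weighted fair sharing: serve the non-empty queue that has received the
        least service relative to its weight (a coarse approximation of WFQ)."""
        ready = [k for k in queues if queues[k]]
        if not ready:
            return None
        key = min(ready, key=lambda k: served[k] / weights[k])
        served[key] += 1
        return queues[key].popleft()

    print(next_task_weighted_fair())                 # "a1": A is behind its 75% share
    print(next_task_strict_priority(["A", "B"]))     # "a2": A's queue drains first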