Tians Scheduling: Using Partial Processing in Best-Effort Applications Yuxiong He *, Sameh Elnikety...

Tians Scheduling: Using Partial Processing in Best-Effort Applications

Yuxiong He*, Sameh Elnikety*, Hongyang Sun+

*Microsoft Research+ Nanyang Technological University

Background

• Interactive services are an important part of data center workload

– Example: web search, web server, VOD server, etc.– SLA

» Example: web search can require 99% results returned within 150ms.

• Server utilization for interactive services is embarrassingly low

– Today high server utilization kills responsiveness and result quality.

ExampleM/M/1 Queue• Mean service time = 15ms• Deadline = 100ms• Quality = 1 if it is fully

serviced within deadline; 0 otherwise

• Average quality ≥ 0.99• What is max system

utilization?

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10.950000000000001

0.955000000000001

0.960000000000001

0.965000000000001

0.970000000000001

0.975000000000001

0.980000000000001

0.985000000000001

0.990000000000001

0.995000000000001

Normalized Arrival Rate

FIFO Server:Norm. arrival rate = 0.3CPU utilization = 30%

Example

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10.950000000000001

0.955000000000001

0.960000000000001

0.965000000000001

0.970000000000001

0.975000000000001

0.980000000000001

0.985000000000001

0.990000000000001

0.995000000000001

FIFO Server:Arrival rate = 0.3CPU utilization = 30%

M/M/1 Queue, FIFO Scheduling• Mean service time = 15ms• Deadline = 100ms• Quality = 1 if it is fully

serviced within deadline; 0 otherwise

• Average quality ≥ 0.99• What is max system

utilization?

MotivationM/M/1 Queue, FIFO Scheduling• Mean service time = 15ms• Deadline = 100ms• Quality = 1 if it is fully serviced

within deadline; 0 otherwise• Average quality ≥ 0.99• What is max system utilization?

Goal* Sustain higher load while meeting

SLA. * Reduce hardware, energy and

operational cost.

FIFO Server:Arrival rate = 0.3CPU utilization = 30%

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10.950000000000001

0.955000000000001

0.960000000000001

0.965000000000001

0.970000000000001

0.975000000000001

0.980000000000001

0.985000000000001

0.990000000000001

0.995000000000001

Characteristics of Interactive Services• Deadline with each request• Partial results

– Find the best available result within a predefined response time

• Quality function– Response quality improves with processing time progressive– Exhibits diminishing return concave

Traditional System Tians System

Strict request model

Processing time

flexible model with partial results

Processing time

+ Tians scheduler• Decide processing time of requests• Maximize overall response quality while meeting their deadline

A’xB’

Example3 Jobs, deadline 100

J1 J2 J3 Quality

Service demand 90 20 100

FIFO 90 10 / 0.98 + 0 + 0 = 0.98

FIFO + partial results 90 10 / 0.98 + 0.27 + 0 = 1.25

Tians scheduling + partial results 40 20 40 0.74 + 0.47 + 0.74 = 1.91

0 10 20 30 40 50 60 70 80 90 1000

0.10.20.30.40.50.60.70.80.9

processing time

Quality

Benefits of partial results

Benefits of scheduling

Contribution• Propose Tians scheduling for interactive services, embracing

partial results.

• Present three scheduling algorithms– Tians-Optimal (offline)

» Prove its optimality– Tians-Clairvoyant (online clairvoyant)– Tians-NonClairvoyant (online nonclairvoyant)

• Evaluate Tians scheduling algorithms using simulation– Improve response quality

Outline• Introduction• Scheduling model• Optimal offline algorithm• Online algorithms• Simulation results• Concluding Remarks

Scheduling Model• A set of requests• Each request has

– Arrival time– Deadline – Service demand

• Quality function – maps the processing time the request receives to a quality

• Scheduler – assigns request processing time– maximize total quality of all requests

Optimal Offline Algorithm: Tians-Optimal

Three Intuitions1. When SOEP is feasible, SOEP is optimal.2. An optimal schedule is composed by SOEP blocks.3. Jobs in the busiest interval define a SOEP block.

Intuition 1When SOEP is feasible, SOEP is optimal.

SOEP (service-oriented equi-partitioning) policy– Each request gets equal share of processing time unless it

requires less.» Small requests get what they demand» Large requests share equal processing time

Example: 3 Jobs, deadline 100ms

J1 J2 J3 Quality

Service demand 90 20 100

SOEP 40 20 40 1.91

Intuition 2• SOEP is not always feasible because of deadlines• SOEP not feasible divide into sub-blocks

• Inside each sub-block, SOEP is feasible

• An optimal schedule is composed by SOEP blocks.

• How to identify the SOEP blocks?

SOEP block1 SOEP block2 SOEP block3

Intuition 3Jobs in the busiest interval define a SOEP block.• Inside the block, assign processing time using SOEP

Intuition 3Jobs in the busiest interval define a SOEP block.• Inside the block, assign processing time using SOEP• Remove the block and repeat the process on remaining requests.

Tians-Optimal• Divide and conquer algorithm:

• THEOREM: Tians-Optimal produces an optimal schedule to maximize the total quality of requests.

Tians-Optimal([i,j]) {find the busiest block [k,h]

apply SOEP on requests in block [k,h]

Tians-Optimal ([i,k-1])Tians-Optimal ([h+1, j])

Online Algorithms

Tians-Clairvoyant• Know service demand of released requests• Apply Tians-Optimal on the set of ready requests.

Tians-Nonclairvoyant• Do not know service demand of any requests• Apply Tians-Clairvoyant using mean service demand of requests

Key features of Tians scheduling:• Share processing time equally among requests• Prevent long requests from starving short ones• Easy to implement in real systems : FIFO, no preemption

Simulation Results

• Application– Web Search Engine

» Search and rank matching documentss

• Implement five algorithms– Three Tians algorithms– FIFO : no partial results– FIFO-Partial : support partial results

Web Search Engine• Setup

– Service demand exponential distribution with mean = 26ms

– Poisson arrival– Quality function– SLA

» Deadline 150ms» Average quality ≥0.99

Simulation Result

FIFOArrival rate 0.15Utilization 15%

FIFOFIFO-Partial

Tians-Noncl

FIFO-PartialArrival rate 0.6Utilization 60%

Tians-Noncl.Arrival rate 0.78Utilization 78%

Simulation Result

FIFOFIFO-Partial

Gain from partial results

Tians-Noncl

Simulation Result

FIFOFIFO-Partial

Gain from better scheduling

Simulation Result

FIFOFIFO-Partial

420%Tians-Noncl• Tians sustains more than 400% load.

• Save 80% servers.

Simulation Result

FIFOTians-Noncl

FIFO-Partial

Simulation Result

Gain from knowing service demand

Simulation Result

Gain from knowing future

More Experiments• VOD Server

– Tians manages the upstream bandwidth– Tians-Clairvoyant streams to 40% more clients than FIFO.

• Variance Reduction– Partial results that produces smooth quality function– Share processing time equally among requests

Conclusions

• Tians– Partial results + enhanced scheduling

• Scheduling– Share processing time equally among requests– Prevent long requests from starving short ones

• Simulation results– Improve response quality– To achieve the same QoS, Tians supports much higher

system utilization than traditional server.

Future Work

• Applying Tians in large-scale systems

Thank you!

Questions?

Tians Scheduling: Using Partial Processing in Best-Effort Applications Yuxiong He *, Sameh Elnikety...

Documents

Transcript of Tians Scheduling: Using Partial Processing in Best-Effort Applications Yuxiong He *, Sameh Elnikety...

1. Introduction CSEP 545 Transaction Processing Philip A. Bernstein Sameh Elnikety Copyright ©2012 Philip A. Bernstein 1/4/2012 1.

Long Cheng †‡, Jiannong Cao ‡, Canfeng Chen §, Hongyang Chen ∥, Jian Ma §, Joanna Izabela Siebert ‡ † State Key Lab of Networking & Switching Tech., Beijing.

Acknowledgement: Arijit Khan, Sameh Elnikety. Google: 1 trillion indexed pages Web GraphSocial Network Facebook: 1.5 billion active users 31 billion.

Zeta: Scheduling Interactive Services with Partial Execution Yuxiong He, Sameh Elnikety, James Larus,…

Hongyang Primary School Miss Shao Three Pigs Room.

Predicting Replicated Database Scalability from Standalone ......Predicting Replicated Database Scalability from Standalone Database Profiling Sameh Elnikety Microsoft Research Cambridge,

Verification of logical consistency in robotic reasoning · 2018. 3. 28. · Veriﬁcation of Logical Consistency in Robotic Reasoning Hongyang Qua, Sandor M. Veresa,∗ aDepartment

Orbe: Scalable Causal Consistency Using Dependency Matrices & Physical Clocks Jiaqing Du, EPFL Sameh Elnikety, Microsoft Research Amitabha Roy, EPFL Willy.

ERIM: Secure, Efficient In-process Isolation with Protection Keys … · ERIM: Secure, Efﬁcient In-process Isolation with Protection Keys (MPK) Anjo Vahldiek-Oberwagner Eslam Elnikety

Mohamed Sarwat (Arizona State University) Sameh Elnikety (Microsoft Research )

Clock-RSM: Low-Latency Inter-Datacenter State Machine Replication Using Loosely Synchronized Physical Clocks Jiaqing Du, Daniele Sciascia, Sameh Elnikety.

Network Propagation Predicts Drug Synergy in Cancers · Convergence and Technologies Network Propagation Predicts Drug Synergy in Cancers Hongyang Li,Tingyang Li, Daniel Quang, and

Horton+: A Distributed System for Processing Declarative Reachability Queries over Partitioned Graphs Mohamed Sarwat (Arizona State University) Sameh Elnikety.

DeepCPU: Serving RNN-based Deep Learning Models 10x Faster · Deep Learning Models 10x Faster Minjia Zhang*, Samyam Rajbhandari*, Wenhan Wang, Yuxiong He Microsoft AI and Research

A Cooperative Internet Backup Scheme [1] Leonid Bilevich Advanced Topics in Storage Systems [1]M. Lillibridge, S. Elnikety, A. Birrell, M. Burrows, and.

Arijit Khan Systems Group ETH Zurich Sameh Elnikety Microsoft Research Redmond, WA.

1 Database Replication Using Generalized Snapshot Isolation Sameh Elnikety, EPFL Fernando Pedone, USI Willy Zwaenepoel, EPFL.

Predicting Replicated Database Scalability Sameh Elnikety, Microsoft Research Steven Dropsho, Google Inc. Emmanuel Cecchet, Univ. of Mass. Willy Zwaenepoel,

Graph Data Management Systems for New Application Domains: Social Networks & the Web of Data Tutorial at VLDB 2011 Philippe Cudré-MaurouxSameh Elnikety.

A Brief Introduction to Stereology Yuxiong (Max) Mao Center for Advanced Vehicular System Mississippi State University.

DeepCPU: Serving RNN-based Deep Learning Models 10x Faster · Deep Learning Models 10x Faster Minjia Zhang, Samyam Rajbhandari, Wenhan Wang, Yuxiong He Microsoft AI and Research