ChEsS: Cost-Effective Scheduling Nikos Zacheilas across...
Transcript of ChEsS: Cost-Effective Scheduling Nikos Zacheilas across...
![Page 1: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/1.jpg)
Summarized by: Michael Bowen
ChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce clusters
Nikos ZacheilasVana Kalogeraki
2016 IEEE International Conference on Autonomic Computing
![Page 2: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/2.jpg)
Presentation Summary❖ Key Terminology
❖ Problem Statement
❖ Contributions
❖ Challenges
❖ Variables
❖ Strategy (Methodology)
❖ Impact Estimation
❖ Optimization Problem
❖ Adaptive Weighted Sum
❖ Evaluation
![Page 3: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/3.jpg)
Key TerminologyMapreduce - Word Count - the canonical example
Photo Credit: wikis.nyu.edu
![Page 4: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/4.jpg)
Key Terminology❖ Isolation
❖ Data Isolation
❖ Privacy and security
❖ Failure Isolation
❖ Hide failures across clusters
❖ Version Isolation
❖ Dependency and version management
❖ Performance Isolation
❖ Prod, Dev, Test - multiple prod different delineations
![Page 5: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/5.jpg)
Key Terminology❖ Per-job considerations
❖ Performance
❖ Monetary Cost
❖ Data Locality
❖ Scheduling Policy
❖ FIFO, Fair, and Capacity - more recently EDF and Least-Laxity
![Page 6: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/6.jpg)
Key Terminology
❖ Makespan
❖ End-to-end execution time of the submitted job
❖ Pareto-based analysis
❖ Many possible courses of action competing for attention
![Page 7: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/7.jpg)
Problem Statement
![Page 8: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/8.jpg)
Challenges
❖ Jobs-to-clusters possible assignments is an exponential increase as number of jobs and clusters increases
❖ Difficult to manually determine these assignments
❖ Budget required vs workload makespan
![Page 9: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/9.jpg)
Contributions
❖ Parameter impact estimates
❖ Jobs locality constraints, intra-job scheduling algorithms, etc…
❖ Pareto-frontier search algorithm improvements
❖ Budget vs Makespan tradeoff analysis
❖ Evaluation study of industry workloads
![Page 10: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/10.jpg)
Variables❖ c ∈ Clusters
❖ VMsc
❖ # virtual machines
❖ mslotsc, rslots
c
❖ map/reduce slots
❖ schedulerc
❖ scheduling algorithm
❖ costc
❖ per hour cost (ec2)
❖ threadsc
❖ # threads spawned for execution
❖ Jobsc
❖ set of jobs assigned to cluster c
❖ makespanc
❖ total execution time in seconds of all jobs assigned to c
❖ budgetc
❖ required budget
❖ j ∈ Jobs
❖ mtasksj, rtasksj
❖ # map/reduce tasks used by job j
❖ mslotsj,c, rslotsj,c
❖ # map/reduce slots reserved by j from c
❖ sizej
❖ input data size of j
❖ dataHostj
❖ where input resides
❖ mtimej,c, rtimej,c, stimej,c
❖ map/reduce/shuffle time estimates
❖ JTimej,c
❖ execution time of job j
![Page 11: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/11.jpg)
Strategy
1. Estimate impact of intra-cluster scheduling policies and locality constraints on makespan and budget
2. Formulate multi-objective optimization problem
3. Solve using Adaptive Weighted Sum (AWS)
![Page 12: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/12.jpg)
Impact EstimationKey Assumption - repetitive, aperiodic jobs
Execution time -Lower Bound
Upper Bound
Map/Reduce/Shuffle for lower and upper
Final estimate - average of two limits
![Page 13: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/13.jpg)
Impact EstimationKey Assumption - repetitive, aperiodic jobs
❖ Locality Constraints -
❖ Add overhead of time to transfer data to execution time
❖ Makespan -
❖ Simulator Engine
❖ Input - scheduling policy and set of jobs
❖ Output - makespan
❖ Budget -
❖ Budget vs Exec Time -
![Page 14: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/14.jpg)
Optimization Problem❖ Pareto-frontier
❖ Detect optimal solutions with respect to constraints
❖ Result helps user decide amongst solution space
❖ Example with two job-to-cluster assignments, P and Q
❖ Q dominates P if and only if
❖ Budget Q ≤ Budget P AND Makespan Q < Makespan P
❖ Budget Q < Budget P OR Makespan Q < Makespan P
❖ The set of non-dominated assignments is the solution space of interest - known as the Pareto-frontier
![Page 15: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/15.jpg)
Adaptive Weighted Sum❖ Pareto-frontier search is very costly - use Adaptive Weighted Sum as an approximation
❖ Regular weighted sum -
❖ Greedy - assign jobs to clusters that lead to min utilityScore
❖ Challenge -
❖ Detected solutions non-uniformally distributed
❖ Cannot detect solutions in non-convex regions of the solution space
❖ Adaptive Weighted Sum -
❖ Perform single-objective optimization in unexplored regions of the solution space
![Page 16: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/16.jpg)
Evaluation
❖ Note - developed for INSIGHT, which provides real-time event detection in Dublin
❖ Used industry workloads based on Yahoo’s Hadoop clusters
❖ Used scientific workloads based on traces from Open-Cloud cluster provider
❖ Four possible clusters considered for the possible jobs
![Page 17: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/17.jpg)
EvaluationExecution time estimation error
![Page 18: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/18.jpg)
Evaluationdj parameter impact
![Page 19: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/19.jpg)
EvaluationScheduling Algorithm Impact
![Page 20: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/20.jpg)
EvaluationLocality Constraints Impact
![Page 21: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/21.jpg)
EvaluationComparison With Optimal
![Page 22: ChEsS: Cost-Effective Scheduling Nikos Zacheilas across ...menasce/cs788/slides/cs788-ChEsS_presentation-Michael.pdfChEsS: Cost-Effective Scheduling across multiple heterogeneous mapreduce](https://reader033.fdocuments.in/reader033/viewer/2022041818/5e5c6e486165514287515171/html5/thumbnails/22.jpg)
Critique
❖ Assumption of repetitive, aperiodic jobs
❖ Understandable constraint - difficult to model otherwise
❖ Unsure of how realistic this constraint is
❖ Mapreduce is more of a legacy system at this point
❖ Rapidly losing market-share to Spark