Learning to Optimize Join Queries with Deep RL€¦ · Learning to Optimize Join Queries with Deep...

Learning to Optimize Join Queries with Deep RL

Join Optimization

DQ: Deep Q-learning for join optimization

Discussion

CS294 AI-Sys Presented by: Zongheng Yang

zongheng@berkeley.edu

Join Optimization

Calculate Total Tax Owed For ‘Manager I’ Employees

SELECT SUM(sal.salary*tax.rate)FROM emp, sal, taxWHERE emp.position = sal.position AND tax.country = sal.country AND emp.position = ‘Manager I’

emp_id position country

1 Manager II USA

2 Engineer I CAN

3 Engineer II USA

4 … ..

sal_id position salary

1 Manager I 120000.00

2 Manager II 150000.00

3 Engineer I 78000.00

4 Engineer II 91000.00

tax_id country rate

1 USA 0.32

2 CAN 0.45

3 CHN 0.17

4 … …

Join SequenceCalculate Total Tax Owed

For ‘Manager I’ Employees

SELECT SUM(sal.salary*tax.rate)FROM emp, sal, taxWHERE emp.position = sal.position AND tax.country = sal.country AND emp.position = ‘Manager I’

{S,E,T}

c1 100

c2 200

Find the sequence of joins with minimal cumulative cost

“Imagine yourself standing in front of an exquisite buffet filled with numerous delicacies. Your goal is to

try them all out, but you need to decide in what order. What exchange of tastes will maximize the

overall pleasure of your palate?

…That is the type of problem that query optimizers are called to solve.”

— Yannis Ioannidis

Dynamic ProgrammingSELECT SUM(sal.salary*tax.rate)FROM emp, sal, taxWHERE emp.position = sal.position AND tax.country = sal.country AND emp.position = ‘Manager I’

Table subset Best plan Cost

{E} Index on “position” <cost estimate>

{S} File scan …

{T} File scan …

{E,S} Min{ NestedLoopJoin, SortMergeJoin }

{E,T} … …

{S,T} … …

{E,S,T} … …

Key IdeasThis work: DQ, a learned join optimizer

Key IdeasThis work: DQ, a learned join optimizer1. Observe a native optimizer 2. Train deep Q-learning model 3. Allows fine-tuning on real execution

Generalize to unseen queries Adapt to workload/hardware

Efficient planning (by 10x—10,000x)

OutlineJoin Optimization

Discussion

Markov Decision Process• States: Query graph

• Actions: a valid join

• Reward: Negative cost of the join

• Policy 𝜋: Given a graph, select a join.

𝜋*(s) = arg max q(s,a)

Q-network

Q-networkq_approx(s,a) ≈ q(s,a)

q_approx(s,a)

Join Sequences Estimate Long Term CostFunction Approximator

Input Layer

Hidden Layers

Output Layer

Q-networkq_approx(s,a) ≈ q(s,a)

“(Approximately) How valuable is it to make join a,

over unjoined relations s?”

q_approx(s,a)

Join Sequences Estimate Long Term CostFunction Approximator

Input Layer

Hidden Layers

Output Layer

Collecting Data

HashJoin

IndexJoin

HashJoin

DP emits best plan with optimal

cumulative cost V*

Collecting Data

HashJoin

IndexJoin

HashJoin

cumulative cost V*

IndexJoin

T1 T2“Decision is optimal for eventually

joining T1…T4 with cost V*”

Collecting Data

HashJoin

IndexJoin

HashJoin

cumulative cost V*

IndexJoin

“Decision is optimal for eventually joining T1…T4 with cost V*”

HashJoin

T3IndexJoin

“Decision is optimal for eventually joining T1…T4 with cost V*”HashJoin

T3IndexJoin

HashJoin

Collecting Data

HashJoin

IndexJoin

HashJoin

cumulative cost V*

IndexJoin

“Decision is optimal for eventually joining T1…T4 with cost V*”

HashJoin

T3IndexJoin

“Decision is optimal for eventually joining T1…T4 with cost V*”HashJoin

T3IndexJoin

HashJoin

Key Insight In QO, we have an oracle ——

lots of optimal decisions to learn from! (unlike most RL tasks)

Cost Model

LogicalPlan .stats

Incorporating Feedback

SELECT * FROM …

Real ExecutionCost Model

LogicalPlan .stats

SELECT * FROM …

Runtimes

SELECT * FROM …

LogicalPlan .stats

SELECT * FROM …

Runtimes

SELECT * FROM …

Inexpensive simulator (~1ms) Inaccurate

Expensive to gather More accurate

LogicalPlan .stats

SELECT * FROM …

Runtimes

SELECT * FROM …

Inexpensive simulator (~1ms) Inaccurate

Expensive to gather More accurate

Joined, NextJoin, Cost

(ES, T(ES), 1e7) (ET, (ET)S, 2e7) ...

Joined, NextJoin, Runtime

(ES, T(ES), 1000ms) (ET, (ET)S, 500ms) ...

Train on costs, optionally fine-tune on runtimes

OutlineJoin Optimization

Discussion

Learning in DatabasesB-tree, hash table,

bloom filters (SIGMOD ’18)

Join optimization (our work; in submission)

Cardinality estimationLearned Cardinalities: Estimating Correlated Joins with Deep Learning CIDR ’19

Plan enumerationThis work; Marcus et al., 2018; Ortiz et al., 2019;

End-to-endSageDB, CIDR ’19 Towards a Hands-Free Query Optimizer through Deep Learning (position paper) CIDR ‘19

DB tuning (SIGMOD ’17)

Learning in DatabasesB-tree, hash table,

bloom filters (SIGMOD ’18; Brain)

Join optimization (our work; in submission)

Cardinality estimationLearned Cardinalities: Estimating Correlated Joins with Deep Learning CIDR ’19 (to appear)

Plan enumerationThis work; Marcus et al., Arxiv 2018; …

End-to-endTowards a Hands-Free Query Optimizer through Deep Learning (position paper) CIDR ‘19

DB tuning (SIGMOD ’17; CMU)Can ML replace 40+ years of programmed heuristics

with data-driven heuristics?

Discussion• In DB context, possible/how to explore?

(disastrous plans exist)

• Breaking free from faulty cost model

• Generalizing query optimization to program optimization

Learning to Optimize Join Queries with Deep RL€¦ · Learning to Optimize Join Queries with Deep...

Documents

Transcript of Learning to Optimize Join Queries with Deep RL€¦ · Learning to Optimize Join Queries with Deep...

Learning to Optimize Join Queries With Deep Reinforcement Learningzongheng.me/pubs/dq-jan-2019.pdf · 2020-04-09 · Learning to Optimize Join Queries With Deep Reinforcement Learning

Distributed deep rl on spark strata singapore

Practical Applications of Deep Reinforcement Learning ... · DL4J and RL4J libraries. SKIL & AnyLogic. Summer 2019. AnyLogic Cloud Python API (RL ready) June 2019. RL capabilities

Deep RL with Q-Functionsrail.eecs.berkeley.edu/deeprlcourse/static/slides/lec-8.pdf · Deep RL with Q-Functions CS 285: Deep Reinforcement Learning, Decision Making, and Control Sergey

Deep Reinforcement Learning - NTU Speech Processing …speech.ee.ntu.edu.tw/~tlkagk/courses/ML_2016/Lecture/RL (v6).pdf · To learn deep reinforcement learning ... e.g. Q-learning

Video Summarisation by Classiﬁcation with Deep ...andrea/papers/2018_BMVC_Video... · ZHOU ET AL.: VIDEO SUMMARISATION BY CLASSIFICATION WITH DEEP RL 3. the classiﬁer as global

Introduction to Deep Reinforcement Learning Model-free Methods · •People started to apply deep learning into robotics Deep RL for Robotics Levine, Sergey, and Chelsea Finn. "Deep

RL 17a: Deep RL - The University of Edinburgh · AlphaGo Chess: ˇ30possiblemoves,ˇ80movespergame Go: ˇ250possiblemoves,ˇ150movespergame Approach Reduce depth by truncating search

Fast Swept Volume Estimation with Deep Learning · Fast Swept Volume Estimation with Deep Learning 5 between deep machine learning and sampling-based planning is PRM-RL [19]. Similar

10-703 Deep RL and Controls OpenAI Gym Recitation API Basic Datatypes ... Minecraft. VirtualEnv Installation ... 10-703 Deep RL and Controls OpenAI Gym Recitation Author: Devin Schwab

Stochastic Latent Actor-Critic: Deep Reinforcement ......stochastic sequential models and RL into a single method, performing RL in the model’s learned ... Partial observability

FLOW - Lagrangian Control through Deep-RL: Applications to … · 2020. 8. 7. · flow-project/flow. I. INTRODUCTION Over the past few years, deep reinforcement learning (RL) has

RL + DL · RL + DL Rob Platt Northeastern University Some slides from David Silver, Deep RL Tutorial. Deep RL in an exciting recent development. Deep RL in an exciting recent development.

DWA-RL: Dynamically Feasible Deep Reinforcement ... - UMD

APPROACH SURFACE HORIZONTAL SECTION RL 400…dsewebapps.dse.vic.gov.au/Shared/ATSAttachment1.nsf/(attachment... · rl 400 rl 400.52 rl 340 rl 350 rl 360 rl 370 rl 380 ... first section

Deep RL For Starcraft II - Machine Learningcs229.stanford.edu/proj2017/final-reports/5234603.pdfDeep RL For Starcraft II Andrew G. Chang agchang1@stanford.edu Abstract Games have proven

DEEP S GENERALIZATION IN RL · Accepted to the ICLR 2020 workshop: Beyond tabula rasa in RL (BeTR-RL) DEEP SETS FOR GENERALIZATION IN RL Tristan Karch 12C´edric Colas Laetitia Teodorescu

Learning to Walk Via Deep Reinforcement Learningrss2019.informatik.uni-freiburg.de/papers/0060_FI.pdf · Abstract ÑDeep reinforcement learning (deep RL) holds the promise of automating

Central Sydney Planning Committee (CSPC) - 23 June 2016 - Item … · 2016. 6. 23. · rl 24.48 rl 24.48 rl 24.35 rl 24.35 rl 24.35 rl 25.60 rl 25.60 rl 24.18 3a lobby 82.9 m² 3a

Theoretically Principled Deep RL Acceleration via Nearest ...