PIQL: Success-Tolerant Query Processing in the Cloud

Michael Armbrust, Kristal Curtis, Tim Kraska, Armando Fox, Michael J. Franklin, David A. Patterson

AMP Lab, EECS, UC Berkeley

Introduction

• Large-scale websites are increasingly moving from relational databases to distributed key-value stores.

• Why?
- High request rate
- Low-latency workloads
- Scalability

Key-value stores: at what cost?

• Writing complex imperative functions
• Index management
• Intra-query parallelization
• And loss of data independence

A blend of both: PIQL

• Performance-predictable subset of SQL
• Benefits of an RDBMS, such as the ability to express queries declaratively
• Physical data independence
• Automatic index selection and maintenance
• Real-time guarantees on performance that come from the underlying key-value store

Features:

• Runs on top of key-value stores
• Bounds on the number of operations that will be performed on the key-value store
• Compile-time feedback on worst-case performance for all queries
• Automatic selection and maintenance of indexes

Alternative approach

• Complex developer-written imperative programs
• Example: data model of Cassandra

For a query that searches for messages containing a certain word, values are inserted in the form:

row -> userid, supercolumn -> word, column -> messageTimestamp, value -> messageId
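As a rough illustration of the imperative approach, the following Python sketch mimics the row -> supercolumn -> column -> value layout with nested dictionaries. The in-memory `index`, the helper names `insert_message` and `search_messages`, and the sample data are all assumptions made for illustration; this is not the Cassandra API.

```python
from collections import defaultdict

# Hypothetical in-memory stand-in for the layout above:
# row (userid) -> supercolumn (word) -> column (messageTimestamp) -> value (messageId)
index = defaultdict(lambda: defaultdict(dict))


def insert_message(user_id, message_id, timestamp, text):
    """Index one message under every distinct word it contains."""
    for word in set(text.lower().split()):
        index[user_id][word][timestamp] = message_id


def search_messages(user_id, word):
    """Return message ids for user_id that contain word, ordered by timestamp."""
    columns = index.get(user_id, {}).get(word.lower(), {})
    return [columns[ts] for ts in sorted(columns)]


# Made-up sample data.
insert_message("alice", "m1", 10, "lunch at noon")
insert_message("alice", "m2", 20, "Lunch moved to one")
insert_message("bob", "m3", 15, "lunch plans")

result = search_messages("alice", "lunch")
print(result)
```

Even in this toy form, the developer has to maintain the word index by hand on every insert, which is the index-management burden that a declarative query is meant to remove.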

Equivalent PIQL query:

    FETCH message
    OF user BY recipient
    WHERE user = [this] AND
    message.text CONTAINS [1: word]
    ORDER BY timestamp

Query Scaling classes

• Class 1 (Constant): amount of data required to process the query is constant

• Class 2 (Bounded): amount of data required to process the query is naturally bounded

• Class 3 (Sub-linear or Linear): amount of data required to process the query grows sub-linearly or linearly

• Class 4 (Super-linear)

PIQL Query Syntax

• name
• Parameters [ordinal:name]

General syntax:

    QUERY name
    FETCH entity
    [OF joined-entity alias BY relationship] ...
    WHERE predicates
    [{PAGINATE perpage | LIMIT count}]
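To make the `[ordinal:name]` parameter notation concrete, here is a small Python sketch that pulls the placeholders out of a query template. The regular expression and the `extract_parameters` helper are assumptions for illustration only and are not part of PIQL; the sample text is the userThoughts query from this deck.

```python
import re

# Matches placeholders of the form [ordinal:name], allowing spaces after the colon.
PARAM_PATTERN = re.compile(r"\[(\d+):\s*(\w+)\]")


def extract_parameters(query_text):
    """Return {ordinal: name} for every [ordinal:name] placeholder found."""
    return {int(ordinal): name for ordinal, name in PARAM_PATTERN.findall(query_text)}


query = """
QUERY userThoughts
FETCH thought OF user BY owner
WHERE user.name = [1:username]
ORDER BY timestamp
LIMIT [2:count] MAX 100
"""

params = extract_parameters(query)
print(params)
```

A placeholder without an ordinal, such as `[this]`, is deliberately not matched by this pattern.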

Example Class 1

• To return the profile of a user given a user name:

    QUERY userByName
    FETCH user
    WHERE user.name = [1:name]

• Calculating bound: simple, zero or one result

Example Class 2

• To return users by their hometown:

    QUERY userByHometown
    FETCH user
    WHERE user.hometown = [1:hometown]
    LIMIT [2:count] MAX 100

• Calculating bound: the LIMIT clause returns at most 100 items

Example Class 3

• To return a list of the most recent thoughts owned by a particular user

    QUERY userThoughts
    FETCH thought OF user BY owner
    WHERE user.name = [1:username]
    ORDER BY timestamp
    LIMIT [2:count] MAX 100

Bound: 100

Example Class 4

To return a paginated list, 10 at a time, of the most recent thoughts of all the approved subscriptions owned by the current user.

    QUERY thoughtstream
    FETCH thought
    OF user friend BY owner
    OF subscription BY target
    OF user me BY owner
    WHERE me.username=[1:username] AND approved = true
    ORDER BY timestamp
    PAGINATE 10

Comparison

• Graph

Queries in PIQL

• Entities are analogous to relations
• Queries are specified as templates ahead of time
• The number of operations required in the worst case is provided to the developer as feedback at compile time
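A minimal sketch of how such a worst-case operation count might be derived, under a deliberately simplified model: each step of a query declares the maximum number of tuples it can return, the first step costs one request, and every later step issues one key-value request per tuple produced by the step before it. The function name, the model, and the subscription limit of 100 are assumptions for illustration; this is not PIQL's actual compiler logic.

```python
def worst_case_operations(step_limits):
    """Bound the key-value requests for a chain of dependent lookups.

    step_limits[i] is the maximum number of tuples step i can return.
    Illustrative model only: the first step costs one request and each
    later step issues one request per tuple from the previous step.
    """
    if not step_limits:
        return 0
    operations = 1               # first step: a single request
    tuples = step_limits[0]
    for limit in step_limits[1:]:
        operations += tuples     # one request per upstream tuple
        tuples *= limit          # each request may return up to `limit` tuples
    return operations


# Thoughtstream-like chain with hypothetical limits:
# 1 user, at most 100 subscriptions, 10 thoughts per page.
bound = worst_case_operations([1, 100, 10])
print(bound)
```

Under this model the bound depends only on the declared limits, not on the size of the database, which is what makes compile-time feedback possible.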

Architecture Overview

Optimization in PIQL

• Phase 1: Stop Operator Insertion

Optimization in PIQL

• Phase 2

Prediction Framework

Performance Insight Assistant

• Provides feedback to the developer to fix ‘unsafe’ queries
• Guidance on how to set a ‘cardinality limit’ compatible with SLO compliance
• Provides a chart of the latency distribution for each setting of the cardinality limit
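The idea of choosing a cardinality limit that stays within an SLO can be sketched as follows. This is a hypothetical illustration, not the assistant's actual model: it assumes predicted latency samples are already available for each candidate limit, uses a nearest-rank percentile, and all numbers are made up.

```python
import math


def percentile(samples, q):
    """Nearest-rank percentile of a non-empty list, with q in (0, 100]."""
    ordered = sorted(samples)
    rank = math.ceil(q / 100 * len(ordered))
    return ordered[max(rank, 1) - 1]


def largest_safe_limit(latencies_by_limit, slo_ms, q=99):
    """Return the largest cardinality limit whose q-th percentile meets the SLO."""
    safe = [
        limit
        for limit, samples in latencies_by_limit.items()
        if percentile(samples, q) <= slo_ms
    ]
    return max(safe) if safe else None


# Made-up predicted latencies in milliseconds for each candidate limit.
predicted = {
    50: [40, 45, 50, 60, 70],
    100: [60, 70, 80, 90, 110],
    200: [90, 120, 150, 180, 260],
}

choice = largest_safe_limit(predicted, slo_ms=150)
print(choice)
```

Returning `None` when no candidate qualifies leaves the decision to the developer rather than silently picking an unsafe limit.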

Performance Insight Assistant

• Predicted Heat Map for Thoughtstream query

Execution Engine

• Leverages key-value store to achieve scalability and high performance

• Requests to a key-value store are done in parallel

• Limit hint information is used to prefetch all required data in a single request
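Issuing requests to the key-value store in parallel can be sketched with Python's standard thread pool. The dictionary `store` and the `kv_get` function are hypothetical stand-ins for a remote key-value service; nothing here reflects the actual PIQL execution engine.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical key-value store: a plain dict stands in for the remote service.
store = {f"thought:{i}": f"text {i}" for i in range(5)}


def kv_get(key):
    """Stand-in for a single remote get; returns None for a missing key."""
    return store.get(key)


def parallel_get(keys, max_workers=8):
    """Fetch all keys concurrently, preserving the order of `keys`."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(kv_get, keys))


values = parallel_get(["thought:0", "thought:3", "thought:9"])
print(values)
```

With a real remote store, the overall latency of such a batch is governed by the slowest request rather than the sum of all requests.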

Performance overview

Fig: System scaling in number of users/machines with constant query latency

Conclusion

• Performance predictability and scalability of key-value stores + scale independence of the relational model = PIQL

• GQL, Hive, Pig, and VoltDB cover similar ground, but they focus on batch analytics rather than interactive applications
