Megastore: Providing Scalable, Highly Available Storage for Interactive Services


J. Baker, C. Bond, J.C. Corbett, JJ Furman, A. Khorlin, J. Larson, J-M Léon, Y. Li, A. Lloyd, V. Yushprakh

Google Inc.

Originally presented at CIDR 2011 by James Larson

Presented by George Lee

With Great Scale Comes Great Responsibility

•A billion Internet users

•Small fraction is still huge

•Must please users

•Bad press is expensive - never lose data

•Support is expensive - minimize confusion

•No unplanned downtime

•No planned downtime

•Low latency

•Must also please developers, admins

Megastore

•Started in 2006 for app development at Google

•Service layered on:

•Bigtable (NoSQL scalable data store per datacenter)

•Chubby (Config data, config locks)

•Turnkey scaling (apps, users)

•Developer-friendly features

•Wide-area synchronous replication

•Partition by "Entity Group"

Entity Group Examples

Application   Entity Groups    Cross-EG Ops
Email         User accounts    none
Blogs         Users, Blogs     Access control, notifications, global indexes
Mapping       Local patches    Patch-spanning ops
Social        Users, Groups    Messages, relationships, notifications
Resources     Sites            Shipments

Achieving Technical Goals

• Scale

• Bigtable within datacenters

• Easy to add Entity Groups (storage, throughput)

• ACID Transactions

• Write-ahead log per Entity Group

• 2PC or Queues between Entity Groups

• Wide-Area Replication

• Paxos

• Tweaks for optimal latency
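A minimal Python sketch of how a write-ahead log per Entity Group can support read-modify-write transactions inside one group. The names `EntityGroup`, `begin`, and `commit` are hypothetical, and the conflict rule used here (only one transaction may claim each log position) is a simplifying assumption for illustration, not Megastore's actual code.

```python
from typing import Dict, List


class EntityGroup:
    """Toy entity group: a write-ahead log plus the state derived from it."""

    def __init__(self) -> None:
        self.log: List[Dict[str, str]] = []  # committed mutations, in order
        self.data: Dict[str, str] = {}       # current state after applying the log

    def begin(self) -> int:
        # A transaction remembers the log position it read at.
        return len(self.log)

    def commit(self, read_position: int, mutations: Dict[str, str]) -> bool:
        # Optimistic concurrency (assumed rule): if another transaction has
        # appended since we read, this one loses and must retry.
        if read_position != len(self.log):
            return False
        self.log.append(dict(mutations))  # write the log entry first...
        self.data.update(mutations)       # ...then apply it to the state
        return True
```

Two transactions that start at the same log position cannot both commit, which is one way to see why the update rate per Entity Group is limited while separate groups can proceed independently.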

Two Phase Commit

•Commit request/Voting phase

•Coordinator sends query to commit

•Cohorts prepare and reply

•Commit/Completion phase

•Success: Commit and acknowledge

•Failure: Rollback and acknowledge

•Disadvantage: Blocking protocol
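The two phases can be sketched in a few lines of Python. This is an in-memory illustration only: the `Cohort` class, its `will_vote_yes` flag, and the `two_phase_commit` function are hypothetical names, and real messages, timeouts, and logging are left out.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Cohort:
    """A participant that votes, then commits or rolls back."""
    name: str
    will_vote_yes: bool = True  # hypothetical knob to simulate a cohort that cannot commit
    state: str = "idle"

    def prepare(self) -> bool:
        # Voting phase: prepare the work and reply with a vote.
        self.state = "prepared" if self.will_vote_yes else "aborted"
        return self.will_vote_yes

    def commit(self) -> None:
        self.state = "committed"

    def rollback(self) -> None:
        self.state = "rolled_back"


def two_phase_commit(cohorts: List[Cohort]) -> bool:
    # Phase 1: the coordinator sends the query to commit and collects votes.
    votes = [c.prepare() for c in cohorts]
    # Phase 2: commit only on a unanimous yes, otherwise roll everyone back.
    if all(votes):
        for c in cohorts:
            c.commit()
        return True
    for c in cohorts:
        c.rollback()
    return False
```

The blocking problem is visible in the `prepared` state: a cohort that has voted yes cannot decide on its own and has to wait for the coordinator's phase 2 message.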

Basic Paxos

•Prepare and Promise

•Proposer selects proposal number N and sends Prepare(N) to acceptors

•Acceptors reply with a promise or deny the request

•Accept! and Accepted

•Proposer sends out value

•Acceptors respond to proposer and learners

Message Flow: Basic Paxos

Client   Proposer      Acceptor     Learner
   |         |          |  |  |       |  |
   X-------->|          |  |  |       |  |  Request
   |         X--------->|->|->|       |  |  Prepare(N)
   |         |<---------X--X--X       |  |  Promise(N,{Va,Vb,Vc})
   |         X--------->|->|->|       |  |  Accept!(N,Vn)
   |         |<---------X--X--X------>|->|  Accepted(N,Vn)
   |<---------------------------------X--X  Response
   |         |          |  |  |       |  |
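The Prepare/Promise and Accept!/Accepted exchange can be sketched as a single-process Python model. The `Acceptor` class and `propose` function are hypothetical names; messages are plain method calls, there are no learners or failures, and one call to `propose` plays the role of one proposer round.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Acceptor:
    promised_n: int = -1                  # highest proposal number promised so far
    accepted_n: int = -1                  # number of the last accepted proposal
    accepted_value: Optional[str] = None  # value of the last accepted proposal

    def on_prepare(self, n: int) -> Optional[Tuple[int, Optional[str]]]:
        # Promise to ignore proposals numbered below n, or deny by returning None.
        if n > self.promised_n:
            self.promised_n = n
            return (self.accepted_n, self.accepted_value)
        return None

    def on_accept(self, n: int, value: str) -> bool:
        # Accept unless a higher-numbered Prepare has been promised since.
        if n >= self.promised_n:
            self.promised_n = n
            self.accepted_n = n
            self.accepted_value = value
            return True
        return False


def propose(acceptors: List[Acceptor], n: int, value: str) -> Optional[str]:
    majority = len(acceptors) // 2 + 1

    # Phase 1: Prepare(n) and Promise.
    replies = (a.on_prepare(n) for a in acceptors)
    promises = [p for p in replies if p is not None]
    if len(promises) < majority:
        return None

    # If some acceptor already accepted a value, the proposer must adopt
    # the one with the highest proposal number instead of its own.
    _, previous_value = max(promises, key=lambda p: p[0])
    if previous_value is not None:
        value = previous_value

    # Phase 2: Accept!(n, value) and Accepted.
    accepted = sum(a.on_accept(n, value) for a in acceptors)
    return value if accepted >= majority else None
```

With three fresh acceptors, `propose(acceptors, 1, "x")` chooses `"x"`; a later `propose(acceptors, 2, "y")` still returns `"x"` because the proposer adopts the value that was already accepted, and a retry with the stale number 1 is denied.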

Paxos: Quorum-based Consensus

"While some consensus algorithms, such as Paxos, have started to find their way into [large-scale distributed storage systems built over failure-prone commodity components], their uses are limited mostly to the maintenance of the global configuration information in the system, not for the actual data replication."

-- Lamport, Malkhi, and Zhou, May 2009

Paxos and Megastore

•In practice, basic Paxos is not used

•Master-based approach?

•Megastore’s tweaks

•Coordinators

•Local reads

•Read/write from any replica

•Replicate log entries on each write
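One way to picture how coordinators enable local reads is the toy model below. It is an assumption-laden sketch, not Megastore's implementation: `Coordinator`, `Replica`, `write`, and `read` are hypothetical names, the Paxos round is reduced to a majority check, and catch-up simply copies the longest log.

```python
from typing import Dict, List, Set


class Coordinator:
    """Tracks the entity groups for which the local replica has seen every write."""

    def __init__(self) -> None:
        self.up_to_date: Set[str] = set()


class Replica:
    def __init__(self) -> None:
        self.log: Dict[str, List[str]] = {}  # entity group -> ordered log entries
        self.coordinator = Coordinator()


def write(replicas: List[Replica], reachable: List[Replica],
          group: str, entry: str) -> bool:
    """Replicate one log entry; requires a majority of replicas to be reachable."""
    if len(reachable) < len(replicas) // 2 + 1:
        return False
    # Every earlier write reached a majority, so some reachable replica has the full log.
    latest = max((r.log.get(group, []) for r in reachable), key=len)
    new_log = latest + [entry]
    for r in replicas:
        if r in reachable:
            r.log[group] = list(new_log)
            r.coordinator.up_to_date.add(group)
        else:
            # A replica that missed the write must stop serving local reads for this group.
            r.coordinator.up_to_date.discard(group)
    return True


def read(local: Replica, replicas: List[Replica], group: str) -> List[str]:
    """Serve from the local replica when its coordinator says it is current."""
    if group not in local.coordinator.up_to_date:
        # Simplified catch-up: copy the longest log, then mark the group current again.
        newest = max((r.log.get(group, []) for r in replicas), key=len)
        local.log[group] = list(newest)
        local.coordinator.up_to_date.add(group)
    return list(local.log.get(group, []))
```

In this model a read costs no cross-replica traffic in the common case, and the price is paid on writes, which must update or invalidate every coordinator before they complete.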

Omissions

•These were noted in the talk:

•No current query language

•Apps must implement query plans

•Apps have fine-grained control of physical placement

•Limited per-Entity Group update rate

Is Everybody Happy?

• Admins

• linear scaling, transparent rebalancing (Bigtable)

• instant transparent failover

• symmetric deployment

• Developers

• ACID transactions (read-modify-write)

• many features (indexes, backup, encryption, scaling)

• single-system image makes code simple

• little need to handle failures

• End Users

• fast up-to-date reads, acceptable write latency

• consistency

Take-Aways

•Constraints acceptable to most apps

•Entity Group partitioning

•High write latency

•Limited per-EG throughput

•In production use for over 4 years

•Available on Google App Engine as HRD (High Replication Datastore)

Questions?

•Are rate limitations on writes insignificant?

•Why not lots of RDBMS?

•Why not NoSQL with “mini-databases”?
