
Dynamo: Amazon’s Highly Available Key-value Store

Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall and Werner Vogels

Motivation

Build a distributed storage system:

Scale

Simple: key-value

Highly available

Guarantee Service Level Agreements (SLA)

System Assumptions and Requirements

Query Model: simple read and write operations to a data item that is uniquely identified by a key.

ACID Properties: Atomicity, Consistency, Isolation, Durability. Dynamo targets applications that accept weaker consistency in exchange for higher availability; it provides no isolation guarantees and permits only single-key updates.

Efficiency: latency requirements, generally measured at the 99.9th percentile of the distribution.

Other Assumptions: the operation environment is assumed to be non-hostile and there are no security-related requirements such as authentication and authorization.
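
In the paper, this query model surfaces as a two-operation interface: get(key) returns a single object or a list of causally conflicting versions together with an opaque context, and put(key, context, object) writes a new version whose causality is derived from that context. A minimal Java sketch (type names other than get/put are illustrative, not from the paper):

import java.util.List;

// Sketch of the two-operation query model; names other than get/put are illustrative.
public interface DynamoStyleStore {
    // May return one object, or several causally conflicting versions,
    // plus an opaque context carrying the version metadata (vector clocks).
    ReadResult get(byte[] key);

    // The context from a preceding get tells the store which versions
    // this write supersedes.
    void put(byte[] key, Context context, byte[] object);

    record ReadResult(List<byte[]> versions, Context context) {}
    record Context(byte[] versionMetadata) {}
}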

Service Level Agreements (SLA)

An application should be able to deliver its functionality in bounded time: every dependency in the platform needs to deliver its functionality with even tighter bounds.

Example: a service guaranteeing that it will provide a response within 300 ms for 99.9% of its requests, at a peak client load of 500 requests per second.
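
To make the 99.9th-percentile target concrete: under this SLA, at most 1 request in 1,000 may take longer than 300 ms. A small sketch of a nearest-rank percentile check (illustrative, not from the paper):

import java.util.Arrays;

public class SlaCheck {
    // Nearest-rank percentile: the smallest sample such that at least
    // p percent of all samples are <= it.
    static double percentile(double[] latenciesMs, double p) {
        double[] s = latenciesMs.clone();
        Arrays.sort(s);
        int rank = (int) Math.ceil(p / 100.0 * s.length) - 1;
        return s[Math.max(rank, 0)];
    }

    public static void main(String[] args) {
        double[] samples = {12, 80, 95, 110, 250, 290}; // illustrative latencies in ms
        double p999 = percentile(samples, 99.9);
        System.out.println("p99.9 = " + p999 + " ms, SLA met: " + (p999 <= 300));
    }
}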

[Figure: Service-oriented architecture of Amazon’s platform]

Design Consideration

Sacrifice strong consistency for availability.

Conflict resolution is executed during reads instead of writes, i.e. the store is “always writeable”.

Other principles:

Incremental scalability.

Symmetry.

Decentralization.

Heterogeneity.

Summary of techniques used in Dynamo and their advantages

Problem | Technique | Advantage
Partitioning | Consistent hashing | Incremental scalability.
High availability for writes | Vector clocks with reconciliation during reads | Version size is decoupled from update rates.
Handling temporary failures | Sloppy quorum and hinted handoff | Provides high availability and durability guarantee when some of the replicas are not available.
Recovering from permanent failures | Anti-entropy using Merkle trees | Synchronizes divergent replicas in the background.
Membership and failure detection | Gossip-based membership protocol and failure detection | Preserves symmetry and avoids having a centralized registry for storing membership and node liveness information.

Partition Algorithm

Consistent hashing: the output range of a hash function is treated as a fixed circular space or “ring”.

“Virtual nodes”: each physical node can be responsible for more than one virtual node (position on the ring).

Advantages of using virtual nodes

If a node becomes unavailable, the load handled by this node is evenly dispersed across the remaining available nodes.

When a node becomes available again, the newly available node accepts a roughly equivalent amount of load from each of the other available nodes.

The number of virtual nodes that a node is responsible for can be decided based on its capacity, accounting for heterogeneity in the physical infrastructure.
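
A minimal sketch of the ring with virtual nodes follows. The paper does hash keys with MD5; the data structure and everything else here are assumptions for illustration. Giving a bigger machine more virtual nodes is exactly how capacity heterogeneity is absorbed.

import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Map;
import java.util.TreeMap;

public class ConsistentHashRing {
    // Ring position -> physical node owning the virtual node at that position.
    private final TreeMap<Long, String> ring = new TreeMap<>();

    // A node with more capacity is given more virtual nodes, so it absorbs
    // proportionally more of the key space.
    public void addNode(String node, int virtualNodes) {
        for (int i = 0; i < virtualNodes; i++) {
            ring.put(position(node + "#" + i), node);
        }
    }

    // The node for a key is the first virtual node clockwise from hash(key).
    public String nodeFor(String key) {
        Map.Entry<Long, String> e = ring.ceilingEntry(position(key));
        return (e != null) ? e.getValue() : ring.firstEntry().getValue(); // wrap around
    }

    // The paper hashes keys with MD5; here the digest is folded into a long.
    private static long position(String s) {
        try {
            byte[] d = MessageDigest.getInstance("MD5")
                    .digest(s.getBytes(StandardCharsets.UTF_8));
            long h = 0;
            for (int i = 0; i < 8; i++) h = (h << 8) | (d[i] & 0xff);
            return h;
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }
}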

Replication

Each data item is replicated at N hosts.

“Preference list”: the list of nodes that is responsible for storing a particular key.
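
A sketch of how a preference list can be derived from a ring like the one above: walk clockwise from the key’s position and collect N distinct physical nodes, skipping further virtual nodes of machines already chosen (the paper skips ring positions for the same reason, so one machine never holds two of the N replicas).

import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.TreeMap;

public class PreferenceList {
    // Walk the ring clockwise from the key's position and keep the first N
    // *distinct* physical nodes, skipping duplicate virtual nodes.
    public static List<String> forKey(TreeMap<Long, String> ring, long keyPosition, int n) {
        Set<String> nodes = new LinkedHashSet<>();
        for (String node : ring.tailMap(keyPosition).values()) {
            if (nodes.size() == n) break;
            nodes.add(node);
        }
        for (String node : ring.headMap(keyPosition).values()) { // wrap around
            if (nodes.size() == n) break;
            nodes.add(node);
        }
        return new ArrayList<>(nodes);
    }
}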

Data Versioning

A put() call may return to its caller before the update has been applied at all the replicas.

A get() call may return many versions of the same object.

Challenge: an object can have distinct version sub-histories, which the system will need to reconcile in the future.

Solution: use vector clocks in order to capture causality between different versions of the same object.

Vector Clock

A vector clock is a list of (node, counter) pairs.

Every version of every object is associated with one vector clock.

If the counters on the first object’s clock are less than or equal to the corresponding counters in the second clock, then the first is an ancestor of the second and can be forgotten.

[Figure: vector clock example]
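
A minimal sketch of the ancestor test described above (the class shape is illustrative): a version can be forgotten exactly when every one of its counters is less than or equal to the corresponding counter in the other clock; if neither clock dominates the other, the versions conflict and must be reconciled during a read.

import java.util.HashMap;
import java.util.Map;

public class VectorClock {
    // (node, counter) pairs: how many updates each node has coordinated.
    private final Map<String, Long> counters = new HashMap<>();

    // Called by the node coordinating a write.
    public void increment(String node) {
        counters.merge(node, 1L, Long::sum);
    }

    // True if this version is an ancestor of `other` and can be forgotten:
    // every counter here is <= the corresponding counter in `other`.
    public boolean isAncestorOf(VectorClock other) {
        for (Map.Entry<String, Long> e : counters.entrySet()) {
            if (other.counters.getOrDefault(e.getKey(), 0L) < e.getValue()) {
                return false;
            }
        }
        return true;
    }

    // Neither clock dominates: the versions are in conflict and the
    // application must reconcile them during a read.
    public boolean conflictsWith(VectorClock other) {
        return !this.isAncestorOf(other) && !other.isAncestorOf(this);
    }
}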

Execution of get() and put() operations

Two strategies for a client to select a node:

1. Route its request through a generic load balancer that will select a node based on load information.

2. Use a partition-aware client library that routes requests directly to the appropriate coordinator nodes (see the sketch below).
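
Strategy 2 can look like the following sketch, assuming a locally cached view of the ring; the RingView interface and all names here are illustrative. The point is that the client skips the load-balancer hop by sending straight to the first healthy node in the key’s preference list.

import java.util.List;

public class PartitionAwareClient {
    // A client-side, periodically refreshed view of the partition map.
    private final RingView ring;

    public PartitionAwareClient(RingView ring) {
        this.ring = ring;
    }

    // Route directly to the coordinator: the first reachable node
    // in the key's preference list (no extra load-balancer hop).
    public String coordinatorFor(byte[] key) {
        for (String node : ring.preferenceList(key)) {
            if (ring.isHealthy(node)) return node;
        }
        throw new IllegalStateException("no reachable replica for key");
    }

    // Minimal view of cluster state a partition-aware client needs.
    public interface RingView {
        List<String> preferenceList(byte[] key);
        boolean isHealthy(String node);
    }
}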

Sloppy Quorum

R/W is the minimum number of nodes that must participate in a successful read/write operation.

Setting R + W > N yields a quorum-like system.

In this model, the latency of a get (or put) operation is dictated by the slowest of the R (or W) replicas. For this reason, R and W are usually configured to be less than N, to provide better latency.
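
In numbers: the configuration the paper reports as common across Dynamo instances is (N, R, W) = (3, 2, 2), which satisfies R + W > N, so every read quorum overlaps every write quorum in at least one replica. A small sketch of the invariant:

public class QuorumConfig {
    final int n, r, w;

    QuorumConfig(int n, int r, int w) {
        // R + W > N guarantees that a read quorum and a write quorum
        // always share at least one replica (quorum-like behavior).
        if (r + w <= n) {
            throw new IllegalArgumentException("need R + W > N, got " + r + "+" + w + " <= " + n);
        }
        this.n = n; this.r = r; this.w = w;
    }

    public static void main(String[] args) {
        QuorumConfig common = new QuorumConfig(3, 2, 2); // common Dynamo configuration
        // The latency of an operation is set by the R-th (or W-th) fastest of
        // the N replicas, which is why R and W are kept below N.
        System.out.printf("N=%d, R=%d, W=%d: quorums overlap in >= %d replica(s)%n",
                common.n, common.r, common.w, common.r + common.w - common.n);
    }
}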

Hinted handoff

Assume N = 3. When A is temporarily down or unreachable during a write, send the replica to D.

D is given a hint that the replica belongs to A, and it will deliver the replica to A once A has recovered.

Again: “always writeable”.
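
A sketch of the hint store on D (all names are illustrative): writes meant for the unreachable node A are kept locally together with a hint naming A, and they are handed back once A is reachable again, after which D may delete its copies.

import java.util.ArrayDeque;
import java.util.HashMap;
import java.util.Map;
import java.util.Queue;
import java.util.function.Consumer;

public class HintedHandoffStore {
    // Replicas held on this node on behalf of unreachable nodes,
    // keyed by the intended recipient named in the hint.
    private final Map<String, Queue<byte[]>> hinted = new HashMap<>();

    // Accept a write whose real home (e.g. node A) is down.
    public void storeWithHint(String intendedNode, byte[] replica) {
        hinted.computeIfAbsent(intendedNode, k -> new ArrayDeque<>()).add(replica);
    }

    // Once the intended node is detected as recovered, deliver its replicas
    // back and drop the local copies.
    public void handOffTo(String recoveredNode, Consumer<byte[]> send) {
        Queue<byte[]> q = hinted.remove(recoveredNode);
        if (q != null) {
            q.forEach(send);
        }
    }
}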

Other techniques

Replica synchronization: Merkle hash trees (see the sketch after this list).

Membership and failure detection: gossip.
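
A minimal sketch of the anti-entropy primitive (the construction details here are assumptions, not the paper’s exact scheme): each replica builds a hash tree over the values in a key range; equal root hashes prove the replicas agree with no data exchanged, and on a mismatch the comparison descends into children so that only divergent key ranges are synchronized.

import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;
import java.util.List;

public class MerkleNode {
    final byte[] hash;
    final MerkleNode left, right; // both null for a leaf

    MerkleNode(byte[] hash, MerkleNode left, MerkleNode right) {
        this.hash = hash; this.left = left; this.right = right;
    }

    // Build a tree over the (non-empty) list of per-key value hashes
    // covering one key range.
    static MerkleNode build(List<byte[]> leafHashes) {
        if (leafHashes.size() == 1) return new MerkleNode(leafHashes.get(0), null, null);
        int mid = leafHashes.size() / 2;
        MerkleNode l = build(leafHashes.subList(0, mid));
        MerkleNode r = build(leafHashes.subList(mid, leafHashes.size()));
        return new MerkleNode(combine(l.hash, r.hash), l, r);
    }

    // Equal roots mean the whole key range is in sync: nothing to transfer.
    // On a mismatch, recurse into children to localize the divergent keys.
    static boolean inSync(MerkleNode a, MerkleNode b) {
        return Arrays.equals(a.hash, b.hash);
    }

    private static byte[] combine(byte[] a, byte[] b) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            md.update(a);
            md.update(b);
            return md.digest();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }
}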

Implementation

Written in Java.

The local persistence component allows for different storage engines to be plugged in:

Berkeley Database (BDB) Transactional Data Store: objects of tens of kilobytes.

MySQL: objects larger than tens of kilobytes.

BDB Java Edition, etc.
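
The pluggability described above suggests a narrow storage interface behind the rest of the node, so the engine can be chosen per application by its object-size and access profile. The interface below is an illustration of that idea, not the paper’s actual API.

// Illustrative local-persistence interface; the engine behind it can be
// BDB Transactional Data Store, BDB Java Edition, or MySQL, chosen per
// application by its object-size and access profile.
public interface LocalStore {
    byte[] get(byte[] key);            // null if the key is absent
    void put(byte[] key, byte[] value);
}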

Evaluation