Autoplacer: Scalable Self-Tuning Data Placement in ......Autoplacer: Scalable Self-Tuning Data...

Autoplacer: Scalable Self-Tuning DataPlacement in Distributed Key-value Stores

ICAC’13

Joao Paiva, Pedro Ruivo, Paolo Romano, Luıs Rodrigues

Instituto Superior Tecnico / Inesc-ID, Lisboa, Portugal

June 27, 2013

Outline

Introduction

Our approach

Evaluation

Conclusions

Motivation

Collocating processing with storage can improve performance.

I Using random placement, nodes waste resources due tonode-intercommunication.

I Optimize data placement to improve locality and to reduceremote requests.

Motivation

Approaches Using Offline Optimization

Algorithm:

1. Gather access trace for all items

2. Run offline optimization algorithms on traces

3. Store solution in directory

4. Locate data items by querying directory

I Fine-grained placement

I Costly to log all accesses

I Complex optimization

I Directory creates additional network usage

Approaches Using Offline Optimization

Algorithm:

1. Gather access trace for all items

2. Run offline optimization algorithms on traces

3. Store solution in directory

4. Locate data items by querying directory

I Fine-grained placement

I Costly to log all accesses

I Complex optimization

I Directory creates additional network usage

Main challenges

Cause: Key-Value stores may handle large amounts of data

Challenges:

1. Collecting Statistics: Obtaining usage statistics in anefficient manner.

2. Optimization: Deriving fine-grained placement for dataobjects that exploits data locality.

3. Fast lookup: Preserving fast lookup for data items.

Approaches to Data Access Locality

1. Consistent Hashing (CH):The “don’t care” approach

2. Distributed Directories:The “care too much” approach

Consistent Hashing

Don’t care for locality: items placed deterministically according tohash functions and full membership information.

I Simple to implement

I Solves lookup challenge by using local lookups

I No control on data placement → bad locality

I Does not address optimization challenge

Consistent Hashing

Don’t care for locality: items placed deterministically according tohash functions and full membership information.

I Simple to implement

I Solves lookup challenge by using local lookups

I No control on data placement → bad locality

I Does not address optimization challenge

Distributed Directories

Care too much for locality: nodes report usage statistics tocentralized optimizer, placement defined in a distributed directory(may be cached locally)

I Can solve statistics challenge using coarse statistics

I Solves optimization challenge with precise data placementcontrol

Hindered by lookup challenge:

I Additional network hop

I Hard to update

Distributed Directories

Care too much for locality: nodes report usage statistics tocentralized optimizer, placement defined in a distributed directory(may be cached locally)

I Can solve statistics challenge using coarse statistics

I Solves optimization challenge with precise data placementcontrol

Hindered by lookup challenge:

I Additional network hop

I Hard to update

Outline

Introduction

Our approach

Evaluation

Conclusions

Our approach: beating the challenges

Best of both worlds

I Statistics Challenge: Gather statistics only for hotspot items

I Optimization Challenge: Fine-grained optimization forhotspots

I Lookup Challenge: Consistent Hashing for remaining items

Algorithm overview

Online, round-based approach:

1. Statistics: Monitor data access to collect hotspots

2. Optimization: Decide placement for hotspots

3. Lookup: Encode / broadcast data placement

4. Move data

Algorithm overview

4. Move data

Statistics: Data access monitoring

Key concept: Top-K stream analysis algorithm

I Lightweight

I Sub-linear space usage

I Inaccurate result... But with bounded error

I Lightweight

Algorithm overview

4. Move data

Optimization

Integer Linear Programming problem formulation:

min∑j∈N

∑i∈O

X ij(crr rij + crwwij) + Xij(cl

r rij + clwwij) (1)

subject to:

∀i ∈ O :∑j∈N

Xij = d ∧ ∀j ∈ N :∑i∈O

Xij ≤ Sj

Inaccurate input:

I Does not provide optimal placement

I Upper-bound on error

Accelerating optimization

1. ILP Relaxed to Linear Programming problem

2. Distributed optimization

LP relaxation

I Allow data item ownership to be in [0− 1] interval

Distributed Optimization

I Partition by the N nodes

I Each node optimizes hotspots mapped to it by CH

I Strengthen capacity constraint

LP relaxation

Algorithm overview

4. Move data

Lookup: Encoding placement

Probabilistic Associative Array (PAA)

I Associative array interface (keys→values)

I Probabilistic and space-efficient

I Trade-off space usage for accuracy

Probabilistic Associative Array: Usage

Building

1. Build PAA from hotspot mappings

2. Broadcast PAA

Looking up objects

I If item not in PAA, use Consistent Hashing

I If item is hotspot, return PAA mapping

Probabilistic Associative Array: Usage

Building

1. Build PAA from hotspot mappings

2. Broadcast PAA

Looking up objects

I If item not in PAA, use Consistent Hashing

I If item is hotspot, return PAA mapping

PAA: Building blocks

I Bloom FilterSpace-efficient membership test (is item in PAA?)

I Decision tree classifierSpace-efficient mapping (where is hotspot mapped to?)

PAA: Building blocks

I Bloom FilterSpace-efficient membership test (is item in PAA?)

I Decision tree classifierSpace-efficient mapping (where is hotspot mapped to?)

PAA: Properties

Bloom Filter:

I False Positives: match items that it was not supposed to.

I No False Negatives: never return ⊥ for items in PAA.

Decision tree classifier:

I Inaccurate values (bounded error).

I Deterministic response: deterministic (item→node)mapping.

PAA: Properties

Bloom Filter:

PAA: Properties

Bloom Filter:

Algorithm Review

1. Statistics: Monitor data access to collect hotspotsTop-k stream analysis

2. Optimization: Decide placement for hotspotsLightweight distributed optimization

3. Lookup: Encode / broadcast data placementProbabilistic Associative Array

4. Move data

Algorithm Review

4. Move data

Algorithm Review

4. Move data

Algorithm Review

4. Move data

Outline

Introduction

Our approach

Evaluation

Conclusions

Experimental settings

I Integrated in Distributed Key-Value store (JBoss Infinispan)

I 40 Virtual Machines (10 physical machines)

I Gigabit network

Modified TPC-C benchmark

Induce controllable locality:

I Probability p: Nodes access data associated with a givenwarehouse.

I Probability 1− p: Nodes access data associated a randomwarehouse.

Remote operations

0 5 10 15 20 25 30

Time (minutes)

100% locality90% locality50% locality

0% localitybaseline

Throughput

0 5 10 15 20 25 30

Time (minutes)

100% locality90% locality50% locality

0% localitybaseline

Directory effects

100% Locality 90% Locality 0% Locality

nsaction p

second (

)Autoplacer

DirectoryBaseline

Outline

Introduction

Our approach

Evaluation

Conclusions

I Gather statistics only for hotspots

I Fine-grained hotspot placement

I Retain Local lookups using PAA

I Effective locality improvement

I Good network usage

I Considerable performance improvements

Conclusions

I Gather statistics only for hotspots

I Fine-grained hotspot placement

I Retain Local lookups using PAA

I Effective locality improvement

I Good network usage

I Considerable performance improvements

Thank you

Autoplacer: Scalable Self-Tuning Data Placement in ......Autoplacer: Scalable Self-Tuning Data...

Documents

Transcript of Autoplacer: Scalable Self-Tuning Data Placement in ......Autoplacer: Scalable Self-Tuning Data...

Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.

INDUSTRY SOLUTION PACKAGE SORTATION AND PLACEMENT · Implementation – Sortation and Placement . Highlights • NJ501 is a scalable machine controller that can handle multiple functionalities

Fuji Scalable Placement Platform Machine Specifications · Fuji Scalable Placement Platform Machine Specifications Model: NXT Preliminary. ... machine during operation, and as individual

Intel® VTune™ Amplifier Tuning Guide for the Intel® Xeon ......Intel® VTune Amplifier Tuning uide for the Intel® Xeon® Processor Scalable amily, 1st en 3 The pipeline slot is

Scalable Tuning of Building Models to Hourly Dataweb.eecs.utk.edu/~jnew1/publications/2015_Energy_Hourly... · 2015. 4. 27. · Scalable Tuning of Building Models to Hourly Data Aaron

Alive Tuning TDV6 EGR Bypass Kit - Select Divisionalivetuning.com/wp-content/uploads/2014/02/TDV6_EGR_Kit.pdf · Alive Tuning TDV6 EGR Bypass Kit This document is scalable, so you

Columbia Application Performance Tuning Case Studies · techniques enabled approximately 2- to 20-fold improvements in application performance. Key words: Code tuning, process-placement,

Algorithms for Independent Task Placement and Their Tuning in Demand-Driven Ray Tracing

Power System Stabilizer Placement and Tuning

Scalable Performance Measurement and Analysistgamblin/pubs/dissertation.pdf · Scalable Performance Measurement and Analysis Todd Gamblin ... Tuning the performance of large-scale

SWORD: Scalable Workload-Aware Data Placement for ...amol/papers/edbt13.pdf · SWORD: Scalable Workload-Aware Data Placement for Transactional Workloads Abdul Quamar University of

Power System Stabilizer Placement and Tuning Methods for Inter-area Oscillation Damping

Towards Dynamic Data Placement for Polystore Ingestionpeople.csail.mit.edu/tatbul/publications/sstore_birte17.pdfmust be collected, stored, and analyzed in a reliable and scalable

Config and e-O Tuning Assessment Implement Integration ... · Risk Reduction with existing Recommendations tools Thresholds and Tuning Sensor Placement Assessment Security Analysis

A Scalable Auto-tuning Framework for Compiler Optimizationhollings/papers/ipdps09.pdf · produced a unique and powerful framework for auto-tuning compiler-generated code that explores

A Scalable Cross-Platform Infrastructure for Application Performance … · · 2000-08-22A Scalable Cross-Platform Infrastructure for Application Performance Tuning Using Hardware

CRUSH: Controlled, Scalable, Decentralized Placement of ...tom.nos-eastchina1.126.net/weil-crush-sc06.pdf · CRUSH: Controlled, Scalable, Decentralized Placement of Replicated Data

A Compiler for Scalable Placement and Routing of Brain ...

Text miner’s little helper: scalable self-tuning ... DI CORSO_presentation.pdf · Data analytics Large volumes of heterogeneous data are being collected in •social networks •scientific

STAR: Self-Tuning Aggregation for Scalable Monitoringyzhang/papers/star-tr07.pdf · Scalable system monitoring is a fundamental abstraction for large-scale networked systems. It serves