Bigtable: A Distributed Storage System for Structured Data
Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach,
Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber
November 2006, OSDI '06: Proceedings of the 7th Symposium on Operating Systems Design and Implementation. Publisher: USENIX Association, Feb 18, 2009
OUTLINE
• INTRODUCTION
• DATA MODEL
• IMPLEMENTATION
• REFINEMENTS
• PERFORMANCE
• CONCLUSIONS
INTRODUCTION
• A distributed storage system for managing structured data that is designed to scale to very large size
• Achieved several goals: wide applicability, scalability, high performance, and high availability
• Resembles a database, but does not support a full relational data model
DATA MODEL
• A Bigtable is a sparse, distributed, persistent multi-dimensional sorted map
• The map is indexed by a row key, column key, and a timestamp; each value in the map is an uninterpreted array of bytes
(row:string, column:string, time:int64) → string
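This indexing scheme can be sketched as an ordinary map keyed by the (row, column, timestamp) triple. The names below are illustrative only, not any real Bigtable API:

```python
# Minimal sketch of Bigtable's data model: a map from
# (row, column, timestamp) to an uninterpreted byte string.
table: dict[tuple[str, str, int], bytes] = {}

def put(row: str, column: str, timestamp: int, value: bytes) -> None:
    table[(row, column, timestamp)] = value

def get(row: str, column: str, timestamp: int) -> bytes:
    return table[(row, column, timestamp)]

# Example keys mirror the paper's Webtable slice: reversed URL rows,
# a contents: column, and anchor:<referrer> columns.
put("com.cnn.www", "contents:", 3, b"<html>...")
put("com.cnn.www", "anchor:cnnsi.com", 9, b"CNN")
assert get("com.cnn.www", "anchor:cnnsi.com", 9) == b"CNN"
```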
DATA MODEL
A slice of an example table that stores Web pages
Webtable
DATA MODEL
Rows
• The row keys are arbitrary strings (up to 64KB in size)
• Every read or write of data under a single row key is atomic
• The row range for a table is dynamically partitioned
• Each row range is called a tablet, which is the unit of distribution and load balancing
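The dynamic partitioning of the row range can be pictured as a sorted list of split keys, with a binary search locating the tablet that covers a given row. The split keys below are made up for illustration:

```python
import bisect

# Hypothetical sketch: a table's row space is split into tablets at
# sorted split keys; tablet i holds rows in [split_keys[i-1], split_keys[i]).
split_keys = ["com.cnn.www", "com.google.www", "org.apache.www"]

def tablet_index(row_key: str) -> int:
    # Binary search for the tablet covering row_key.
    return bisect.bisect_right(split_keys, row_key)

assert tablet_index("com.aaa") == 0
assert tablet_index("com.cnn.www/index.html") == 1
```

Because rows are kept in lexicographic order, clients can choose row keys (such as reversed URLs) so that related rows land in the same or adjacent tablets.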
DATA MODEL
Column Families
• Column keys are grouped into sets called column families, which form the basic unit of access control
• A column family must be created before data can be stored
• A column key is named using the syntax family:qualifier
• Access control and both disk and memory accounting are performed at the column-family level
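The family:qualifier naming and the create-family-first rule can be sketched as follows; the family set and helper names here are illustrative, not a real API:

```python
# Column keys use the syntax family:qualifier; a write is rejected
# unless the family was created beforehand (hypothetical check).

def split_column_key(column_key: str) -> tuple[str, str]:
    family, _, qualifier = column_key.partition(":")
    return family, qualifier

created_families = {"contents", "anchor"}  # assumed pre-created families

def check_write(column_key: str) -> None:
    family, _ = split_column_key(column_key)
    if family not in created_families:
        raise KeyError(f"column family {family!r} must be created first")

assert split_column_key("anchor:cnnsi.com") == ("anchor", "cnnsi.com")
check_write("anchor:cnnsi.com")  # allowed: family exists
```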
DATA MODEL
Timestamps
• Each cell in a Bigtable can contain multiple versions of the same data
• The timestamps can be assigned by Bigtable or client applications
• Supports two per-column-family settings to garbage-collect cell versions automatically: keep only the last n versions, or keep only new-enough versions
IMPLEMENTATION
• The Bigtable implementation has three major components:
• a library that is linked into every client
• one master server
• and many tablet servers
• Tablet servers can be dynamically added (or removed) from a cluster
IMPLEMENTATION
Tablet Location
Tablet location hierarchy
A three-level hierarchy analogous to that of a B+-tree is used to store tablet-location information
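The three levels can be sketched with plain dictionaries; all server names and row keys below are invented for illustration:

```python
import bisect

# Illustrative sketch of the tablet-location hierarchy: a Chubby file
# points at the root tablet, the root tablet indexes the METADATA
# tablets, and METADATA tablets index the user tablets.

chubby_file = "root-tablet@server-a"  # level 1: location of the root tablet

root_tablet = {  # level 2: end row of each METADATA tablet -> its location
    "meta-row-m": "metadata-tablet-1@server-b",
    "meta-row-z": "metadata-tablet-2@server-c",
}

def locate(index: dict[str, str], key: str) -> str:
    # A tablet covers all rows up to and including its end row, so the
    # first end row >= key identifies the covering tablet.
    ends = sorted(index)
    return index[ends[bisect.bisect_left(ends, key)]]

# Level 3 repeats the same search inside the chosen METADATA tablet
# to find the user tablet's server.
assert locate(root_tablet, "meta-row-a") == "metadata-tablet-1@server-b"
```

Clients cache tablet locations, so most lookups skip this hierarchy entirely and go straight to the tablet server.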
Tablet Assignment
• Each tablet is assigned to one tablet server at a time and tracked by the master
• The master is responsible for detecting when a tablet server is no longer serving
• When a master is started, it needs to discover the current tablet assignments before it can change them
IMPLEMENTATION
Tablet Serving
Tablet Representation
Compactions
• Minor compaction process
• Shrinks the memory usage
• Reduces the amount of data that has to be read from the log during recovery
• Merging compaction
• A merging compaction that rewrites all SSTables into exactly one SSTable is called a major compaction
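The compaction kinds can be sketched on the memtable/SSTable structure above; this is an illustrative model, not the real on-disk format:

```python
# Sketch of compactions: a minor compaction freezes the memtable into
# a new SSTable; a major compaction rewrites all SSTables into
# exactly one (a merging compaction over every layer).

memtable: dict[str, bytes] = {"b": b"2"}
sstables: list[dict[str, bytes]] = [{"a": b"1"}, {"a": b"0", "c": b"3"}]  # newest first

def minor_compaction() -> None:
    global memtable
    sstables.insert(0, dict(memtable))  # memtable becomes a new SSTable
    memtable = {}

def major_compaction() -> None:
    merged: dict[str, bytes] = {}
    for sst in reversed(sstables):  # apply oldest first so newer values win
        merged.update(sst)
    sstables[:] = [merged]

minor_compaction()
major_compaction()
assert sstables == [{"a": b"1", "b": b"2", "c": b"3"}]
```

Because SSTables are immutable, a compaction simply writes new files and then drops the old ones, which also bounds the number of layers a read must consult.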
REFINEMENTS
• Locality groups
• Compression
• Caching for read performance
• Bloom filters
• Commit-log implementation
• Speeding up tablet recovery
• Exploiting immutability
PERFORMANCE
• A Bigtable cluster with N tablet servers
• The tablet servers were configured to use 1 GB of memory and to write to a GFS cell consisting of 1786 machines with two 400 GB IDE hard drives each
• N client machines generated the Bigtable load used for these tests
PERFORMANCE
• Each machine had two dual-core Opteron 2 GHz chips, enough physical memory to hold the working set of all running processes, and a single Ethernet link
• The tablet servers and master, test clients, and GFS servers all ran on the same set of machines
PERFORMANCE
• R is the number of distinct Bigtable row keys involved in the test
• R was chosen so that each benchmark read or wrote approximately 1 GB of data per tablet server
PERFORMANCE
Table: number of 1000-byte values read/written per second, reported both as the rate per tablet server and as the aggregate rate
CONCLUSIONS
• Why always use a relational database?
• The fact that many Google products successfully use Bigtable demonstrates that the design works well in practice
• There are significant advantages to building our own storage solution