Data sharding

Michał Gruchała michal@gruchala.info

WebClusters 2011

● Background
● Theory
● Practice
● Summary

Background

Microblogging site
● user messages (blog)
● cockpit/wall

Classic architecture
● database
● web server(s)
● load balancer(s)

Background

Web servers, load balancers
● one server
● ...
● 1000 servers
● not a problem

Database
● one database
● two databases (master -> slave)
● two databases (master <-> master)
● n databases (slave(s) <- master <-> master -> slave(s))

a lot of replication ;)

Background

Replication
● increases read performance (raid1)
● increases data safety (raid1)
● does not increase the system's capacity (GBs)

Background

Scalability

● stateless elements scale well
● stateful elements
  ○ quite easy to scale
    ■ if we want more reads (cache, replication)
  ○ hard to scale
    ■ if we want more writes
    ■ if we want more capacity

Background

Sharding ;)

one database: A B C D E F G H I J K L

three shards: A B C D | E F G H | I J K L

Theory

Scaling
● Scale Back
  ○ delete, archive unused data
● Scale Up (vertical)
  ○ more power, more disks
● Scale Out (horizontal)
  ○ add machines
    ■ functional partitioning
    ■ replication
    ■ sharding

Theory

Sharding
● split one big database into many smaller databases
  ○ spread rows
  ○ spread them across many servers
● shared-nothing partitioning
● not replication

Theory

Sharding key

● shard by a key
● all data with that key will be on the same shard
● e.g. shard by user: all information connected to a user is on one shard (user info, messages, friends list)

user 1 -> shard 1
user 2 -> shard 2
user 3 -> shard 1
user 4 -> shard 2

● choosing the right key is very important!

Theory

Sharding function

● maps keys to shards
● where to find the data
● where to store the data

shard number = sf(key)

Theory

Sharding function

● Dynamic
  ○ Mapping in a database table
● Fixed
  ○ Modulo
    shard number = id % shards_count
  ○ Hash + Modulo
    shard number = md5(email) % shards_count
  ○ Consistent hashing
    http://en.wikipedia.org/wiki/Consistent_hashing
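The dynamic and the two simplest fixed sharding functions can be sketched in a few lines of Python. This is only an illustration: `SHARDS_COUNT`, the function names, the e-mail addresses and the in-memory `shard_map` (standing in for the mapping table) are assumptions, not part of the talk.

```python
import hashlib

SHARDS_COUNT = 4  # assumed number of shards for this illustration


def shard_by_modulo(user_id: int, shards_count: int = SHARDS_COUNT) -> int:
    """Fixed function: numeric id modulo the number of shards."""
    return user_id % shards_count


def shard_by_hash(email: str, shards_count: int = SHARDS_COUNT) -> int:
    """Fixed function: hash a non-numeric key, then take the modulo."""
    digest = hashlib.md5(email.encode("utf-8")).hexdigest()
    return int(digest, 16) % shards_count


# Dynamic function: the mapping lives in a lookup table. A dict stands in
# for the database table here; moving a key means updating one entry.
shard_map = {"alice@example.com": 0, "bob@example.com": 3}


def shard_by_lookup(email: str) -> int:
    return shard_map[email]


print(shard_by_modulo(7))                  # 7 % 4 = 3
print(shard_by_hash("alice@example.com"))  # a value in 0..3, the same on every run
print(shard_by_lookup("bob@example.com"))  # 3
```

MD5 is used here only because the slide names it. Python's built-in `hash()` would not work for this purpose, since string hashes are randomized per process and the key-to-shard mapping has to be stable.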

Theory

Advantages

● Linear write/read performance scalability (raid0)
● Capacity increase (raid0)
● Smaller databases are easier to manage
  ○ alter
  ○ backup/restore
  ○ truncate ;)
● Smaller databases are faster
  ○ as they may fit into memory
● Cost effective
  ○ 80 cores, 20 HDs, 80 GB RAM vs
  ○ 10 x (8 cores, 2 HDs, 8 GB RAM)

Theory

Challenges

● Globally unique IDs
  ○ unique across all shards
    ■ auto_increment_increment, auto_increment_offset
    ■ global IDs table
  ○ not unique across shards
    ■ IDs in dbs - not unique
    ■ shard_number - unique
    ■ global unique ID = shard_number + db ID
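Two of these ID schemes are easy to sketch. The first part mimics what MySQL's auto_increment_offset and auto_increment_increment settings do on each server; the second packs a shard number and a per-shard ID into one integer. The function names and the 48-bit width (`LOCAL_ID_BITS`) are assumptions made for the example.

```python
from itertools import count
from typing import Tuple


def interleaved_ids(offset: int, increment: int):
    """Yield offset, offset + increment, ... like one server configured with
    auto_increment_offset=offset and auto_increment_increment=increment."""
    return count(start=offset, step=increment)


# Two shards: increment = 2, offsets 1 and 2, so the sequences never collide.
shard_a = interleaved_ids(offset=1, increment=2)
shard_b = interleaved_ids(offset=2, increment=2)
print([next(shard_a) for _ in range(3)])  # [1, 3, 5]
print([next(shard_b) for _ in range(3)])  # [2, 4, 6]

LOCAL_ID_BITS = 48  # assumed width reserved for the per-shard id


def make_global_id(shard_number: int, local_id: int) -> int:
    """Combine a unique shard number with an id that is unique only locally."""
    if not 0 <= local_id < (1 << LOCAL_ID_BITS):
        raise ValueError("local_id does not fit in LOCAL_ID_BITS")
    return (shard_number << LOCAL_ID_BITS) | local_id


def split_global_id(global_id: int) -> Tuple[int, int]:
    """Recover (shard_number, local_id) from a global id."""
    return global_id >> LOCAL_ID_BITS, global_id & ((1 << LOCAL_ID_BITS) - 1)


gid = make_global_id(shard_number=3, local_id=42)
print(split_global_id(gid))  # (3, 42)
```

A side effect of the second scheme is that the global ID already tells the application which shard to ask.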

Challenges

Re-sharding

● consistent hashing or
● more shards than machines/nodes (e.g. 100 shards on 10 machines)

9 shards on 3 machines: 1,4,7 | 2,5,8 | 3,6,9

9 shards on 5 machines: 1,6 | 2,7 | 3,8 | 4,9 | 5
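The "more shards than machines" idea can be sketched as a second, small mapping from shard to machine. The round-robin rule below is an assumption chosen because it reproduces the 3-machine and 5-machine layouts shown above; `place_shards` and `machine_of` are hypothetical helper names.

```python
from collections import defaultdict


def place_shards(shard_ids, machine_count):
    """Round-robin placement: shard s lives on machine (s - 1) % machine_count."""
    placement = defaultdict(list)
    for s in shard_ids:
        placement[(s - 1) % machine_count].append(s)
    return dict(placement)


def machine_of(placement):
    """Invert {machine: [shards]} into {shard: machine}."""
    return {s: m for m, shards_on_m in placement.items() for s in shards_on_m}


shards = range(1, 10)             # 9 logical shards, numbered 1..9
before = place_shards(shards, 3)  # {0: [1, 4, 7], 1: [2, 5, 8], 2: [3, 6, 9]}
after = place_shards(shards, 5)   # {0: [1, 6], 1: [2, 7], 2: [3, 8], 3: [4, 9], 4: [5]}

old, new = machine_of(before), machine_of(after)
moved = [s for s in shards if old[s] != new[s]]
print(moved)  # [4, 5, 6, 7, 8, 9]
```

The sharding function itself (key -> shard) never changes, so no row is re-hashed; whole shards are copied to another machine. With this particular placement rule six of the nine shards move, and a placement that relocates only as many shards as the new machines need would move fewer.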

Challenges

Cross-shard

● queries
  ○ sent to many shards
  ○ collect the results into one
  ○ avoidable (better sharding key, more sharding keys)
● joins
  ○ send query to many shards
  ○ join results in an application
  ○ sometimes unavoidable
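A cross-shard query usually ends up as scatter-gather code in the application. The sketch below assumes three in-memory lists in place of real databases and invented timestamps; `query_shard` and `latest_messages` are hypothetical names.

```python
import heapq
from concurrent.futures import ThreadPoolExecutor
from itertools import islice

# Each shard is modelled as a list of (timestamp, message) rows, newest first.
# In a real system each list would be a query against a separate database.
SHARDS = [
    [(105, "M5"), (101, "M1")],
    [(104, "M4"), (102, "M2")],
    [(103, "M3")],
]


def query_shard(rows, limit):
    """Stand-in for 'SELECT ... ORDER BY ts DESC LIMIT limit' on one shard."""
    return rows[:limit]


def latest_messages(limit):
    """Scatter the query to every shard, then merge the partial results."""
    with ThreadPoolExecutor(max_workers=len(SHARDS)) as pool:
        partials = list(pool.map(lambda rows: query_shard(rows, limit), SHARDS))
    # Every partial result is already sorted newest first, so a k-way merge
    # is enough; only the first `limit` rows of the merged stream are kept.
    merged = heapq.merge(*partials, key=lambda row: row[0], reverse=True)
    return [message for _, message in islice(merged, limit)]


print(latest_messages(3))  # ['M5', 'M4', 'M3']
```

Each shard has to return up to `limit` rows, because the application cannot know in advance which shard holds the newest ones.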

Challenges

Network

● more machines, more smaller streams
● full-mesh between webservers and shards
● pconnect vs. connect

Complexity

● usually sharding is done in application logic

Practice

Microblogging site
● see a user's messages
● see stream/wall

Classic architecture
● database
● web server(s)
● load balancer(s)

Practice

User
id  login
1   John
3   Andy
4   Claire
5   Megan

Message
id  owner  message
1   2      M1
2   1      M2
3   2      M3
4   3      M4
5   2      M5

Follow
who  whose

John's messages?
John's follows?

Practice

User
● no need for sharding

Message
● sharded by user (owner field)
● shard_number = owner % 2

Follow
● sharded by user (who field)
● shard_number = who % 2

2 shards, 3 machines

shard0: Message, Follow
shard1: Message, Follow

Practice

User (not sharded)
id  login
1   John
3   Andy
4   Claire
5   Megan

shard0

Message
id  owner  message
1   2      M1
3   2      M3
5   2      M5

Follow
who  whose

shard1

Message
id  owner  message
2   1      M2
4   3      M4

Follow
who  whose

mapping?

Practice

Bob's blog

● Bob's messages
  ○ find Bob's id in User table (id = 2)
  ○ find Bob's shard (2 % 2 = 0, shard0)
  ○ fetch Messages (shard0) where owner = 2
● People Bob follows
  ○ find Bob's id in User table (id = 2)
  ○ find Bob's shard (2 % 2 = 0, shard0)
  ○ fetch whose ids from Follow table (shard0)
  ○ fetch people info from User table
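Both lookups touch exactly one shard, which a small in-memory model makes visible. The Message rows and Bob's id come from the slides; the Follow rows are hypothetical, because the slides list none, and all names (`USERS`, `SHARDS`, `messages_of`, `followed_by`) are made up for the sketch.

```python
SHARDS_COUNT = 2

# Unsharded User table. Bob's id (2) comes from the walkthrough above.
USERS = {1: "John", 2: "Bob", 3: "Andy", 4: "Claire", 5: "Megan"}

# Per-shard tables. Message rows are (id, owner, message); Follow rows are
# (who, whose) and are hypothetical, placed on shard who % 2.
SHARDS = {
    0: {"message": [(1, 2, "M1"), (3, 2, "M3"), (5, 2, "M5")],
        "follow": [(2, 1), (2, 3), (4, 3)]},
    1: {"message": [(2, 1, "M2"), (4, 3, "M4")],
        "follow": [(1, 3), (5, 2)]},
}


def user_id(login):
    """Step 1: find the user's id in the (unsharded) User table."""
    return next(uid for uid, name in USERS.items() if name == login)


def shard_for(uid):
    """Step 2: the sharding function picks the single shard to query."""
    return SHARDS[uid % SHARDS_COUNT]


def messages_of(login):
    uid = user_id(login)
    return [text for _, owner, text in shard_for(uid)["message"] if owner == uid]


def followed_by(login):
    uid = user_id(login)
    whose_ids = [whose for who, whose in shard_for(uid)["follow"] if who == uid]
    return [USERS[w] for w in whose_ids]  # last step: back to the User table


print(messages_of("Bob"))  # ['M1', 'M3', 'M5']
print(followed_by("Bob"))  # ['John', 'Andy']
```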

Practice

Who follows Andy?

● find Andy's id in User table (id = 3)
● find Andy's shard (3 % 2 = 1, shard1)
● hmmm

Practice

Cross-shard query!
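Why this needs every shard: Follow is sharded by who, so rows with whose = 3 can sit anywhere. A minimal sketch with hypothetical Follow rows (the slides list none) and made-up names:

```python
# Unsharded User table; Bob's id (2) is taken from the "Bob's blog" slide.
USERS = {1: "John", 2: "Bob", 3: "Andy", 4: "Claire", 5: "Megan"}

# Hypothetical (who, whose) rows, each stored on shard who % 2.
FOLLOW_BY_SHARD = {
    0: [(2, 1), (2, 3), (4, 3)],
    1: [(1, 3), (5, 2)],
}


def followers_of(login):
    """Ask every shard, because the sharding key (who) is not in the filter."""
    uid = next(u for u, name in USERS.items() if name == login)
    follower_ids = []
    for shard_number in sorted(FOLLOW_BY_SHARD):  # one query per shard
        rows = FOLLOW_BY_SHARD[shard_number]
        follower_ids += [who for who, whose in rows if whose == uid]
    return sorted(USERS[w] for w in follower_ids)


print(followers_of("Andy"))  # ['Bob', 'Claire', 'John']
```

Andy's own shard (shard1) holds only one of the three matching rows, so querying it alone would give a wrong answer.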

Practice

Ideas?
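One possible idea, in line with the "more sharding keys" hint from the cross-shard challenges: store every follow relation twice, once sharded by who and once sharded by whose. This is a sketch of that assumption, not necessarily the answer given in the talk; the table names and sample rows are hypothetical.

```python
SHARDS_COUNT = 2

# following[s]: Follow rows sharded by `who` (answers "whom does X follow?").
# followers[s]: the same rows sharded by `whose` (answers "who follows X?").
following = {s: [] for s in range(SHARDS_COUNT)}
followers = {s: [] for s in range(SHARDS_COUNT)}


def add_follow(who, whose):
    """Write the relation to both copies, possibly on two different shards."""
    following[who % SHARDS_COUNT].append((who, whose))
    followers[whose % SHARDS_COUNT].append((who, whose))


def followed_by(uid):
    return [whose for who, whose in following[uid % SHARDS_COUNT] if who == uid]


def followers_of(uid):
    return [who for who, whose in followers[uid % SHARDS_COUNT] if whose == uid]


for who, whose in [(2, 1), (2, 3), (4, 3), (1, 3), (5, 2)]:
    add_follow(who, whose)

print(followed_by(2))   # [1, 3]
print(followers_of(3))  # [2, 4, 1]
```

Both questions are now single-shard reads. The price is twice the storage and two writes per follow, and those two writes are not atomic when they land on different shards.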

Summary

To shard or not to shard

● many reads, few writes? - don't
● many writes and no capacity problems? - don't (use SSD)
● capacity problems? - yes
● many writes and capacity problems? - yes
● scale-up is affordable? - don't shard

As you see... it depends!

Summary

If you have to shard

● always use sharding + replication = raid10
  ○ sharding alone reduces availability (like raid0)
● more shards than you need
  ○ e.g. 4 machines, 100 shards
  ○ or dynamic allocation
● think of network capacity (full-mesh)
  ○ load sharding (google it ;))
● sharding key - important!
  ○ cross-shard queries
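The first point, sharding + replication, could look roughly like this in the routing layer: the sharding function picks a group, then writes go to that group's primary and reads rotate over its replicas. The host names, `ShardGroup` and `route` are hypothetical, and failover is left out of the sketch.

```python
import itertools


class ShardGroup:
    """One shard: a primary for writes plus replicas for reads."""

    def __init__(self, primary, replicas):
        self.primary = primary
        # Round-robin over replicas; fall back to the primary if there are none.
        self._read_cycle = itertools.cycle(replicas or [primary])

    def node_for_write(self):
        return self.primary

    def node_for_read(self):
        return next(self._read_cycle)


GROUPS = [
    ShardGroup("db0-primary", ["db0-replica1", "db0-replica2"]),
    ShardGroup("db1-primary", ["db1-replica1"]),
]


def route(user_id, write):
    """Pick the shard by modulo, then the node by operation type."""
    group = GROUPS[user_id % len(GROUPS)]
    return group.node_for_write() if write else group.node_for_read()


print(route(2, write=True))   # db0-primary
print(route(2, write=False))  # db0-replica1
```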

Wake Up!

Thanks

Questions?
