Triple R – Riak, Redis and RabbitMQ at XING


Triple R – Riak, Redis and RabbitMQ at XING
Dr. Stefan Kaes, Sebastian Röbke

NoSQL matters Cologne, April 27, 2013

ActivityStream Intro

3 Types of Feeds

‣ News Feed
‣ Me Feed
‣ Company Feed

Activity Creation

Activities enter the ActivityStream in two ways: directly through the REST API

POST /activitystream/activities

and as RabbitMQ messages published by the producing apps, for example:

‣ Events App: events.event.created, events.participation.changed, ...
‣ Groups App: groups.member.joined, groups.article.created, ...
‣ User App: users.contact.created, users.profile.updated, ...
‣ etc.
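The slides list only routing keys; as a minimal sketch, publishing such an event from a producing app might look like this with the Bunny gem (exchange name, broker URL and payload shape are all assumptions):

```ruby
require 'bunny'
require 'json'
require 'time'

# Connect to RabbitMQ and declare a durable topic exchange
# (exchange name and broker URL are assumptions).
conn = Bunny.new('amqp://localhost')
conn.start
channel  = conn.create_channel
exchange = channel.topic('events', durable: true)

# Publish a domain event; the routing key is taken from the slide,
# the payload shape is invented for illustration.
payload = JSON.generate(
  actor_id:   123,
  group_id:   456,
  created_at: Time.now.utc.iso8601
)
exchange.publish(payload, routing_key: 'groups.member.joined', persistent: true)

conn.close
```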

Old Approach

Every activity, comment and like was a row in the relational database:

INSERT INTO `activities` ...
INSERT INTO `comments` ...
INSERT INTO `likes` ...

several hundred million rows

Efficient Activity Creation

But ... slow reads

[Diagram: reading a feed in the old system]

To render a feed, ActivityStream joined activities, comments and likes from its own database and called the other apps on every single read:

‣ User App: GET contacts
‣ Groups App: GET group memberships
‣ Companies App: GET followed companies
‣ Settings: GET privacy settings
‣ etc.
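In other words, every read paid for a fan-out of service calls before the feed could even be assembled. A sketch of that read path (all hosts and paths are hypothetical stand-ins for the internal apps):

```ruby
require 'net/http'
require 'json'

# Hypothetical helper for the internal REST calls.
def fetch(path)
  JSON.parse(Net::HTTP.get(URI("http://internal.example.com#{path}")))
end

# Four service round trips, plus the database joins, on every read.
def news_feed_context(user_id)
  {
    contacts:  fetch("/users/#{user_id}/contacts"),           # User App
    groups:    fetch("/users/#{user_id}/group-memberships"),  # Groups App
    companies: fetch("/users/#{user_id}/followed-companies"), # Companies App
    settings:  fetch("/users/#{user_id}/privacy-settings")    # Settings
  }
end
```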

Pros:

‣ Activities immediately visible
‣ Consistency
‣ SQL databases are well understood

Cons:

‣ Database master is a single point of failure
‣ Sharding
‣ Unsatisfactory read performance

New Approach

Materialized Feeds

[Diagram: activity creation in the new system]

On create, ActivityStream fetches the data it needs (GET contacts from the User App, etc.), stores the activity, and writes materialized news feeds, me feeds and company feeds into activity storage, alongside likes and comments.
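A minimal in-memory sketch of the fan-out-on-write idea behind materialized feeds (all names are invented; in the real system the hashes below are Riak objects):

```ruby
# In-memory stand-ins; in the real system these are Riak objects.
ACTIVITIES = {}                                # activity id => document
FEEDS      = Hash.new { |h, k| h[k] = [] }     # feed key => newest-first ids

# Hypothetical recipient lookup; really calls to User App, Groups App, ...
def recipients_of(activity)
  activity.fetch(:recipient_ids, [])
end

# Fan-out on write: materialize the activity into every relevant feed
# at creation time, so a read becomes a single feed lookup.
def create_activity(activity)
  ACTIVITIES[activity[:id]] = activity
  FEEDS["mefeed:#{activity[:actor_id]}"].unshift(activity[:id]) # visible to me immediately
  recipients_of(activity).each do |user_id|
    FEEDS["newsfeed:#{user_id}"].unshift(activity[:id])         # appears within a short delay
  end
end

create_activity(id: 1, actor_id: 42, recipient_ids: [7, 9], verb: 'groups.member.joined')
FEEDS['newsfeed:7'] # => [1]
```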

Requirements

‣ Better read performance
‣ Activities created by me must be visible to myself immediately
‣ Activities created by others should appear within a reasonable time frame in my stream
‣ Storage layer must tolerate high read and write loads
‣ Storage layer must provide easy capacity scaling
‣ Low maintenance

Option 1: Do-it-yourself SQL database design

Option 2: Off-the-shelf NoSQL database

We chose Riak.

We tend to view it as a highly available distributed hash table

Eventual consistency/conflict resolution is the hard part

Bounded size feeds are easy: http://www.paperplanes.de/2011/12/15/storing-timelines-in-riak.html
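Following the linked post, a bounded feed can live in a single Riak object holding a newest-first list of activity ids that is truncated on every write. A sketch with the riak-client gem (bucket name, key scheme and the bound are assumptions; sibling resolution is ignored):

```ruby
require 'riak' # riak-client gem

MAX_FEED_SIZE = 500 # the bound is an assumption

# Prepend an activity id and truncate, so the object stays small.
def push_to_feed(feeds_bucket, user_id, activity_id)
  obj = feeds_bucket.get_or_new("newsfeed:#{user_id}")
  ids = obj.data.is_a?(Array) ? obj.data : []
  obj.data = [activity_id, *ids].first(MAX_FEED_SIZE)
  obj.store
end

client = Riak::Client.new # defaults to localhost
push_to_feed(client.bucket('feeds'), 42, 'activity:123')
```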

Unbounded feeds are much harder

Object Model

JSON documents:

‣ Activity: contains a 2P-Set
‣ Feed: a bounded list of chunk references; each reference carries the chunk sequence number, the youngest activity ref, the oldest activity ref, and the size of the referenced chunk
‣ FeedChunk: contains a 2P-Set
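The 2P-Set on these slides is a standard CRDT: elements accumulate in an add set and a remove set, a removal wins permanently, and replicas merge by set union. A minimal sketch:

```ruby
require 'set'

# Two-phase set: once removed, an element can never come back.
class TwoPhaseSet
  attr_reader :added, :removed

  def initialize(added = Set.new, removed = Set.new)
    @added   = added
    @removed = removed
  end

  def add(element)
    @added.add(element)
  end

  def remove(element)
    @removed.add(element) if @added.include?(element)
  end

  def member?(element)
    @added.include?(element) && !@removed.include?(element)
  end

  # Union merge is commutative, associative and idempotent, which is
  # exactly what conflict resolution between Riak siblings needs.
  def merge(other)
    TwoPhaseSet.new(@added | other.added, @removed | other.removed)
  end
end
```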

The Migration

Incremental rollout

Part 1: From old to new

Let’s start simple!

Replicating some data

[Diagram] For every write, the old ActivityStream publishes an event:

‣ activity.created / activity.deleted
‣ comment.created / comment.deleted
‣ like.created / like.deleted

Data migration processors consume these events and apply them to the new ActivityStream.
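A data migration processor could look roughly like this with the Bunny gem, shown here for the activity events only (queue and exchange names are invented, and NewActivityStream is a stub for the new system’s client):

```ruby
require 'bunny'
require 'json'

# Stand-in for the new system's client; really REST calls.
module NewActivityStream
  def self.create_activity(event)
    # POST /activitystream/activities with the event payload
  end

  def self.delete_activity(id)
    # DELETE /activitystream/activities/{id}
  end
end

conn    = Bunny.new.tap(&:start)
channel = conn.create_channel
queue   = channel.queue('activitystream.migration', durable: true)
queue.bind(channel.topic('events', durable: true), routing_key: 'activity.*')

# Replay every event from the old system against the new one.
queue.subscribe(manual_ack: true, block: true) do |delivery, _props, body|
  event = JSON.parse(body)
  case delivery.routing_key
  when 'activity.created' then NewActivityStream.create_activity(event)
  when 'activity.deleted' then NewActivityStream.delete_activity(event['id'])
  end
  channel.ack(delivery.delivery_tag)
end
```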

Measuring performance

[Diagram] The old ActivityStream also publishes an event for every feed view:

‣ newsfeed.viewed
‣ mefeed.viewed
‣ companyfeed.viewed

Shadow query processors replay those reads against the new ActivityStream (alongside the data migration processors keeping the data in sync), so read performance could be measured under production traffic before any user saw the new system.
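A shadow query processor follows the same consumer pattern but times a read instead of applying a write (again, every name below is invented):

```ruby
require 'bunny'
require 'json'

# Stand-in for a feed read against the new system.
def read_new_feed(feed_type, user_id)
  # GET the materialized feed from Riak ...
end

conn  = Bunny.new.tap(&:start)
ch    = conn.create_channel
queue = ch.queue('activitystream.shadow', durable: true)
queue.bind(ch.topic('events', durable: true), routing_key: '*.viewed')

# Re-run every production feed view against the new system and record
# the latency; the result is measured, never shown to users.
queue.subscribe(block: true) do |delivery, _props, body|
  user_id = JSON.parse(body)['user_id']
  started = Process.clock_gettime(Process::CLOCK_MONOTONIC)
  read_new_feed(delivery.routing_key, user_id)
  ms = (Process.clock_gettime(Process::CLOCK_MONOTONIC) - started) * 1000
  puts format('%s shadow read: %.1f ms', delivery.routing_key, ms)
end
```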

Part 2: From new to old

[Diagram] Beta users A, B and C read from and write to the new ActivityStream; every write is forwarded to the old system through its REST API:

POST /activitystream/activities
DELETE /activitystream/activities/{id}
POST /activitystream/activities/{activity_id}/comments
DELETE /activitystream/activities/{activity_id}/comments/{id}
PUT /activitystream/activities/{activity_id}/likes/{user_id}
DELETE /activitystream/activities/{activity_id}/likes/{user_id}
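Forwarding a beta user’s write back to the old system is then a plain REST call against the endpoints above; a sketch with Net::HTTP (host and payload shape are assumptions):

```ruby
require 'net/http'
require 'json'

# Old system's REST API (host is hypothetical).
OLD_API = URI('http://old-activitystream.example.com')

def forward_activity_created(activity)
  Net::HTTP.post(
    URI.join(OLD_API.to_s, '/activitystream/activities'),
    JSON.generate(activity),
    'Content-Type' => 'application/json'
  )
end

def forward_like_created(activity_id, user_id)
  Net::HTTP.start(OLD_API.host, OLD_API.port) do |http|
    http.put("/activitystream/activities/#{activity_id}/likes/#{user_id}", '')
  end
end

def forward_activity_deleted(activity_id)
  Net::HTTP.start(OLD_API.host, OLD_API.port) do |http|
    http.delete("/activitystream/activities/#{activity_id}")
  end
end
```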

Part 3: What about the old data?

Bulk Data Migration: Failed Version 1

1. Reset data in the new system
2. Query the old system’s REST API for the feeds
3. Store them in the new system
4. Switch to the new system

This was naive: the old system was way too slow to return the millions of feeds in their full length.

Bulk Data Migration: Failed Version 2

1. Reset data in the new system
2. Read all activities based on a dump of the old system
3. Publish “created” messages to RabbitMQ for each activity/comment/like
4. Let the new system build its data structures
5. Switch to the new system

This was naive: you can’t replay the history of 2.5 years this way.

Bulk Data Migration: Successful Version

1. Reset data in the new system
2. Obtain data dump from old system
3. Extract data from the dumps and compute a representation of the feeds in Redis with a massive amount of batch processors
4. Use this data to build up the structures in the new system
5. Switch to the new system
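Step 3 is the interesting one. A sketch of what a single batch processor might do with the redis gem and sorted sets (key scheme, bound and dump format are all assumptions):

```ruby
require 'redis'
require 'json'

MAX_FEED_SIZE = 500 # the bound is an assumption
redis = Redis.new

# Index one activity into the feed of every recipient, newest first,
# capping each sorted set so the feeds stay bounded.
def index_activity(redis, activity)
  activity.fetch('recipient_ids').each do |user_id|
    key = "feed:#{user_id}"
    redis.zadd(key, activity.fetch('created_at').to_f, activity.fetch('id'))
    redis.zremrangebyrank(key, 0, -(MAX_FEED_SIZE + 1)) # drop the oldest
  end
end

# Each batch processor streams its slice of the dump (format invented).
File.foreach('dump/activities.jsonl') do |line|
  index_activity(redis, JSON.parse(line))
end
```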

It worked!

But...

‣ A lot of additional code
‣ Fragile, manual steps involved
‣ A lot of technology: RabbitMQ, Riak, Redis, Varnish
‣ One run took 5 days

But it worked!

Current Status

‣ New system is live for all users since 12/12/12
‣ Old and new system were kept in sync till April 2013
‣ In case of serious trouble, we could have switched to the old system within seconds

Performance goals have been met

                                Old system   New system
happy        t < 0.1s                0.17%       62.01%
satisfied    t < 0.5s               41.36%       99.71%
tolerating   0.5s ≤ t < 2s          58.20%        0.28%
frustrated   t ≥ 2s                  0.44%        0.00%
Apdex Score                           0.70         1.00
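The Apdex scores follow from the table, reading the satisfied row as cumulative (it includes the happy requests; the columns only sum to 100% that way). A quick check:

```ruby
# Apdex = satisfied_ratio + tolerating_ratio / 2, with threshold
# T = 0.5s and tolerating zone up to 4T = 2s.
def apdex(satisfied_ratio, tolerating_ratio)
  (satisfied_ratio + tolerating_ratio / 2.0).round(2)
end

apdex(0.4136, 0.5820) # => 0.7  (old system)
apdex(0.9971, 0.0028) # => 1.0  (new system)
```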

Production setup

‣ 10 Riak Servers as Primary Cluster

‣ 10 Riak Servers as Backup Cluster (Multi-DC Replication)

‣ SSDs, RAID 0 and a proper Linux file I/O scheduler setting (noop)

‣ Bitcask storage backend

‣ 4 REST API Servers

‣ 4 Background Worker Servers

‣ Monitoring using Ganglia and Logjam (App Performance)

Lessons learned

‣ Eventual consistency sounds easy, but is hard to implement correctly in practice
‣ There’s a steep learning curve at the beginning
‣ High update rates and large objects don’t go together well if your storage system offers just get, put and delete operations
‣ Achieving high performance requires careful thought about data structures, algorithms and access patterns
‣ Building a new system from scratch is a lot easier than migrating an existing system

‣ Protobuffs API is faster than HTTP
‣ Use the best-performing JSON library you can find (Ruby: Oj gem)
‣ Avoid a full-blown ORM for Riak if you care about performance (Ruby: Ripple gem)
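“Best performing” is easy to verify against your own documents; a micro-benchmark sketch comparing the standard library’s JSON with Oj (the sample document is made up):

```ruby
require 'json'
require 'oj'
require 'benchmark'

# A representative-ish activity document, invented for illustration.
doc = { 'id' => 1, 'verb' => 'groups.member.joined',
        'actors' => (1..50).map { |i| { 'id' => i, 'name' => "user #{i}" } } }

Benchmark.bm(8) do |b|
  b.report('JSON') { 50_000.times { JSON.generate(doc) } }
  b.report('Oj')   { 50_000.times { Oj.dump(doc, mode: :compat) } }
end
```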

‣ At one point we saturated the Gigabit network cards on the Riak cluster
‣ This led to compressing all data before storing it on the cluster and breaking news feeds into chunks
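A sketch of the compress-before-store idea with Zlib and the riak-client gem (bucket name, key scheme and content type handling are assumptions):

```ruby
require 'riak'
require 'zlib'
require 'json'

# Compressing documents trades CPU for network bandwidth; the value is
# stored as opaque binary, so reads must inflate it again.
def store_compressed(bucket, key, document)
  obj = bucket.get_or_new(key)
  obj.content_type = 'application/octet-stream'
  obj.raw_data = Zlib::Deflate.deflate(JSON.generate(document))
  obj.store
end

def load_compressed(bucket, key)
  JSON.parse(Zlib::Inflate.inflate(bucket.get(key).raw_data))
end

client = Riak::Client.new
bucket = client.bucket('feed_chunks')
store_compressed(bucket, 'newsfeed:42:chunk:7', 'activity_ids' => [1, 2, 3])
```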

The professional network www.xing.com

Thank you for your attention!

Dr. Stefan Kaes

Twitter: @stkaes

Sebastian Röbke

Twitter: @boosty

We’re hiring: careers@xing.com