[B1]real time large data at twitter

23
real-time large data @ra deview - 17 september 2012

description

 

Transcript of [B1]real time large data at twitter

Page 1: [B1]real time large data at twitter

real-time large data

@ra!deview - 17 september 2012

Page 2: [B1]real time large data at twitter
Page 3: [B1]real time large data at twitter
Page 4: [B1]real time large data at twitter
Page 5: [B1]real time large data at twitter

there are over400 million tweetsa day

Page 6: [B1]real time large data at twitter

a second4600 tweets

0.2 msa tweet≈

Page 7: [B1]real time large data at twitter

Pull Push

Targeted twitter.comhome_timeline API

User / Site StreamsMobile Push (SMS, etc.)

Queried Search API Track / Follow Streams

Page 8: [B1]real time large data at twitter

Write API

Fanout

Tim

elin

e C

ache

Redis

Ingester

Sear

ch C

ache

Redis

Push

Com

pute

HTTP Push

Bat

ch C

ompu

te

Hadoop

Page 9: [B1]real time large data at twitter

Timeline Service

Ingester

Sear

ch C

ache

RedisRedisEarlybird

Blender

Push

Com

pute

HTTP Push

Mobile Push B

atch

Com

pute

Hadoop

Write API

Fanout

RedisRedis

Tim

elin

e C

ache

Redis

Social Graph Service

Page 10: [B1]real time large data at twitter

Timeline Service

Ingester

Sear

ch C

ache

RedisRedisEarlybird

Blender

Push

Com

pute

HTTP Push

Mobile Push B

atch

Com

pute

Hadoop

Write API

Fanout

RedisRedis

Tim

elin

e C

ache

Redis

Social Graph Service

Page 11: [B1]real time large data at twitter

Ingester

Sear

ch C

ache

RedisRedisEarlybird

Blender

Push

Com

pute

HTTP Push

Mobile Push B

atch

Com

pute

Hadoop

Write API

Fanout

RedisRedis

Tim

elin

e C

ache

Timeline Service

Redis

Page 12: [B1]real time large data at twitter

Bat

ch C

ompu

te

Hadoop

Push

Com

pute

HTTP Push

Sear

ch In

dex

Blender

Redis

Timeline Service

Ingester

Earlybird

Write API

Fanout

RedisRedis

Tim

elin

e C

ache

RedisEarlybirdMobile Push

Page 13: [B1]real time large data at twitter

STORAGE & RETRIEVAL

LOGICPRESENTATIONROUTING

Monorail

T-Bird

T-Flock + Haplo

Flock(s)

Darkwing

Page 14: [B1]real time large data at twitter

Macaw+Swift

Story Service

Macaw+Disco

STORAGE & RETRIEVAL

LOGICPRESENTATIONROUTING

T-Bird

T-Flock + Haplo

Flock(s)

Darkwing

Tweetypie

Gizmoduck

TLS

Social Graph Service

Monorail

Woodstar

TFE

Page 15: [B1]real time large data at twitter
Page 16: [B1]real time large data at twitter

timeline delivery statistics⇢30b deliveries / day (~20m / min)

⇢3.5 seconds @ p50 to deliver to 1m

⇢~350k deliveries / sec

Page 17: [B1]real time large data at twitter
Page 18: [B1]real time large data at twitter
Page 19: [B1]real time large data at twitter
Page 20: [B1]real time large data at twitter
Page 21: [B1]real time large data at twitter
Page 22: [B1]real time large data at twitter
Page 23: [B1]real time large data at twitter

#JoinTheFlock