2015-08-22 Stream billions of data to Google Big Query

[email protected] Teddy Chan, CEO & Co-Founder

Transcript of 2015-08-22 Stream billions of data to Google Big Query

Page 1: 2015-08-22 Stream billions of data to Google Big Query

[email protected]

Teddy Chan, CEO & Co-Founder

Page 2: 2015-08-22 Stream billions of data to Google Big Query

Shipment tracking API

Supporting 290+ carriers

Page 3: 2015-08-22 Stream billions of data to Google Big Query

Free Shipping APIs

Single API for 5 carriers

Page 4: 2015-08-22 Stream billions of data to Google Big Query

How do we find you?

Page 5: 2015-08-22 Stream billions of data to Google Big Query

Google BigQuery is great

- Pay only when you run a query

- Super cheap storage

- Super cheap to send data in

Page 6: 2015-08-22 Stream billions of data to Google Big Query

Perfect for logging everything

3 billion records / month

1,200 records / sec
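As a rough check on how the two figures above relate, the monthly volume can be converted into an average per-second rate. This is only a sketch: the 30-day month is an assumption, and the slide does not say how its rate was derived.

```python
# Rough sanity check: convert a monthly record volume into an average
# per-second rate. The 30-day month is an assumption, not from the slide.
SECONDS_PER_DAY = 24 * 60 * 60


def average_rate_per_second(records_per_month: int, days_per_month: int = 30) -> float:
    """Return the average number of records per second over one month."""
    return records_per_month / (days_per_month * SECONDS_PER_DAY)


rate = average_rate_per_second(3_000_000_000)
print(f"{rate:.0f} records / sec on average")  # prints "1157 records / sec on average"
```

An average of roughly 1,157 records per second is in line with the rounded 1,200 on the slide.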

Page 7: 2015-08-22 Stream billions of data to Google Big Query

API Log BigQuery

Goal

Page 8: 2015-08-22 Stream billions of data to Google Big Query

API Log BigQuery

It does not work!

- HTTP RESTful

- BigQuery API call limit

Page 9: 2015-08-22 Stream billions of data to Google Big Query

API Log Redis Streamer BigQuery

How do we solve it?

Page 10: 2015-08-22 Stream billions of data to Google Big Query

Why Redis?

- Persistent TCP connection

- Fast, really FAST

Page 11: 2015-08-22 Stream billions of data to Google Big Query

What does the STREAMER do?

- Read data from Redis

- Send BATCH data to BQ

- Pre / post-process data
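The three streamer duties listed above might look roughly like the sketch below. It is not the code from the AfterShip demo repository. To stay self-contained it uses stand-ins: an in-memory `deque` plays the role of the Redis list (a real consumer could pop with Redis list commands such as `RPOP`), and `send_batch` is a hypothetical callable where a real streamer would call BigQuery's streaming insert API (`tabledata.insertAll`). The batch size of 500 and the `/trackings` path are illustrative assumptions, and only the pre-processing step is shown.

```python
import json
from collections import deque
from typing import Callable, Deque, Dict, List

Row = Dict[str, object]


def pre_process(raw: str) -> Row:
    """Pre-process step: decode one JSON-encoded log line into a row."""
    return json.loads(raw)


def drain_batch(queue: Deque[str], max_rows: int) -> List[Row]:
    """Read up to max_rows entries from the queue, oldest first."""
    rows: List[Row] = []
    while queue and len(rows) < max_rows:
        rows.append(pre_process(queue.popleft()))
    return rows


def stream_all(
    queue: Deque[str],
    send_batch: Callable[[List[Row]], None],
    max_rows: int = 500,  # hypothetical batch size, not from the slides
) -> int:
    """Send everything in the queue in batches; return the number of requests."""
    requests = 0
    while True:
        rows = drain_batch(queue, max_rows)
        if not rows:
            return requests
        send_batch(rows)  # stand-in for one BigQuery streaming insert call
        requests += 1


# One second of traffic at the rate quoted earlier: 1,200 log records.
queue = deque(json.dumps({"id": i, "path": "/trackings"}) for i in range(1200))
sent: List[List[Row]] = []
requests = stream_all(queue, sent.append, max_rows=500)
print(requests, [len(batch) for batch in sent])  # prints "3 [500, 500, 200]"
```

With batching, 1,200 records need 3 outgoing calls instead of 1,200, which is how a streamer can stay under an API call limit.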

Page 12: 2015-08-22 Stream billions of data to Google Big Query

Demo

Page 13: 2015-08-22 Stream billions of data to Google Big Query

http://www.slideshare.net/aftership

FAQ

OPEN SOURCE

https://github.com/AfterShip/demo_streamer2bigquery