Post on 12-Aug-2015
©2013 DataStax Confidential. Do not distribute without consent.
Christopher Batey@chbatey
So where do I put all the data?
@chbatey
The IoT stack
@chbatey
The IoT stack
@chbatey
The IoT stack
@chbatey
@chbatey
@chbatey
@chbatey
@chbatey
@chbatey
@chbatey
@chbatey
Problem #1 - Too much data
@chbatey
Problem #1 - Too much data• Size
@chbatey
Problem #1 - Too much data• Size• Throughput
@chbatey
Problem #1 - Too much data• Size• Throughput
@chbatey
Problem #2 - Pruning old data
@chbatey
Problem #2 - Pruning old data• All data in Cassandra can be inserted with a Time To
Live e.g 3 months
@chbatey
Problem #3 - Large scale analytics
@chbatey
Problem #3 - Large scale analytics• Apache Spark runs very nicely on Cassandra- Full data scan analytics
@chbatey
Problem #3 - Large scale analytics• Apache Spark runs very nicely on Cassandra- Full data scan analytics- Stream analytics
@chbatey
Want more tech?• http://zeroturnaround.com/rebellabs/so-why-would-i-use-
a-distributed-database-like-apache-cassandra-by-christopher-batey/• https://academy.datastax.com/• http://www.planetcassandra.org/
@chbatey
Thanks for listening• Follow me on twitter @chbatey• Cassandra + Fault tolerance posts a plenty: • http://christopher-batey.blogspot.co.uk/
23