#dido12 presentation

23

description

Presentation at #DiDo, 2013-04-04.

Transcript of #dido12 presentation

Page 2: #dido12 presentation

Big Data?

Page 3: #dido12 presentation

Big Data?

Page 4: #dido12 presentation

Big Data?

Page 5: #dido12 presentation

Big Data?

Page 6: #dido12 presentation

Hadoop?

Page 7: #dido12 presentation

Hadoop?

Page 8: #dido12 presentation

Hadoop?

Page 9: #dido12 presentation

Hadoop?

Page 10: #dido12 presentation

Doing Big DataCollecting / obtaining data:•Forget about retention policy•Storage is cheap (it really is!)•Keep all of it online (or at least almost all)•Make it scalable (no manual processes)

Page 11: #dido12 presentation

Doing Big DataClean:•Data is always a mess•Don’t treat it as an expception•Make it your problem

Page 12: #dido12 presentation

Doing Big DataExplore:•Think about products, not insights per se•90% people / 10% tools•Support ad hoc querying•Never assume•This is where the fancy charts come in•Never assume

Page 13: #dido12 presentation
Page 14: #dido12 presentation
Page 15: #dido12 presentation
Page 16: #dido12 presentation
Page 17: #dido12 presentation
Page 18: #dido12 presentation
Page 19: #dido12 presentation

Doing Big DataModel:•What do I want to predict•Keep it simple (let the data work)•Think about scale

Page 20: #dido12 presentation

Doing Big Data

Page 21: #dido12 presentation

Doing Big DataBuild:•Build functionality•Think about scale, once more•A dashboard is not a data driven solution

Page 22: #dido12 presentation

Doing Big Data