#dido12 presentation
-
Upload
fvanvollenhoven -
Category
Technology
-
view
288 -
download
1
description
Transcript of #dido12 presentation
Big Data?
Big Data?
Big Data?
Big Data?
Hadoop?
Hadoop?
Hadoop?
Hadoop?
Doing Big DataCollecting / obtaining data:•Forget about retention policy•Storage is cheap (it really is!)•Keep all of it online (or at least almost all)•Make it scalable (no manual processes)
Doing Big DataClean:•Data is always a mess•Don’t treat it as an expception•Make it your problem
Doing Big DataExplore:•Think about products, not insights per se•90% people / 10% tools•Support ad hoc querying•Never assume•This is where the fancy charts come in•Never assume
Doing Big DataModel:•What do I want to predict•Keep it simple (let the data work)•Think about scale
Doing Big Data
Doing Big DataBuild:•Build functionality•Think about scale, once more•A dashboard is not a data driven solution
Doing Big Data