Briefing room 20160510-ep013-prepare_and_share-the_advantages_of_self-service_data_prep-dez-slides


Transcript of Briefing room 20160510-ep013-prepare_and_share-the_advantages_of_self-service_data_prep-dez-slides

Page 1:

DATA PREPARATION

SELF SERVICE DATA PREPARATION IN THE WORLD OF BIG DATA

Garbage in, garbage out: a term that’s more relevant now than ever!

Page 2:

DIY DATA PREP IN BIG DATA ERA

An old issue made more challenging due to scale

Spreadsheet data wrangling is a fool’s errand

More unstructured, less structured

Manual wrangling vs. scripted transformation

Connectors are joining the dots to all data sources

Just-in-time manufacturing in the world of data

Living with the V’s (Volume, Velocity & Variety)

Start small; don’t try to boil the ocean right away

Cost of data prep vs. big iron processing at the back end
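To make the "manual wrangling vs. scripted transformation" point concrete, here is a minimal sketch of what a scripted transformation might look like. It is not taken from the slides: the field names (`name`, `temp_c`), the sample readings, and the cleaning rules are all hypothetical, chosen only to show that a script applies the same steps to every row, every time, which hand edits in a spreadsheet cannot guarantee.

```python
import csv
import io


def clean_rows(raw_csv: str) -> list[dict]:
    """Apply the same cleaning steps to every row of a CSV string.

    Assumed (hypothetical) columns: "name" and "temp_c".
    Rows with a missing name or a non-numeric reading are dropped.
    """
    reader = csv.DictReader(io.StringIO(raw_csv))
    cleaned = []
    for row in reader:
        # Normalise whitespace and capitalisation in the label.
        name = (row.get("name") or "").strip().title()
        temp = (row.get("temp_c") or "").strip()
        if not name or not temp:
            continue  # garbage in: skip incomplete rows
        try:
            value = float(temp)
        except ValueError:
            continue  # garbage in: skip unparseable readings
        cleaned.append({"name": name, "temp_c": value})
    return cleaned


# Hypothetical sample input with typical spreadsheet-style mess.
SAMPLE = "name,temp_c\n  sensor a ,21.5\nsensor b,\nSENSOR C,n/a\nsensor d, 19 \n"
RESULT = clean_rows(SAMPLE)
```

Because the rules live in one function, the same script can be rerun unchanged on the next batch of data, and the rules themselves can be reviewed and versioned.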

Page 3:

TYPICAL BIG DATA ARCHITECTURE

Page 4:

WE CAN’T SPREADSHEET THE IOT

Page 5:

DATA MANAGEMENT FRAMEWORK