Making Big Data Work

23
Making Big Data work Lewis Crawford Principal Architect @ the DataShed thedatashed.co.uk [email protected] © the DataShed Limited 2015

Transcript of Making Big Data Work

Page 1: Making Big Data Work

Making Big Data workLewis CrawfordPrincipal Architect @ the DataShed

thedatashed.co.uk

[email protected]

©theDataShedLimited 2015

Page 2: Making Big Data Work

intro

Page 3: Making Big Data Work

Who am I?

• Forthelast3years,theDataShed hasbeenprovidingconsultancyservicestoavastarrayoflargeclients.Ourprimaryfocusisensuringthattechnologyandanalyticalstrategiesaretrulyalignedsothatbusinessescanleveragethelatestandgreatestintechnologytomodel,mineanddescribetheirdataasset.

• WewereworkingwithBigDatatechnologybeforethetermwascoined,wehaveexperiencedeliveringanalyticalsystemsdrivenbyPetabytedatasets,andhavedesigned,implementedandsupportedoneofthelargestreal-timedataintegrationandpredictiveanalyticsplatformsintheaviationworld.

• Ourmodelisbasedonusingasmallnumberofexceptionallyhighlyskilledindividualstodeliverdisruptiveandinnovativesolutionsinanagileanddelivery-focusedmanner.

©theDataShedLimited 2015

Page 4: Making Big Data Work

So what is ‘Big Data’?

©theDataShedLimited 2015

Page 5: Making Big Data Work
Page 6: Making Big Data Work

Why do Big Data projects fail?

ToomanypeoplethinkthatBigDatais:

“Thebeliefthatthemoredatayouhave,themoreinsightsandanswerswillriseautomaticallyfromthepoolofonesandzeros.”

GillPress,Forbes.com

©theDataShedLimited 2015

Page 7: Making Big Data Work

How to make Big Data work?

1. Understandyourproblem

2. Applyappropriatetools

3. Automateeverything.

©theDataShedLimited 2015

Page 8: Making Big Data Work

Real-time data

©theDataShedLimited 2015

Page 9: Making Big Data Work

©theDataShedLimited 2015

Page 10: Making Big Data Work
Page 11: Making Big Data Work

©theDataShedLimited 2015

Page 12: Making Big Data Work

Continuous Integration Demo

©theDataShedLimited 2015

Page 13: Making Big Data Work

How to make Big Data work?

1. Understandyourproblem

2. Applyappropriatetools

3. Automateeverything.

©theDataShedLimited 2015

Page 14: Making Big Data Work

Little Big Data

©theDataShedLimited 2015

Page 15: Making Big Data Work

A problem closer to home…

• Everybusinessneedstounderstand:• Theirpotentialcustomersandmarket• Currentcustomers• Theirproductsandsales• Howandwhentheyengageprospectsandcustomers

• Analyticsanddataareexpensive• Manyofthemandatoryelementsareverysimilarforeveryone• TheDataShedisAnalyticsasaServiceandSingleCustomerViewasaService.

©theDataShedLimited 2015

Page 16: Making Big Data Work

The deduplication problem…

• SMEhas250,000customers(twosystemsofrecord)• Toidentifyduplicatesbruteforceapproach: 31,249,875,000comparisons• Buildingasystemtoprocessaminimumof100clientsaday…• 3.1trillionrecordstocompareusing>10differentalgorithms

• Traditionalscaleupapproachwouldbeexpensive,andmakeslargeassumptionsaroundblockingandpartitioningrules• Asmalldataproblembutabigdatasolution?

Title FirstName Surname Address 1 Address2 Address3

Dr RJ Smith TwoOaks 112OldSt. CountyDurham

Mrs Robyn Smith 112OldStreet Durham DH15YJ

©theDataShedLimited 2015

Page 17: Making Big Data Work

©theDataShedLimited 2015

Page 18: Making Big Data Work

The Shed demo

©theDataShedLimited 2015

Page 19: Making Big Data Work

How to make Big Data work?

1. Understandyourproblem

2. Applyappropriatetools

3. Automateeverything.

©theDataShedLimited 2015

Page 20: Making Big Data Work

How to make Big Data work?1. Understandyourproblem

• ’BigData’challengesaren’tnecessarilynew,howevermuchofthetechnology is• Articulateandcommunicate– focusondistillingyourproblemdown• Incremental improvementnotwholesalereplacement

2. Applyappropriate tools• Understandtheeconomics aswellasthetechnology• Newtechnologiesneedtobeevaluatedwithinthecontextofyourproblemscope• Newtechnologiesareenablers notdeliverables(#datalake)• ’BigData’technologyshouldbeseenascomplementarytoexistingtechnology

3. Automateeverything• Continuousintegrationtoincludeall testing• Containerisewherepossible• Measureeverything

©theDataShedLimited 2015

Page 21: Making Big Data Work

If you really want to get involved…

©theDataShedLimited 2015

Page 22: Making Big Data Work

Get your hands dirty

Ifyou’reinterestedinlearningmore,we’llbehostingahands-onlabseventinthenearfuture.

Sendyourdetailsto:Email:[email protected]:@thedatashed

©theDataShedLimited 2015

Page 23: Making Big Data Work

Any questions?

©theDataShedLimited 2015

Lewis CrawfordPrincipal Architect @ the DataShed

thedatashed.co.uk

[email protected]