DI&A Slides: Data Insights and Analytics Frameworks

18
www.firstsanfranciscopartners.com Produced by: MONTHLY SERIES Brought to you in partnership with: January 5, 2017 Data Insights and Analytics Frameworks

Transcript of DI&A Slides: Data Insights and Analytics Frameworks

Page 1: DI&A Slides: Data Insights and Analytics Frameworks

The First Step in Information Management

www.firstsanfranciscopartners.com

Producedby:

MONTHLY SERIES

Broughttoyouinpartnershipwith:

January 5, 2017Data Insights and Analytics Frameworks

Page 2: DI&A Slides: Data Insights and Analytics Frameworks

Welcometothenewseries

§ Thepurposeofournewseriesisto:

− GrowunderstandingonDataInsightsandAnalytics

− CovertheinsandoutsoftheBigData,Analytics,BusinessIntelligenceand reportinguniverse

− Focusonpractical,realistic,value

− Wanttobypassfluff,hypeandfalsepromises

− Needyourfeedback− UseQ&A

pg 2© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

DataLakevs.Data

Warehouse

Descriptive,Prescriptive

andPredictiveAnalytics

GoverningQualityAnalytics

TheRoleofaDataScientist(InterviewwithaCDS)

Analytics,BIandDataScience:

What’stheProgression?

Page 3: DI&A Slides: Data Insights and Analytics Frameworks

Topicsfortoday’swebinar

FrameworksdefinedEnterpriseanalyticsarchitectureOverviewofstandarddatainsightsandanalyticscomponentsBigDataSandboxReal-timeAnalyticsFromLegacyarchitecturestodatainsightKeytakeawaysQ&A

pg 3

Frameworksdefined

Enterpriseanalyticsarchitecture

• BigData• Sandbox• Real-timeAnalytics• LegacyArchitectures

Overviewofstandarddatainsightsandanalyticscomponents

Keytakeaways

Q&A

© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 4: DI&A Slides: Data Insights and Analytics Frameworks

Frameworksdefined

▪ Thestructurefordeliveringandgettingvalueoutofyourdataandanarchitecturefordecision-making,includingorganizationalmodels

▪ Yourenterpriseanalyticsarchitectureneedstoreflectholisticthinking▪ Yourstartingpointandbusinessneedsdeterminehowyouprogress,notapre-definedcurve

▪ Organizationsthatcanbarelydeliveranaccurateproductionreportaredoingpredictiveanalytics– Right?Wrong?

pg 4

Predictive

ManagingProactive

Operating

DataInsightandAnalyticsMaturity© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 5: DI&A Slides: Data Insights and Analytics Frameworks

TheDataInsightandAnalyticsFramework

DataInsightandAnalyticsFramework

DataInsightandAnalyticsStrategy

TechnologyInfrastructure

DataInsightandAnalyticsOperatingModel

Compo

nents “BestFit”Data

Architecture DataQuality DemandManagement

Presentation DataWrangling MetadataManagement

GOVERNANCE

ORGANIZATIONAL ALIGNMENT

pg 5

Page 6: DI&A Slides: Data Insights and Analytics Frameworks

Sampleenterpriseanalyticsarchitecture

ODS*

DM Big Data*

Sandbox

Securityandgovernance

Presentation(visualization,reports,algorithms,queries)

Dataingestion

Operationsscheduling,

managem

ent,DataQuality,ControlsSources

* = Includes Real Time ETLandMovement

DW*

Metadata

Data Lake

pg 6© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 7: DI&A Slides: Data Insights and Analytics Frameworks

ODS*

DM

Big Data*

Sandbox

Securityandgovernance

Presentation(visualization,reports,algorithms,queries)

Dataingestion

Operationsscheduling,m

anagement,DataQ

uality,Controls

Sources

ETLandMovement

DW*

Metadata

Data Lake

Enterpriseanalyticsarchitecture– BigData

§ MorethantheBigData“stack”

§ Nolongerlinear– ProductiontoAccess

§ Arrangedbylatency,access,intendedvalue,datavelocity,datavolumeanddatamovementcapacity1. Standard“BigData”2. Sandbox3. Realtime4. Heritage

pg 7

12*3

4

© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 8: DI&A Slides: Data Insights and Analytics Frameworks

BigData

§ Components− DataSources− Ingestion− Structuring− AnalyticsandVisualization− Metadata

§ HighPriorityConcerns− Metadata– Technical,Business,Lineage,Meaning,Interpretation− Securityandprivacy- Accessandusagemustbemanaged

accordingtorisk,permissions,policy,contractualagreements− DataGovernance

§ Oversightofsemantics,lineage,quality

− Latency,access,usage§ Persistent§ Typeofdatastructure– Hive,Hbase

pg 8

DataSources IngestionandTransformation

StructuringData

AnalyticsandVisualization

MonitorTechnicalMetadataBusinessMetadata

© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 9: DI&A Slides: Data Insights and Analytics Frameworks

Usecase– BigData

§ Telecom− Analyze500discretedataelementsincludingsupportcallpatterns,lateor

delinquentpaymentsandotherongoingvitalsignsviapredictiveanalytics− Identify“churn”prospectsandtakestepstopreventit

§ Results− 47%reductionincustomerchurn,protecting$15millioninrevenue− Predictiveanalyticshasspreadorganicallytootherpartsofthecompany,

includingcollections

pg 9© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 10: DI&A Slides: Data Insights and Analytics Frameworks

Sandbox

§ Components– SimilartoBigData− Standalonesandboxisalsoarelevantcomponent− Ingestion– Batch− AnalyticsandVisualization- DataScientist/Analystonly

§ HighPriorityConcerns− Data

§ Discovery§ Understanding§ Standardization§ Usefulness

− SecurityandPrivacy§ Rawaspectimpliesnocontrols§ Controlthedata,nottheenvironment

− DataGovernancefocus,Usage,access− Metadata,provenanceandpedigree

§ Latency,access,usage− Sandboxequalsnon-persistent,non-production− Self-service− Housekeeping

pg 10

Stand Alone Sandbox

DataSources IngestionandTransformationAnalyticsandVisualization

MonitorTechnicalMetadataBusinessMetadata

© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 11: DI&A Slides: Data Insights and Analytics Frameworks

Usecase– Sandbox

§ PredictiveMaintenance,Manufacturing− Collectedmachinesensordata− Createdasandboxenvironmenttocentralizethepartsfailureanalysis− Combinedsensordatawithoperationaldata− Afterfindinginsights,operationalizetheprocessesinthelineofbusiness

§ Results− Shortertime-to-insight;Ittookonlythreeweeks(from6months)todevelopapartsfailure

predictionalgorithm

pg 11© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 12: DI&A Slides: Data Insights and Analytics Frameworks

Real-timeanalytics

§ Components- similartoregularBigData− Streamingprocessinganaddedcomponent− Realtimecapabilityonnon-BigDataaswell

§ HighPriorityConcerns− Speedofingestion

§ Real-timeDataIngestion− DataStreaming− DataMessaging− In-memorydatabaseforextremelowlatencyrequirements

− Securityandprivacy− DataGovernanceFocus

§ Metadata§ Compliance

− Latency,access,usage− Real-timeDataQuery

§ EnterpriseQueryandReporting§ FastQueryDatabasetostoreanalyticalresults§ Agents,messaging,newevents§ Flexiblepersistencyandaccessibility§ Verylowlatency,highperformance

pg 12

DataSources

IngestionandTransformation

StructuringData

AnalyticsandVisualization

MonitorTechnicalMetadataBusinessMetadata

StreamingData

Processing

Real time DW, or ODS

© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 13: DI&A Slides: Data Insights and Analytics Frameworks

Usecase– Real-timeanalytics

§ Health− Automaticallysiftsthroughmillionsofpostsondozensofsocialmediasites,localnews

reports,medicalworkers’socialnetworksandgovernmentwebsitestotrackinstancesofdisease

− Continuallyplotsdiseasehotspotsonamap

§ Results− Identifiedaclusterof“mysteryhemorrhagicfever”inGuineaoveraweekbeforethe

MinistryofHealthofGuineanotifiedtheWorldHealthOrganization(WHO),thatadaylaterconfirmedtheEbola outbreak

pg 13© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 14: DI&A Slides: Data Insights and Analytics Frameworks

Relevanceoflegacyortraditionaldatainsight

§ Legacystructuresstillhaverelevance− Reporting− StandardBI

§ Components− Familiarnames– ETL,DW,ODS,DM− ManyaspectsofBigDatatechnologyarenotrelevantto

manydatauses

§ TraditionalConcerns− ETLvs.webservicespipelinesviadatalayer− Understandingneedfortraditionaluses:

§ Departmentaluse§ Historicalreporting§ Operationalandad-hocreporting

− Supportofmulti-latency,historical,operational,etc.,requirements

pg 14

ODS*

DM

Presentation(visualization,reports,algorithms,queries)

Sources

ETLandMovement

DW*

© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 15: DI&A Slides: Data Insights and Analytics Frameworks

Usecase– Legacyortraditionaldatainsight

§ Healthcare− Adatawarehousetofacilitateinformationandbestpracticessharingbetweenthousandsof

providersandresearchprofessionals.− Alsodeployedpredictiveanalyticsandartificialintelligencetoderivebetterinsightsfrom

ElectronicHealthRecordsandimprovepatientoutcome.

§ Results− 400,000patientrecordscentralizedinasingledatawarehousewhichcanscaleupto20

millionrecords.− 42%anticipatedimprovementinpatientoutcomeswithArtificialIntelligence− 58%anticipatedreductionincostperunitofoutcomechange

pg 15© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 16: DI&A Slides: Data Insights and Analytics Frameworks

Keytakeaways

§ Referencearchitecturesarejustthat−Youmaynotneedalake,datawarehouseorsandbox

§ Avoidcobblingtogethertechnicalcomponents

§ Plantomatchyourarchitecturetoneedsandusage,vs.existingcomponents

§ WebServicesareanimportanttool− Ifusingservices,pleaseconsideradistinctdatalayer

pg 16© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 17: DI&A Slides: Data Insights and Analytics Frameworks

Q&A

pg 17© 2017 First San Francisco Partners www.firstsanfranciscopartners.com

Page 18: DI&A Slides: Data Insights and Analytics Frameworks

pg 18

ThankyouandHappyNewYear!SeeyouThursday,February2forDataLakevs.DataWarehouse

JohnLadley@[email protected]

KelleO’Neal@[email protected]

© 2017 First San Francisco Partners www.firstsanfranciscopartners.com