Mass tlc big data panel sep 20

24
What Does All This Data Mean? September 20, 2012 IBM Innovation Center Waltham MA MassTLC Big Data Seminar @masstlc #b igdata

description

 

Transcript of Mass tlc big data panel sep 20

Page 1: Mass tlc big data panel sep 20

What Does All This

Data Mean?

September 20, 2012IBM Innovation Center

Waltham MA

MassTLC Big Data Seminar

@m

asstlc #bigdata

Page 2: Mass tlc big data panel sep 20

What Does All This Data Mean?

Agenda •Setting the Context•Introducing the Panel•Panel Discussion•Q&A

– Hashtags: @masstlc #bigdata

Page 3: Mass tlc big data panel sep 20

Your Panel

• Richard Dale, Managing Director, Big Data Boston

Ventures – Twitter: @rdale

• Irene Greif, Fellow, IBM Visualization– Twitter: @igreif

• Martin Leach, CIO, Broad Institute– Twitter: @mdleach

• Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/

Page 4: Mass tlc big data panel sep 20

Richard Dale

Managing Director, Big Data Boston VenturesMicro-VC fund investing in big data companies located in or connected to the regional big data cluster

Database techie turned Entrepreneur turned VC– Database Performance Guru, SQL Solutions– Co-founder, Phase Forward– Principal, Sigma Partners– Founder & Managing Director, Big Data Boston Ventures

Page 5: Mass tlc big data panel sep 20

Setting the Context

• What is Big Data?

• Where does Big Data come from?

• What is Big Data going?

Page 6: Mass tlc big data panel sep 20

What is Big Data?

a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools

(wikipedia)

Page 7: Mass tlc big data panel sep 20

What is Big Data?

3 V’s: •volume •velocity•variety

(Doug Laney, Gartner)

Page 8: Mass tlc big data panel sep 20

What is Big Data?

Data easier and cheaper to collect than to analyze

(??)

Page 9: Mass tlc big data panel sep 20

What is Big Data?

Data that you can’t process on a single machine, however big your machine (and however long you wait)

or

Data growing faster than Moore’s law

(Richard Dale)

Page 10: Mass tlc big data panel sep 20

Where Does Big Data Come From?

Behavior•Social Media•User Generated Content•Click streams•Viewing, Purchasing, Liking, Sharing•The Quantified Self

Page 11: Mass tlc big data panel sep 20

Where Does Big Data Come From?

Observation (in ever finer granularity)•Machines

– Computers, Vehicles, Phones, Industrial Machines•Environments

– RFID, Traffic flow, Nature (and our impact)•People

– The Quantified Self– Medical imaging– Genetic sequencing

Page 12: Mass tlc big data panel sep 20

Where Does Big Data Come From?

Correlations•Each data item, image or observation can be cross-correlated with any other

•Even if N is tractable, N x N x N x … is not

Page 13: Mass tlc big data panel sep 20

Technology Landscape

Infrastructure: Storing, Managing, MovingInfrastructure: Storing, Managing, Moving

Analytics: Algorithms, Visualization, Machine Learning

Analytics: Algorithms, Visualization, Machine Learning

Applications: Horizontal and Verticalbusiness or domain applications

Applications: Horizontal and Verticalbusiness or domain applications

Data Services:

Collecting,Collating,

Correlating,Curating

Data Services:

Collecting,Collating,

Correlating,Curating

Source:

Page 14: Mass tlc big data panel sep 20

Technology Landscape

Infrastructure: Storing, Managing, MovingInfrastructure: Storing, Managing, Moving

Analytics: Algorithms, Visualization, Machine Learning

Analytics: Algorithms, Visualization, Machine Learning

Applications: Horizontal and Verticalbusiness or domain applications

Applications: Horizontal and Verticalbusiness or domain applications

Data Services:

Collecting,Collating,

Correlating,Curating

Data Services:

Collecting,Collating,

Correlating,Curating

Source:

Page 15: Mass tlc big data panel sep 20

Technology Landscape

Infrastructure: Storing, Managing, MovingInfrastructure: Storing, Managing, Moving

Analytics: Algorithms, Visualization, Machine Learning

Analytics: Algorithms, Visualization, Machine Learning

Applications: Horizontal and Verticalbusiness or domain applications

Applications: Horizontal and Verticalbusiness or domain applications

Data Services:

Collecting,Collating,

Correlating,Curating

Data Services:

Collecting,Collating,

Correlating,Curating

Source:

Page 16: Mass tlc big data panel sep 20

A Sea of Choices for Data Viz

• BI packages• Dashboard reporting tools • Ad hoc infographics• Whiteboards• Napkin scribbles

Page 17: Mass tlc big data panel sep 20

Turning Big Data into Big ClarityArt or Science? Let’s ask the Panel!

•Irene Greif, IBM Fellow– Twitter: @igreif

•Martin Leach, CIO, Broad Institute– Twitter: @mdleach

•Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/

Page 18: Mass tlc big data panel sep 20

Turning Big Data into Big ClarityArt or Science? Let’s ask the Panel!

•Irene Greif, IBM Fellow– Twitter: @igreif

•Martin Leach, CIO, Broad Institute– Twitter: @mdleach

•Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/

Page 19: Mass tlc big data panel sep 20

IBM Center for Social BusinessIrene Greif, IBM Fellow, Chief Scientist for Social Business

Many Eyes

Page 20: Mass tlc big data panel sep 20

Turning Big Data into Big ClarityArt or Science? Let’s ask the Panel!

•Irene Greif, IBM Fellow– Twitter: @igreif

•Martin Leach, CIO, Broad Institute– Twitter: @mdleach

•Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/

Page 21: Mass tlc big data panel sep 20

• The Broad Institute is a non-profit biomedical research institute

• Ten core faculty members and approximately 150 associate members from across MIT and Harvard

• Greater than 1900 research and administrative staff

 

Programs and Initiativesfocused on specific disease or biology areas

CancerGenome BiologyGenome Sequencing and AnalysisCell CircuitsPsychiatric DiseaseMetabolismMedical and Population GeneticsChemical Biology/Novel TherapeuticsInfectious DiseaseEpigenomics

Platformsfocused technological innovation and application

Genomics PlatformBiological SamplesGenome SequencingGenetic Analysis

Chemical Biology/Novel TherapeuticsImagingMetabolite ProfilingProteomicsRNAiTherapeutics Discovery & Development

The Broad Institute of MIT & Harvard

Martin Leach, CIO

Page 22: Mass tlc big data panel sep 20

Turning Big Data into Big ClarityArt or Science? Let’s ask the Panel!

•Irene Greif, IBM Fellow– Twitter: @igreif

•Martin Leach, CIO, Broad Institute– Twitter: @mdleach

•Andrew Pandre, Principal, Sears Holding Cos – http://apandre.wordpress.com/

Page 23: Mass tlc big data panel sep 20

Big Data VisualizationAndrew Pandre, Ph.D.,PrincipalSears Holdings Corporation

Google+ microblog: http://tinyurl.com/VisibleData

Data Visualization Bloghttp://apandre.wordpress.com

Page 24: Mass tlc big data panel sep 20

@masstlc #bigdata