Data Mashups - Data Science Summit
![Page 1: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/1.jpg)
Data Mashups
Turning Data Exhaust into Insights
May 12, 2011
Data Scientist Summit
Pete Skomoroch
LinkedIn
@peteskomoroch
![Page 2: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/2.jpg)
We have an explosion of data
• DataWrangling
• InfoChimps
• Data.gov
• Factual
• SimpleGeo
![Page 3: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/3.jpg)
And the tools to make sense of it
• Hadoop
• NoSQL
• R
• Python
• Mechanical Turk
![Page 4: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/4.jpg)
Diverse datasets = better signal
![Page 5: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/5.jpg)
![Page 6: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/6.jpg)
![Page 7: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/7.jpg)
Find a meaningful problem
http://www.flickr.com/photos/aloshbennett/
• Identify pain points
• Work on stuff that matters
• Focus on underutilized data
![Page 8: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/8.jpg)
Trendingtopics.org @hourlytrends
![Page 9: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/9.jpg)
LinkedIn Skills
![Page 10: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/10.jpg)
The best mashups are actionable
• Reveal patterns
• Enable predictions
• Recommendations
![Page 11: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/11.jpg)
Mashup: Skills & Cities
![Page 12: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/12.jpg)
Yuba City, California: 21.3% Unemployment
![Page 13: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/13.jpg)
Ames, Iowa: 4.7% Unemployment
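The Skills & Cities mashup amounts to joining two datasets on a shared city key. The slides do not show how it was built, so the following is only a minimal sketch of one way to do such a join. The two unemployment rates are the ones quoted on the slides; the skill names and member counts are hypothetical placeholders, and `mash_up` is an illustrative helper, not anything from the talk.

```python
from collections import defaultdict

# Unemployment rates (percent) as quoted on the slides.
unemployment = {
    "Yuba City, California": 21.3,
    "Ames, Iowa": 4.7,
}

# Hypothetical placeholder rows: (city, skill, member_count). Not real data.
skill_rows = [
    ("Yuba City, California", "skill_a", 10),
    ("Yuba City, California", "skill_b", 5),
    ("Ames, Iowa", "skill_a", 3),
    ("Ames, Iowa", "skill_c", 12),
    ("Unknown City", "skill_a", 7),  # no unemployment figure, dropped by the join
]


def mash_up(skill_rows, unemployment):
    """Inner-join skill rows to unemployment rates on the city name."""
    by_city = defaultdict(list)
    for city, skill, count in skill_rows:
        if city in unemployment:
            by_city[city].append((skill, count))

    result = []
    for city, skills in by_city.items():
        # Most common skills first within each city.
        skills.sort(key=lambda pair: pair[1], reverse=True)
        result.append({
            "city": city,
            "unemployment_pct": unemployment[city],
            "skills": skills,
        })

    # Cities with the highest unemployment first.
    result.sort(key=lambda row: row["unemployment_pct"], reverse=True)
    return result


mashup = mash_up(skill_rows, unemployment)
for row in mashup:
    print(row["city"], row["unemployment_pct"], row["skills"])
```

With real inputs, the same join would let the skill mix of a high-unemployment city be compared side by side with that of a low-unemployment one.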
![Page 14: Data Mashups -Data Science Summit](https://reader035.fdocuments.in/reader035/viewer/2022081403/55501f0bb4c90535638b53c8/html5/thumbnails/14.jpg)
Make data mashups work for you
• Open Data = powerful mashups
• Mashup > sum of its parts
• Focus on meaningful problems
• Actionable mashups are better