Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
-
Upload
yhat -
Category
Data & Analytics
-
view
727 -
download
0
description
Transcript of Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
![Page 1: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/1.jpg)
Applied Data Science with Yhat
Greg Lamp
Data Science MD MeetupOctober 2014
![Page 2: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/2.jpg)
1) Intro2) The Problem3) Solutions4) Case Study: Beer Recommender5) Demo6) Q/A
![Page 3: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/3.jpg)
Here I am on the Internet.
Founder/CTO @ Yhat
Hi, I’m Greg!
![Page 4: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/4.jpg)
![Page 5: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/5.jpg)
Founders
Company
Investors
Greg Lamp, CTO
Austin Ogilvie, CEO
● Launched in 2013
● HQ in Brooklyn
![Page 6: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/6.jpg)
![Page 7: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/7.jpg)
![Page 8: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/8.jpg)
Data sciencein the real world.
regression
![Page 9: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/9.jpg)
Get Raw Data
Strategic Insights
Real World Scoring
Data Driven Products
Business Impact
Clean Data
Stages of the Analytics Project Life Cycle
Expert data teams
Management
Customers & Front Line Employees
![Page 10: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/10.jpg)
![Page 11: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/11.jpg)
What makes building analytical apps hard?
![Page 12: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/12.jpg)
Hi, I’m Trey.
Meet Trey, the Data Scientist
![Page 13: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/13.jpg)
We need to reduce churn.
Okay. I'll look into it.
![Page 14: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/14.jpg)
I figured out that....some complex stuff about vector space that'll improve...
....and that's how we'll reduce churn.
Sounds good. Let's do that...
![Page 15: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/15.jpg)
Any of you know what Gradient Boosting is?
So when can we go live with the new model?
![Page 16: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/16.jpg)
Now what?
![Page 17: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/17.jpg)
1)Translate Code
![Page 18: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/18.jpg)
![Page 19: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/19.jpg)
2 Rebel Policeme
n 2
![Page 20: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/20.jpg)
2)PMML
![Page 21: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/21.jpg)
![Page 22: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/22.jpg)
?
![Page 23: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/23.jpg)
3)Batch Jobs
![Page 24: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/24.jpg)
![Page 25: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/25.jpg)
![Page 26: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/26.jpg)
use your tools
![Page 27: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/27.jpg)
use your tools move quickly
![Page 28: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/28.jpg)
use your tools move quickly
any workflow
![Page 29: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/29.jpg)
use your tools move quickly
any workflow no translating
![Page 30: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/30.jpg)
Case Study
![Page 31: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/31.jpg)
+ =?
![Page 32: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/32.jpg)
A Beer Recommender in Python
![Page 34: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/34.jpg)
The Data
![Page 35: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/35.jpg)
http://snap.stanford.edu/data/web-BeerAdvocate.html
![Page 36: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/36.jpg)
![Page 37: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/37.jpg)
![Page 38: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/38.jpg)
![Page 39: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/39.jpg)
![Page 40: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/40.jpg)
![Page 41: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/41.jpg)
![Page 42: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/42.jpg)
![Page 43: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/43.jpg)
![Page 44: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/44.jpg)
![Page 45: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/45.jpg)
![Page 46: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/46.jpg)
Beers
![Page 47: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/47.jpg)
Users
![Page 48: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/48.jpg)
Ratings
![Page 49: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/49.jpg)
Distance
![Page 50: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/50.jpg)
vs
![Page 51: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/51.jpg)
![Page 52: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/52.jpg)
![Page 53: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/53.jpg)
![Page 54: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/54.jpg)
![Page 55: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/55.jpg)
![Page 56: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/56.jpg)
vs
![Page 57: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/57.jpg)
![Page 58: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/58.jpg)
calculating distance
![Page 59: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/59.jpg)
eeny
? ?
![Page 60: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/60.jpg)
eeny meeny
?
![Page 61: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/61.jpg)
?Cosine
eeny meeny miny
![Page 62: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/62.jpg)
?Cosine
moe
![Page 63: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/63.jpg)
pick one.you can always
change
![Page 64: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/64.jpg)
![Page 65: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/65.jpg)
Thank you,
![Page 66: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/66.jpg)
![Page 67: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/67.jpg)
![Page 68: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/68.jpg)
![Page 69: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/69.jpg)
![Page 70: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/70.jpg)
![Page 71: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/71.jpg)
![Page 72: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/72.jpg)
![Page 73: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/73.jpg)
Scoring
![Page 74: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/74.jpg)
![Page 75: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/75.jpg)
Aggregate
![Page 76: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/76.jpg)
Sort
![Page 77: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/77.jpg)
Filter
![Page 78: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/78.jpg)
Return
![Page 79: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/79.jpg)
![Page 80: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/80.jpg)
Deployment
![Page 81: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/81.jpg)
What does this mean?
![Page 82: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/82.jpg)
![Page 83: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/83.jpg)
Import Yhat
![Page 84: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/84.jpg)
Create a YhatModel
![Page 85: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/85.jpg)
Define execute
![Page 86: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/86.jpg)
Grab incoming data
![Page 87: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/87.jpg)
Call your function
![Page 88: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/88.jpg)
Format and return results
![Page 89: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/89.jpg)
![Page 90: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/90.jpg)
![Page 91: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/91.jpg)
![Page 92: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/92.jpg)
Demohttp://cloud.yhathq.com/http://beers.yhathq.com/
![Page 93: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/93.jpg)
![Page 96: Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014](https://reader035.fdocuments.in/reader035/viewer/2022062706/557d584ad8b42aba3d8b4923/html5/thumbnails/96.jpg)
Questions?