Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric...
-
Upload
carmella-gallagher -
Category
Documents
-
view
212 -
download
0
Transcript of Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric...
![Page 1: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/1.jpg)
Predicting Consumer Choice Using Supermarket Scanner Data:
Combining Parametric and Non-parametric Methods
Elena [email protected]
April 20 2001
CALD Lab
![Page 2: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/2.jpg)
Problem and Motivation
Profit optimization for storesKnow thy customerMicro-marketingPredicting Consumer ChoiceUse scanner dataPrevious research: profit increase Gross profit margin 4% to 10% Operating profit margin 33% to 83%
a classic business rule that is taught in every 100-level course
An under-utilized gold mine
![Page 3: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/3.jpg)
Retail Store Scanner Data
Chilled Juice Category14 products2 years100 storesStore-level aggregationWeekly reports
![Page 4: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/4.jpg)
Goal
Build an accurate predictor of consumer choice (that knows the customer). In-prices, out-quantities
Cate
gory
: C
hill
ed
Ora
ng
e Ju
ice
Price of Product 1
Price of Product 2
Price of Product 3
Price of Product 14
. . .
“I know your
customers”
PredictorPredictor
Quantity bought of Product 1
. . .
Quantity bought of Product 2
Quantity bought of Product 3
Quantity bought of Product 14
![Page 5: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/5.jpg)
Previous Work on Retail Data
Traditionally – using parametric models (linear regression)Recently – using non-parametric models (neural networks)
NN outperforms LR in accuracy, although LR performs adequatelyNN are mistrusted
![Page 6: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/6.jpg)
Our Niche
Advantage of LR: known functional form (linear in log space), extrapolation ability
Advantage of NN: flexibility, accuracy
extrapolation ability
acc
ura
cy
NN
new
LRTake Advantage: use the
assumed prior to bias the accurate learner
Higher level model from simpler models
![Page 7: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/7.jpg)
Related Research In Other Fields
Patrice Simard et al. “Tangent Prop” Possible to directly learn the invariance of the data independently
from the “real” learning task
Michael Perrone “Improving Regression Estimation: Averaging Methods for Variance Reduction with Extensions to General Convex Measure Optimization” generalized ensemble method estimator
N
iiiGEM xff
1
)(
![Page 8: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/8.jpg)
Combination Approaches
Train Separately, then Combine
Outputs as Inputs
Jumping Connections
![Page 9: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/9.jpg)
Train separately, then combine
train a NN and a LR separately, and calculate the weighted average for the final prediction
NN
Input Prices
Output Quantities
. . .
. . .
LR
Input Prices
Output Quantities
. . .
. . .
Final Prediction
![Page 10: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/10.jpg)
Outputs as Inputs
adding to a learner the prediction of the other learner (over the same 14 input prices) as an extra input
NN
Output Quantities
Input Prices
. . .
. . .
LR
. . .
Input Prices
. . .
Output Quantities
NN
Output Quantities
Input Prices
. . .
. . .
LR
. . .
Input Prices
. . .
Output Quantities
![Page 11: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/11.jpg)
Jumping Connections
Combining two types of NN connections in one NNGives the effect of simulating a LR and NN all together
. . .
. . .
. . .
![Page 12: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/12.jpg)
Results – RMS Errors
Linear RegressionNeural NetWeighted AverageJump ConnectionsOutputs as Inputs
0.1920.0750.0740.0720.070
![Page 13: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/13.jpg)
Results - % Error in Predicted Q
Comparison with IO method: percent error in predicted output
0
1
2
3
4
5
6
IO JC WA NN LR
![Page 14: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/14.jpg)
Summary
Combining NN and LR gives a more accurate and robust modelBetter in terms of understandability for the marketing communityLearns a better consumer modelImproves pricing strategies
![Page 15: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.](https://reader036.fdocuments.in/reader036/viewer/2022072015/56649eb65503460f94bbf392/html5/thumbnails/15.jpg)
Future Work
Include: demographics data promotional data competitor data
Apply Multitask LearningAnother data set with more density
End