Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric...

15
Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non- parametric Methods Elena Eneva [email protected] April 20 2001 CALD Lab

Transcript of Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric...

Page 1: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Predicting Consumer Choice Using Supermarket Scanner Data:

Combining Parametric and Non-parametric Methods

Elena [email protected]

  April 20 2001

CALD Lab

Page 2: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Problem and Motivation

Profit optimization for storesKnow thy customerMicro-marketingPredicting Consumer ChoiceUse scanner dataPrevious research: profit increase Gross profit margin 4% to 10% Operating profit margin 33% to 83%

a classic business rule that is taught in every 100-level course

An under-utilized gold mine

Page 3: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Retail Store Scanner Data

Chilled Juice Category14 products2 years100 storesStore-level aggregationWeekly reports

Page 4: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Goal

Build an accurate predictor of consumer choice (that knows the customer). In-prices, out-quantities

Cate

gory

: C

hill

ed

Ora

ng

e Ju

ice

Price of Product 1

Price of Product 2

Price of Product 3

Price of Product 14

. . .

“I know your

customers”

PredictorPredictor

Quantity bought of Product 1

. . .

Quantity bought of Product 2

Quantity bought of Product 3

Quantity bought of Product 14

Page 5: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Previous Work on Retail Data

Traditionally – using parametric models (linear regression)Recently – using non-parametric models (neural networks)

NN outperforms LR in accuracy, although LR performs adequatelyNN are mistrusted

Page 6: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Our Niche

Advantage of LR: known functional form (linear in log space), extrapolation ability

Advantage of NN: flexibility, accuracy

extrapolation ability

acc

ura

cy

NN

new

LRTake Advantage: use the

assumed prior to bias the accurate learner

Higher level model from simpler models

Page 7: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Related Research In Other Fields

Patrice Simard et al. “Tangent Prop” Possible to directly learn the invariance of the data independently

from the “real” learning task

Michael Perrone “Improving Regression Estimation: Averaging Methods for Variance Reduction with Extensions to General Convex Measure Optimization” generalized ensemble method estimator

N

iiiGEM xff

1

)(

Page 8: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Combination Approaches

Train Separately, then Combine

Outputs as Inputs

Jumping Connections

Page 9: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Train separately, then combine

train a NN and a LR separately, and calculate the weighted average for the final prediction

NN

Input Prices

Output Quantities

. . .

. . .

LR

Input Prices

Output Quantities

. . .

. . .

Final Prediction

Page 10: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Outputs as Inputs

adding to a learner the prediction of the other learner (over the same 14 input prices) as an extra input

NN

Output Quantities

Input Prices

. . .

. . .

LR

. . .

Input Prices

. . .

Output Quantities

NN

Output Quantities

Input Prices

. . .

. . .

LR

. . .

Input Prices

. . .

Output Quantities

Page 11: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Jumping Connections

Combining two types of NN connections in one NNGives the effect of simulating a LR and NN all together

. . .

. . .

. . .

Page 12: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Results – RMS Errors

Linear RegressionNeural NetWeighted AverageJump ConnectionsOutputs as Inputs

0.1920.0750.0740.0720.070

Page 13: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Results - % Error in Predicted Q

Comparison with IO method: percent error in predicted output

0

1

2

3

4

5

6

IO JC WA NN LR

Page 14: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Summary

Combining NN and LR gives a more accurate and robust modelBetter in terms of understandability for the marketing communityLearns a better consumer modelImproves pricing strategies

Page 15: Predicting Consumer Choice Using Supermarket Scanner Data: Combining Parametric and Non-parametric Methods Elena Eneva eneva@cs.cmu.edu April 20 2001 CALD.

Future Work

Include: demographics data promotional data competitor data

Apply Multitask LearningAnother data set with more density

End