Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

28
Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K Model-Logistic-MLE-1Y1X-Excel2013-Slides.pdf 1 Model Logistic MLE 1Y1X Excel 2013 V0K 1 by Milo Schield Member: International Statistical Institute US Rep: International Statistical Literacy Project Director, W. M. Keck Statistical Literacy Project Slides and data at: www.StatLit.org/ pdf/Model-Logistic-MLE-1Y1X-Excel2013-slides.pdf xls/Model-Logistic-MLE-1Y1X-Excel2013-Data.xlsx Model Logistic Regression MLE 1Y1X in Excel 2013 Model Logistic MLE 1Y1X Excel 2013 V0K 2 Background & Goals Modelling a binary outcome (buy/look, payoff/default, go/nogo or male/female) requires logistic regression. Doing logistic regression in Excel requires Solver. “Since its introduction in .. 1991, … Excel Solver has become the most widely distributed – and almost surely the most widely used – general-purpose optimization modeling system.” www.utexas.edu/courses/lasdon/design3.htm This presentation uses college student data: pulse.xls. This demo models gender (male) based on height. Goals: Create graphs on slides 4 and 22. Determine if slope is statistically significant. Model Logistic MLE 1Y1X Excel 2013 V0K Column H: 1=Male, 0 = Female (circled) Ave Heights: M: 70.75” 62% F: 65.3” 38% Difference: 5.35” 3 This demo uses Height (col C) to predict Gender (col H) Model Logistic MLE 1Y1X Excel 2013 V0K 4 To Do: Model Gender by Height Show Trend-line and Equation This trend-line does not satisfy the least-squares assumptions and it goes outside the valid range. but it does pass through the joint mean. Model Logistic MLE 1Y1X Excel 2013 V0K 5 Intuitive idea of solution This shape handles all heights: even if negative. Model Logistic MLE 1Y1X Excel 2013 V0K 1) Prepare data for logistic MLE regression 2) Solve for just the intercept 3) Solve for the intercept and slope 4) Generate various graphs 5) Test for statistical significance 6 Outline of Approach: Five Steps To do: Get data at www.StatLit.org/xls/ Model-Logistic-MLE-1Y1X-Excel2013-Data.xlsx

Transcript of Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Page 1: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model-Logistic-MLE-1Y1X-Excel2013-Slides.pdf 1

Model Logistic MLE 1Y1X Excel 2013V0K 1

byMilo Schield

Member: International Statistical InstituteUS Rep: International Statistical Literacy ProjectDirector, W. M. Keck Statistical Literacy Project

Slides and data at: www.StatLit.org/pdf/Model-Logistic-MLE-1Y1X-Excel2013-slides.pdfxls/Model-Logistic-MLE-1Y1X-Excel2013-Data.xlsx

Model Logistic Regression MLE 1Y1X in Excel 2013

Model Logistic MLE 1Y1X Excel 2013V0K 2

Background & Goals

Modelling a binary outcome (buy/look, payoff/default, go/nogo or male/female) requires logistic regression.

Doing logistic regression in Excel requires Solver. “Since its introduction in .. 1991, … Excel Solver has become the most widely distributed – and almost surely the most widely used – general-purpose optimization modeling system.” www.utexas.edu/courses/lasdon/design3.htm

This presentation uses college student data: pulse.xls. This demo models gender (male) based on height.

Goals: Create graphs on slides 4 and 22.Determine if slope is statistically significant.

Model Logistic MLE 1Y1X Excel 2013V0K

Column H: 1=Male, 0 = Female (circled)

Ave Heights:

M: 70.75” 62%

F: 65.3” 38%

Difference:5.35”

3

This demo uses Height (col C) to predict Gender (col H)

Model Logistic MLE 1Y1X Excel 2013V0K 4

To Do: Model Gender by HeightShow Trend-line and Equation

This trend-line does not satisfy the least-squares assumptions and it goes outside the valid range.

but it does pass through the joint mean.

Model Logistic MLE 1Y1X Excel 2013V0K 5

Intuitive idea of solution

This shape handles all heights: even if negative.

Model Logistic MLE 1Y1X Excel 2013V0K

1) Prepare data for logistic MLE regression

2) Solve for just the intercept

3) Solve for the intercept and slope

4) Generate various graphs

5) Test for statistical significance

6

Outline of Approach:Five Steps

To do: Get data at www.StatLit.org/xls/Model-Logistic-MLE-1Y1X-Excel2013-Data.xlsx

Page 2: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model-Logistic-MLE-1Y1X-Excel2013-Slides.pdf 2

Model Logistic MLE 1Y1X Excel 2013V0K 7

1) To do: Load Data;Hide columns; Enter formula

Hide columns A-B and D-G. Let I2=1; J2=0Enter formula in M2-Q2 and J4

Model Logistic MLE 1Y1X Excel 2013V0K 8

To do: Select M2:Q2, Pull to 93Copy/Paste Value of J4 onto J5

Odds = Prob(Y=1)/[1-Prob(Y=1)] Range: 0 to infinityLogit = LN(Odds). Range: -infinity to +infinity.

Logistic regress: Logit = intercept + Slope*Height

Model Logistic MLE 1Y1X Excel 2013V0K 9

Result: Ln-Likelihood of OK;Original Sum = -63.82

Intercept of 1 give P(Y = 1 = male) = 73%. But 62% of these students are male. Solver should adjust intercept so P(Y=1) = 62%.

Model Logistic MLE 1Y1X Excel 2013V0K 10

2a) Solve for Intercept

From Data Menu, select Solver

Model Logistic MLE 1Y1X Excel 2013V0K 11

2b) Select Solver parameters

Set Objective to J4Set “To” to “Max”Set “change variable”

to I2

Set “Solving Method” to “GRC Nonlinear”

Press OK

Model Logistic MLE 1Y1X Excel 2013V0K 12

2c) Results: All constraints and conditions are satisfied.

Page 3: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model-Logistic-MLE-1Y1X-Excel2013-Slides.pdf 3

Model Logistic MLE 1Y1X Excel 2013V0K 13

2d) Results are as expected:Probability of Male = 62%

If we must select a single value to predict the outcome, it would be the percentage of students who are men.

Model Logistic MLE 1Y1X Excel 2013V0K

Copy “Sum LnLk” from J4 to clipboard.Paste-Special Value in J6

14

2e) Store sum after each stepUse for statistical significance

Now solve for the slope in logistic regression

Model Logistic MLE 1Y1X Excel 2013V0K

.

15

3a) Solve for Slope and InterceptFrom Data menu, select Solver

Model Logistic MLE 1Y1X Excel 2013V0K 16

3b) Set Solver Parameters

Select Intercept and slope (I2:J2) in “Changing Variable Cells”.

Model Logistic MLE 1Y1X Excel 2013V0K 17

3c) Results: All constraints and conditions are satisfied

.

Model Logistic MLE 1Y1X Excel 2013V0K

Misclassified

18

3d) Error in row 6: short guy.Classified gals in 7, 11 & 12 OK

Page 4: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model-Logistic-MLE-1Y1X-Excel2013-Slides.pdf 4

Model Logistic MLE 1Y1X Excel 2013V0K 19

3e) Prepare to test slope for statistical significance

From J4, copy “Sum LnLk” to clipboard. Paste-Special Value onto J7.

Model Logistic MLE 1Y1X Excel 2013V0K 20

4a) Prepare data for Graphs

Start Ht-Graph at minimum of height in S2Enter formulas for Logit, Odds and Prob(Y=1).

Model Logistic MLE 1Y1X Excel 2013V0K

Select T2:V2. Drag down 1 row

Select S2:V3. Drag to bottom

Select columns S & Vfor logistic graph.

21

4b) Prepare X and Y data.X is Height. Y is Prob (Y=1)

Model Logistic MLE 1Y1X Excel 2013V0K

Original data: Col C & H; Logistic data: Col S & V

22

4c) Graph Logistic Regression of Gender by Height.

Logistic dataMarker No; Line Yes

Marker Yes; Line NoOriginal data

Model Logistic MLE 1Y1X Excel 2013V0K

Calculate difference with ‘full’; multiply by -2. Conduct a right-tail Chi2 test with 1 degree freedom.

Slope is statistically significant: P-value < 0.05

23

5) Hypothesis test: Is non-zeroslope statistically significant?

Model Logistic MLE 1Y1X Excel 2013V0K

ACKNOWLEDGMENT:This presentation closely follows the Carlberg

(2012) presentation in Chapter 2: pages 21-52.These slides present the how – step by step –

of logistic regression for a single case. Carlberg (2012) discusses the how and the why.

REFERENCE:Carlberg, Conrad (2012). Decision Analytics:

Microsoft Excel. Que Publishing.

24

Acknowledgment and Reference

Page 5: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

byMilo Schield

Member: International Statistical InstituteUS Rep: International Statistical Literacy ProjectDirector, W. M. Keck Statistical Literacy Project

Slides and data at: www.StatLit.org/pdf/Model-Logistic-MLE-1Y1X-Excel2013-slides.pdfxls/Model-Logistic-MLE-1Y1X-Excel2013-Data.xlsx

Model Logistic Regression MLE 1Y1X in Excel 2013

1

Page 6: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 2

Background & Goals

Modelling a binary outcome (buy/look, payoff/default, go/nogo or male/female) requires logistic regression. Doing logistic regression in Excel requires Solver. “Since its introduction in .. 1991, … Excel Solver has become the most widely distributed – and almost surely the most widely used – general-purpose optimization modeling system.” www.utexas.edu/courses/lasdon/design3.htm

This presentation uses college student data: pulse.xls. This demo models gender (male) based on height. Goals: Create graphs on slides 4 and 22.Determine if slope is statistically significant.

Page 7: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

Column H: 1=Male, 0 = Female (circled)

Ave Heights:

M: 70.75” 62%

F: 65.3” 38%

Difference:5.35”

3

This demo uses Height (col C) to predict Gender (col H)

Page 8: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 4

To Do: Model Gender by HeightShow Trend-line and Equation

This trend-line does not satisfy the least-squares assumptions and it goes outside the valid range.

but it does pass through the joint mean.

Page 9: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 5

Intuitive idea of solution

This shape handles all heights: even if negative.

Page 10: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

1) Prepare data for logistic MLE regression

2) Solve for just the intercept

3) Solve for the intercept and slope

4) Generate various graphs

5) Test for statistical significance

6

Outline of Approach:Five Steps

To do: Get data at www.StatLit.org/xls/Model-Logistic-MLE-1Y1X-Excel2013-Data.xlsx

Page 11: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 7

1) To do: Load Data;Hide columns; Enter formula

Hide columns A-B and D-G. Let I2=1; J2=0Enter formula in M2-Q2 and J4

Page 12: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 8

To do: Select M2:Q2, Pull ↓ to 93Copy/Paste Value of J4 onto J5

Odds = Prob(Y=1)/[1-Prob(Y=1)] Range: 0 to infinityLogit = LN(Odds). Range: -infinity to +infinity.

Logistic regress: Logit = intercept + Slope*Height

Page 13: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 9

Result: Ln-Likelihood of OK;Original Sum = -63.82

Intercept of 1 give P(Y = 1 = male) = 73%. But 62% of these students are male. Solver should adjust intercept so P(Y=1) = 62%.

Page 14: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 10

2a) Solve for Intercept

From Data Menu, select Solver

Page 15: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 11

2b) Select Solver parameters

Set Objective to J4Set “To” to “Max”Set “change variable”

to I2

Set “Solving Method” to “GRC Nonlinear”

Press OK

Page 16: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 12

2c) Results: All constraints and conditions are satisfied.

Page 17: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 13

2d) Results are as expected:Probability of Male = 62%

If we must select a single value to predict the outcome, it would be the percentage of students who are men.

Page 18: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

Copy “Sum LnLk” from J4 to clipboard.Paste-Special Value in J6

14

2e) Store sum after each stepUse for statistical significance

Now solve for the slope in logistic regression

Page 19: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

.

15

3a) Solve for Slope and InterceptFrom Data menu, select Solver

Page 20: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 16

3b) Set Solver Parameters

Select Intercept and slope (I2:J2) in “Changing Variable Cells”.

Page 21: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 17

3c) Results: All constraints and conditions are satisfied

.

Page 22: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

Misclassified

18

3d) Error in row 6: short guy.Classified gals in 7, 11 & 12 OK

Page 23: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 19

3e) Prepare to test slope for statistical significance

From J4, copy “Sum LnLk” to clipboard. Paste-Special Value onto J7.

Page 24: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K 20

4a) Prepare data for Graphs

Start Ht-Graph at minimum of height in S2Enter formulas for Logit, Odds and Prob(Y=1).

Page 25: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

Select T2:V2. Drag down 1 row

Select S2:V3. Drag to bottom

Select columns S & Vfor logistic graph.

21

4b) Prepare X and Y data.X is Height. Y is Prob (Y=1)

Page 26: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

Original data: Col C & H; Logistic data: Col S & V

22

4c) Graph Logistic Regression of Gender by Height.

Logistic dataMarker No; Line Yes

Marker Yes; Line NoOriginal data

Page 27: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

Calculate difference with ‘full’; multiply by -2. Conduct a right-tail Chi2 test with 1 degree freedom.

Slope is statistically significant: P-value < 0.05

23

5) Hypothesis test: Is non-zeroslope statistically significant?

Page 28: Create Logistic MLE 1Y1X in Excel 2013 Sept 2015 V0K

Model Logistic MLE 1Y1X Excel 2013V0K

ACKNOWLEDGMENT:This presentation closely follows the Carlberg

(2012) presentation in Chapter 2: pages 21-52.These slides present the how – step by step –

of logistic regression for a single case. Carlberg (2012) discusses the how and the why.

REFERENCE:Carlberg, Conrad (2012). Decision Analytics:

Microsoft Excel. Que Publishing.

24

Acknowledgment and Reference