Regression with a Binary Dependent Variable (SW Chapter 11)

56
1 Regression with a Binary Dependent Variable (SW Chapter 11)

description

Regression with a Binary Dependent Variable (SW Chapter 11). Example: Mortgage denial and race The Boston Fed HMDA data set . The Linear Probability Model. The Linear Probability Model. Example : Linear Prob Model. Linear probability model: HMDA data. Linear probability model: HMDA data. - PowerPoint PPT Presentation

Transcript of Regression with a Binary Dependent Variable (SW Chapter 11)

Page 1: Regression with a Binary Dependent Variable  (SW Chapter 11)

1

Regression with a Binary Dependent Variable (SW Chapter 11)

Page 2: Regression with a Binary Dependent Variable  (SW Chapter 11)

2

Example: Mortgage denial and raceThe Boston Fed HMDA data set

Page 3: Regression with a Binary Dependent Variable  (SW Chapter 11)

3

The Linear Probability Model

Page 4: Regression with a Binary Dependent Variable  (SW Chapter 11)

4

The Linear Probability Model

Page 5: Regression with a Binary Dependent Variable  (SW Chapter 11)

5

Example: Linear Prob Model

Page 6: Regression with a Binary Dependent Variable  (SW Chapter 11)

6

Linear probability model: HMDA data

Page 7: Regression with a Binary Dependent Variable  (SW Chapter 11)

7

Linear probability model: HMDA data

Page 8: Regression with a Binary Dependent Variable  (SW Chapter 11)

8

Linear probability model: ApplicationCattaneo, Galiani, Gertler, Martinez, and Titiunik (2009). “Housing, Health, and Happiness.” American Economic Journal: Economic Policy 1(1): 75 - 105

• What was the impact of Piso Firme, a large-scale Mexican program to help families replace dirt floors with cement floors?

• A pledge by governor Enrique Martinez y Martinez led to State of Coahuila offering free 50m2 of cement flooring ($150 value), starting in 2000, for homeowners with dirt floors

Page 9: Regression with a Binary Dependent Variable  (SW Chapter 11)

9

Cattaneo et al. (AEJ:Economic Policy 2009) “Housing, Health, & Happiness” X1 = “Program dummy” = 1 if offered Piso Firme.

Page 10: Regression with a Binary Dependent Variable  (SW Chapter 11)

10

Cattaneo et al. (AEJ:Economic Policy 2009) “Housing, Health, & Happiness”

Interpretations?

X1 = “Program dummy” = 1 if offered Piso Firme

Page 11: Regression with a Binary Dependent Variable  (SW Chapter 11)

11

Probit and Logit Regression

Page 12: Regression with a Binary Dependent Variable  (SW Chapter 11)

12

Probit Regression

Page 13: Regression with a Binary Dependent Variable  (SW Chapter 11)

13

STATA Example: HMDA data

Page 14: Regression with a Binary Dependent Variable  (SW Chapter 11)

14

STATA Example: HMDA data, ctd.

Page 15: Regression with a Binary Dependent Variable  (SW Chapter 11)

15

Probit regression with multiple regressors

Page 16: Regression with a Binary Dependent Variable  (SW Chapter 11)

16

STATA Example: HMDA data

Page 17: Regression with a Binary Dependent Variable  (SW Chapter 11)

17

STATA Example: HMDA data

Page 18: Regression with a Binary Dependent Variable  (SW Chapter 11)

18

STATA Example: HMDA data

Page 19: Regression with a Binary Dependent Variable  (SW Chapter 11)

19

Probit Regression Marginal Effects

Page 20: Regression with a Binary Dependent Variable  (SW Chapter 11)

20

Probit Regression Marginal Effects. sum pratio; Variable | Obs Mean Std. Dev. Min Max-------------+-------------------------------------------------------- pratio | 1140 1.027249 .286608 .497207 2.324675. scalar meanpratio = r(mean);. sum disp_pepsi; Variable | Obs Mean Std. Dev. Min Max-------------+-------------------------------------------------------- disp_pepsi | 1140 .3640351 .4813697 0 1. scalar meandisp_pepsi = r(mean);. sum disp_coke; Variable | Obs Mean Std. Dev. Min Max-------------+-------------------------------------------------------- disp_coke | 1140 .3789474 .4853379 0 1. scalar meandisp_coke = r(mean);

. probit coke pratio disp_coke disp_pepsi;

Iteration 0: log likelihood = -783.86028 Iteration 1: log likelihood = -711.02196 Iteration 2: log likelihood = -710.94858 Iteration 3: log likelihood = -710.94858

Probit regression Number of obs = 1140 LR chi2(3) = 145.82 Prob > chi2 = 0.0000Log likelihood = -710.94858 Pseudo R2 = 0.0930

------------------------------------------------------------------------------ coke | Coef. Std. Err. z P>|z| [95% Conf. Interval]-------------+---------------------------------------------------------------- pratio | -1.145963 .1808833 -6.34 0.000 -1.500487 -.791438 disp_coke | .217187 .0966084 2.25 0.025 .027838 .4065359 disp_pepsi | -.447297 .1014033 -4.41 0.000 -.6460439 -.2485502 _cons | 1.10806 .1899592 5.83 0.000 .7357465 1.480373------------------------------------------------------------------------------

Page 21: Regression with a Binary Dependent Variable  (SW Chapter 11)

21

Probit Regression Marginal Effects

Page 22: Regression with a Binary Dependent Variable  (SW Chapter 11)

22

Logit Regression

Page 23: Regression with a Binary Dependent Variable  (SW Chapter 11)

23

STATA Example: HMDA data

Page 24: Regression with a Binary Dependent Variable  (SW Chapter 11)

24

Predicted probabilities from estimated probit and logit models usually are (as usual) very close in this application.

Page 25: Regression with a Binary Dependent Variable  (SW Chapter 11)

25

Logit Regression Marginal Effects

Page 26: Regression with a Binary Dependent Variable  (SW Chapter 11)

26

Comparison of Marginal EffectsLPM Probit Logit

Marginal Effect at Means for Price Ratio

-.4008(.0613)

-.4520(.0712) via nlcom

-.4905(.0773) via nlcom

Average Marginal Effect of Price Ratio

-.4008(.0613)

-.4096(beyond eco205)

-.4332(beyond eco205)

Marginal Effect at Means for Coke display dummy

.0771(.0343)

.0856(.0381) via nlcom

.0864(.0390) via nlcom

Average Marginal Effect For Coke display dummy

.0771(.0343)

.0776(beyond eco205)

.0763(beyond eco205)

Page 27: Regression with a Binary Dependent Variable  (SW Chapter 11)

27

Probit model: ApplicationArcidiacono and Vigdor (2010). “Does the River Spill Over? Estimating the Economic Returns to Attending a Racially Diverse College.” Economic Inquiry 48(3): 537 – 557.

• Does “diversity capital” matter and does minority representation increase it?

• Does diversity improve post-graduate outcomes of non-minority students?

• College & Beyond survey, starting college in 1976

Page 28: Regression with a Binary Dependent Variable  (SW Chapter 11)

28

Arcidiacono & Vigdor (EI, 2010)

Page 29: Regression with a Binary Dependent Variable  (SW Chapter 11)

29

Arcidiacono & Vigdor (EI, 2010)

Page 30: Regression with a Binary Dependent Variable  (SW Chapter 11)

30

Arcidiacono & Vigdor (EI, 2010)

Page 31: Regression with a Binary Dependent Variable  (SW Chapter 11)

31

Logit model: ApplicationBodvarsson & Walker (2004). “Do Parental Cash Transfers Weaken Performance in College?” Economics of Education Review 23: 483 – 495.

• When parents pay for tuition & books does this undermine the incentive to do well?

• Univ of Nebraska @ Lincoln & Washburn Univ in Topeka, KS, 2001-02 academic year

Page 32: Regression with a Binary Dependent Variable  (SW Chapter 11)

32

Bodvarsson & Walker (EconEduR,2004)

Page 33: Regression with a Binary Dependent Variable  (SW Chapter 11)

33

Bodvarsson & Walker (EconEduR,2004)

Page 34: Regression with a Binary Dependent Variable  (SW Chapter 11)

34

Estimation and Inference in Probit (and Logit) Models

Page 35: Regression with a Binary Dependent Variable  (SW Chapter 11)

35

Probit estimation by maximum likelihood

Page 36: Regression with a Binary Dependent Variable  (SW Chapter 11)

36

Special case: probit MLE with no X

Page 37: Regression with a Binary Dependent Variable  (SW Chapter 11)

37

Page 38: Regression with a Binary Dependent Variable  (SW Chapter 11)

38

Page 39: Regression with a Binary Dependent Variable  (SW Chapter 11)

39

Page 40: Regression with a Binary Dependent Variable  (SW Chapter 11)

40

The MLE in the “no-X” case (Bernoulli distribution), ctd.:

Page 41: Regression with a Binary Dependent Variable  (SW Chapter 11)

41

The MLE in the “no-X” case (Bernoulli distribution), ctd:

Page 42: Regression with a Binary Dependent Variable  (SW Chapter 11)

42

The probit likelihood with one X

Page 43: Regression with a Binary Dependent Variable  (SW Chapter 11)

43

The probit likelihood function:

Page 44: Regression with a Binary Dependent Variable  (SW Chapter 11)

44

The Probit MLE, ctd.

Page 45: Regression with a Binary Dependent Variable  (SW Chapter 11)

45

The logit likelihood with one X

Page 46: Regression with a Binary Dependent Variable  (SW Chapter 11)

46

Measures of fit for logit and probit

Page 47: Regression with a Binary Dependent Variable  (SW Chapter 11)

47

Application to the Boston HMDA Data (SW Section 11.4)

Page 48: Regression with a Binary Dependent Variable  (SW Chapter 11)

48

The HMDA Data Set

Page 49: Regression with a Binary Dependent Variable  (SW Chapter 11)

49

The loan officer’s decision

Page 50: Regression with a Binary Dependent Variable  (SW Chapter 11)

50

Regression specifications

Page 51: Regression with a Binary Dependent Variable  (SW Chapter 11)

51

Page 52: Regression with a Binary Dependent Variable  (SW Chapter 11)

52

Page 53: Regression with a Binary Dependent Variable  (SW Chapter 11)

53

Page 54: Regression with a Binary Dependent Variable  (SW Chapter 11)

54

Table 11.2, ctd.

Page 55: Regression with a Binary Dependent Variable  (SW Chapter 11)

55

Table 11.2, ctd.

Page 56: Regression with a Binary Dependent Variable  (SW Chapter 11)

56

Summary of Empirical Results