Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The...

21
Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test

Transcript of Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The...

Page 1: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

Example 12.4Possible Gender Discrimination in Salary at Fifth National Bank of Springfield

The Partial F Test

Page 2: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Objective

To use several partial F tests to see whether various groups of explanatory variables should be included in a regression equation for salary, given that other variables are already in the equation.

Page 3: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

BANK.XLS

Recall from Example 11.3 that the Fifth National Bank has 208 employees.

The data for these employees are stored in this file.

In the previous chapter we ran several regressions for Salary to see whether there is convincing evidence of salary discrimination against females.

We will continue this analysis here.

Page 4: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Analysis Overview First, we will regress Salary versus the Female

dummy, YrsExper, and the interactions between Female and YrsExper, labeled Fem_YrsExper. This will be the reduced equation.

Then we’ll see whether the JobGrade dummies Job_2 to Job_6 add anything significant to the reduced equation. If so, we will then see whether the interactions between the Female dummy and the JobGrade dummies, labeled Fem_Job2 to Fem_Job6, add anything significant to what we already have.

Page 5: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Analysis Overview -- continued

If so, we’ll finally see whether the education dummies Ed_2 to Ed_5 add anything significant to what we already have.

Page 6: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Solution

First, note that we created all of the dummies and interaction variables with StatPro’s Data Utilities procedures.

Also, note that we have used three sets of dummies, for gender, job grad and education level.

When we use these in a regression equation, the dummy for one category of each should always be excluded; it is the reference category. The reference categories we have used are “male”, job grade 1 and education level 1.

Page 7: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Solution -- continued The output for the “smallest” equation using Female,

YrsExper, and Fem_YrsExper as explanatory variables is shown here.

Page 8: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Solution -- continued

We’re off to a good start. These three variables already explain 63.9% of the variation of Salary.

The output for the next equation which adds the explanatory variables Job_2 to Job_6 is on the next slide.

This equation appears much better. For example R2 has increased to 81.1%. We check whether it is significantly better with the partial test in rows 26-30.

Page 9: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Page 10: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Solution -- continued The degrees of freedom in cell C28 is the same as the

value in cell C12, the degrees of freedom for SSE.

Then we calculate the F-ratio in cell C29 with the formula =((Reduced!D12-Complete!D12)/Complete!C27)/Complete!E12 were Reduced!D12 refers to SSE for the reduced equation from the Reduced sheet.

Finally, we calculate the corresponding p-value in cell C30 with the formula =FDIST(C29,C27,C28). It is practically 0, so there is no doubt that the job grade dummies add significantly to the explanatory power of the equation.

Page 11: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Solution -- continued Do the interactions between the Female dummy and the job

dummies add anything more?

We again use the partial F test, but now the previous complete equation becomes the new reduced equation, and the equation that includes the new interaction terms becomes the new equation.

The output for this new complete equation is shown on the next slide.

We perform the partial F test in rows 31-35 as exactly as before. The formula in C34 is =((Complete!D12-MoreComplete!D12)/MoreComplete!C32)/MoreComplete!E12.

Page 12: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Page 13: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Solution -- continued Again the p-value is extremely small, so there is no

doubt that the interaction terms add significantly to what we already had.

Finally, we add the education dummies.

The resulting output is shown on the next slide. We see how the terms reduced and complete are relative.

This output now corresponds to the complete equation, and the previous output corresponds to the reduced equation.

Page 14: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Page 15: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Solution -- continued The formula in cell C38 for the F-ratio is now

=((MoreComplete!D12-StillMoreComplete!D12/StillMoreComplete!C36)/StillMoreComplete!E12. The R2 value increased from 84.0% to 84.7%. Also the p-value is not extremely small.

According to the partial F test, it is not quite enough to qualify for statistical significance at the 5% level.

Based on this evidence, there is not much to gain from including the education dummies in the equation, so we would probably elect to exclude them.

Page 16: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Concluding Comments

First, the partial test is the formal test of significance for an extra set of variables. Many users look only at the R2 and/or se values to check whether extra variables are doing a “good job”.

Second, if the partial F test shows that a block of variables is significant, it does not imply that each variable in this block is significant. Some of these values can have low t-values.

Page 17: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Concluding Comments -- continued Third, producing all of these outputs and doing the

partial F tests is a lot of work. Therefore, we included a “Block” option in StatPro to make life easier. To run the analysis in this example use StatPro/Regression analysis/Block menu item. After selecting Salary as the response variable, we see this dialog box.

Page 18: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Concluding Comments -- continued We want four blocks of explanatory variables, and we

want a given block to enter only if it passes the partial F test at the 5% level. In later dialog boxes we specify the explanatory variables. Once we have specified all this, the regression calculations are done in stages. The output from this appears on the next two slides. The output spans over two figures. Note that the output for Block 4 has been left off because it did not pass the F test at 5%.

Page 19: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Page 20: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Page 21: Example 12.4 Possible Gender Discrimination in Salary at Fifth National Bank of Springfield The Partial F Test.

12.1 | 12.2 | 12.3 | 12.3a | 12.1a | 12.4a | 12.5

Concluding Comments -- continued Finally, we have concentrated on the partial F test and

statistical significance in this example. We don’t want you to lose sight, however, of the bigger picture. Once we have decided on a “final” regression equation we need to analyze its implications for the problem at hand.

In this case the bank is interested in possible salary discrimination against females, so we should interpret this final equation in these terms. Our point is simply that you shouldn’t get so caught in the details of statistical significance that you lose sight of the original purpose of the analysis!