Chapter 11 Chi-Square Distribution. Review So far, we have used several probability distributions...

Chapter 11 Chi-Square Distribution


Review

• So far, we have used several probability distributions for hypothesis testing and confidence intervals, including the normal distribution and Student’s t distribution.

• In this section, we will use the chi-square distribution.


What is Chi-Square?

• $\chi^2$ = Chi-Square

• The values of $\chi^2$ begin at 0 and are never negative. The graph of the $\chi^2$ distribution is not symmetrical, and, like Student’s t distribution, it depends on the number of degrees of freedom.

• It can determine if random variables are dependent or independent.

• It can determine if different populations share the same proportions of specified characteristics.



Mode (high point)

• The mode (high point) of a chi-square distribution with n degrees of freedom occurs over n − 2 (for n ≥ 3).


Formula for $\chi^2$

• $\chi^2 = \sum \frac{(O - E)^2}{E}$

• O = observed

• E = expected
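As a minimal sketch, the statistic can be computed by summing $(O - E)^2 / E$ over all cells. The function name `chi_square_statistic` and the two sample counts are illustrative assumptions, not part of the slides.

```python
def chi_square_statistic(observed, expected):
    """Return the sum of (O - E)^2 / E over paired observed/expected counts."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Illustrative counts only (not data from the slides).
observed = [10, 20]
expected = [15, 15]
chi_sq = chi_square_statistic(observed, expected)
print(round(chi_sq, 2))
```

Every expected count must be greater than zero, since each term divides by E.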


Degrees of Freedom

• Degrees of freedom = (number of rows − 1)(number of columns − 1), that is, d.f. = $(R - 1)(C - 1)$

• R = number of cell rows

• C = number of cell columns
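A short sketch of the degrees-of-freedom rule, assuming the table is stored as a list of rows that holds only the data cells (no row or column totals). The helper name `degrees_of_freedom` is hypothetical.

```python
def degrees_of_freedom(table):
    """Return (R - 1)(C - 1) for a table given as a list of equal-length rows."""
    rows = len(table)
    cols = len(table[0])
    if any(len(row) != cols for row in table):
        raise ValueError("every row must have the same number of columns")
    return (rows - 1) * (cols - 1)


# Placeholder cells: only the shape of the table matters here.
three_by_three = [[0] * 3 for _ in range(3)]
two_by_four = [[0] * 4 for _ in range(2)]

df_3x3 = degrees_of_freedom(three_by_three)  # (3 - 1)(3 - 1)
df_2x4 = degrees_of_freedom(two_by_four)     # (2 - 1)(4 - 1)
print(df_3x3, df_2x4)
```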


Example: (The situation)

• Innovative Machines Incorporated has developed two new letter arrangements for computer keyboards. The company wishes to see if there is any relationship between the arrangement of letters on the keyboard and the number of hours it takes a new typing student to learn to type at 20 words per minute. Or, from another point of view, is the time it takes a student to learn to type independent of the arrangement of the letters on a keyboard? Use a 5% level of significance.


Example: (step 1)

• $H_0$: Keyboard arrangement and learning times are independent.

• $H_1$: Keyboard arrangement and learning times are not independent.


Example: (chart)

Step 2: Determine E


Answer for E (will show in class)

Keyboard       21-40 h        41-60 h        61-80 h        Row Total
A              O: 25, E: 24   O: 30, E: 40   O: 25, E: 16   80
B              O: 30, E: 36   O: 71, E: 60   O: 19, E: 24   120
Standard       O: 35, E: 30   O: 49, E: 50   O: 16, E: 20   100
Column Total   90             150            60             300 (sample size)

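Remember: E = (Row total)(Column total) / (Sample size). For example, for keyboard A and 21-40 h, E = (80)(90)/300 = 24.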
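The expected frequencies in the keyboard table can be reproduced from the row and column totals. The sketch below assumes the observed counts are stored as a list of rows; the helper name `expected_counts` is hypothetical.

```python
def expected_counts(observed):
    """Return E = (row total)(column total) / (sample size) for every cell."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


# Observed counts from the keyboard example (columns: 21-40 h, 41-60 h, 61-80 h).
keyboard_observed = [
    [25, 30, 25],  # A
    [30, 71, 19],  # B
    [35, 49, 16],  # Standard
]

keyboard_expected = expected_counts(keyboard_observed)
for row in keyboard_expected:
    print(row)
```

The printed rows should match the E values in the answer table (24, 40, 16 for keyboard A, and so on).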


Chart to find $\chi^2$

Cell   O    E    $O - E$   $(O - E)^2$   $(O - E)^2 / E$
1      25   24   1         1             0.04
2      30   40   -10       100           2.50
3      25   16   9         81            5.06
4      30   36   -6        36            1.00
5      71   60   11        121           2.02
6      19   24   -5        25            1.04
7      35   30   5         25            0.83
8      49   50   -1        1             0.02
9      16   20   -4        16            0.80


What is $\chi^2$ then?

• Add up all the numbers in the last column of the chart:

$\chi^2 = 0.04 + 2.50 + 5.06 + 1.00 + 2.02 + 1.04 + 0.83 + 0.02 + 0.80 = 13.31$
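The sum can be checked with a few lines of code. This is a sketch that uses the observed and expected counts for cells 1 through 9 of the keyboard example; the variable names are illustrative. Rounding each term to two decimals before adding gives 13.31, while adding the unrounded terms gives a slightly larger value of about 13.32.

```python
# Observed and expected counts for cells 1-9 of the keyboard example.
observed = [25, 30, 25, 30, 71, 19, 35, 49, 16]
expected = [24, 40, 16, 36, 60, 24, 30, 50, 20]

# One (O - E)^2 / E term per cell.
contributions = [(o - e) ** 2 / e for o, e in zip(observed, expected)]

# Round each term to two decimals first, as a hand calculation would.
rounded_terms = [round(term, 2) for term in contributions]
chi_sq_from_rounded = round(sum(rounded_terms), 2)

# Sum without intermediate rounding.
chi_sq_unrounded = sum(contributions)

print(rounded_terms)
print(chi_sq_from_rounded, round(chi_sq_unrounded, 4))
```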


Example: (Degrees of freedom for test of independence)

• d.f. = $(R - 1)(C - 1) = (3 - 1)(3 - 1) = 4$


Conclusion

• Look up the P-value in the chi-square table in the book.

• We have $\chi^2 = 13.31$ with d.f. = 4.

• The corresponding P-value falls between 0.005 and 0.010.

• Since 0.005 < P-value < 0.010, the P-value is less than 0.05, so we reject the null and accept the alternate. At the 5% level of significance, we conclude that keyboard arrangement and learning time are not independent.
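The table lookup can be cross-checked numerically. The sketch below is not the method used in the slides, which read the P-value from a printed table. It relies on a standard closed form for the right-tail area that is valid only when the degrees of freedom are even, and the function name `chi_square_p_value_even_df` is hypothetical.

```python
import math


def chi_square_p_value_even_df(chi_sq, df):
    """Right-tail area P(X >= chi_sq) for a chi-square distribution, even df only.

    Uses exp(-x/2) * sum_{j=0}^{df/2 - 1} (x/2)^j / j!.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form needs a positive, even df")
    half = chi_sq / 2
    series = sum(half ** j / math.factorial(j) for j in range(df // 2))
    return math.exp(-half) * series


p_value = chi_square_p_value_even_df(13.31, 4)
alpha = 0.05
reject_null = p_value < alpha
print(round(p_value, 4), reject_null)
```

The computed value lies between 0.005 and 0.010, in agreement with the table.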


Group Work (the situation)

• A vending machine company is to install soda machines in elementary schools and high schools. The market analyst wishes to know if flavor preference and school level are independent. A random sample of 200 students was taken. Their school levels and soda preferences are given. Is independence indicated at the 1% level of significance?


Group Work (table)

Soda           High School      Elementary       Row Total
Coke           O: 33, E: ___    O: 57, E: ___    90
Pepsi          O: 30, E: ___    O: 20, E: ___    50
Mountain Dew   O: 5, E: ___     O: 35, E: ___    40
Fanta          O: 12, E: ___    O: 8, E: ___     20
Column Total   80               120              200 (sample size)


How to Test for independence of two statistical variables

• Look at Pg 582. Copy it and follow it!


Test of homogeneity

• A test of homogeneity tests the claim that different populations share the same proportions of specified characteristics.


Test of Homogeneity

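• The procedure is very much the same as the test for independence, except that the hypotheses are different.

• For test of independence: $H_0$: the variables are independent; $H_1$: the variables are not independent.

• For test of homogeneity: $H_0$: the populations share the same proportions of the specified characteristics; $H_1$: the populations do not share the same proportions.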


Example:

• If you could own one pet, what kind would you choose? The responses are shown below. Do the same proportions of males and females prefer each type of pet? Use a 1% level of significance.

Gender   Dog   Cat   Other pet   No Pet
Female   120   132   18          30
Male     135   70    20          25


Fill this out

Gender         Dog               Cat               Other pet        No Pet           Row Total
Female         O: 120, E: ___    O: 132, E: ___    O: 18, E: ___    O: 30, E: ___    ___
Male           O: 135, E: ___    O: 70, E: ___     O: 20, E: ___    O: 25, E: ___    ___
Column Total   ___               ___               ___              ___              ___


Answer

Gender         Dog                  Cat                  Other pet           No Pet          Row Total
Female         O: 120, E: 139.09    O: 132, E: 110.18    O: 18, E: 20.73     O: 30, E: 30    300
Male           O: 135, E: 115.91    O: 70, E: 91.82      O: 20, E: 17.27     O: 25, E: 25    250
Column Total   255                  202                  38                  55              550 (sample size)


Fill this out

Cell   O     E     $(O - E)^2 / E$
1
2
3
4
5
6
7
8


Answer

Cell   O     E        $(O - E)^2 / E$
1      120   139.09   2.620
2      132   110.18   4.320
3      18    20.73    0.359
4      30    30       0
5      135   115.91   3.144
6      70    91.82    5.185
7      20    17.27    0.431
8      25    25       0


Final Answer

• Chi-square = 16.059

• d.f. = 3

• P-value ≈ 0.001

• Since the P-value is less than 0.01, we reject the null, which says the preferences are the same, and accept the alternate, which says the preferences are different. At the 1% level of significance, we conclude that male and female students have different preferences when it comes to selecting a pet.
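The whole homogeneity test for the pet data can be sketched end to end. The helper names below are hypothetical, and the P-value comes from standard closed forms for integer degrees of freedom rather than from the printed table used in class.

```python
import math


def expected_counts(observed):
    """E = (row total)(column total) / (sample size) for every cell."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square_p_value(chi_sq, df):
    """Right-tail area for a chi-square distribution with integer df >= 1."""
    half = chi_sq / 2
    if df % 2 == 0:
        series = sum(half ** j / math.factorial(j) for j in range(df // 2))
        return math.exp(-half) * series
    # Odd df = 2m + 1: start from erfc and add m correction terms.
    m = df // 2
    series = sum(half ** (j - 0.5) / math.gamma(j + 0.5) for j in range(1, m + 1))
    return math.erfc(math.sqrt(half)) + math.exp(-half) * series


# Observed counts (columns: Dog, Cat, Other pet, No Pet).
pets_observed = [
    [120, 132, 18, 30],  # Female
    [135, 70, 20, 25],   # Male
]

pets_expected = expected_counts(pets_observed)
chi_sq = sum(
    (o - e) ** 2 / e
    for obs_row, exp_row in zip(pets_observed, pets_expected)
    for o, e in zip(obs_row, exp_row)
)
df = (len(pets_observed) - 1) * (len(pets_observed[0]) - 1)
p_value = chi_square_p_value(chi_sq, df)

print(round(chi_sq, 3), df, round(p_value, 4))
print("reject null at 1%:", p_value < 0.01)
```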


Homework Practice

• Pg 588 #1-15 even


CHI-SQUARE: GOODNESS OF FIT


Reason Behind Goodness of Fit

• Set up a test to investigate how well a sample distribution fits a given distribution

• Use observed and expected frequencies to compute the sample chi-square statistic

• Find or estimate the P-value and complete the test


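Hypothesis Testing

• $H_0$: The population fits the given distribution.

• $H_1$: The population has a different distribution.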


Sample statistic

• $\chi^2 = \sum \frac{(O - E)^2}{E}$, with degrees of freedom = $k - 1$

• E = Expected frequency

• O = Observed frequency

• k = number of categories in the distribution


Question

• Is the present distribution of favorable responses the same as last year's, or is it different? To test this hypothesis, a random sample of 500 employees was taken. The chart is on the next slide. Use a 1% level of significance.


Example

Category                         Percentage of Favorable Responses   Observed
Vacation time                    4%                                  30
Salary                           65%                                 290
Safety regulations               13%                                 70
Health and retirement benefits   12%                                 70
Overtime policy and pay          6%                                  40


Answer

Category                         O     E     $(O - E)^2$   $(O - E)^2 / E$
Vacation time                    30    20    100           5.00
Salary                           290   325   1225          3.77
Safety regulations               70    65    25            0.38
Health and retirement benefits   70    60    100           1.67
Overtime                         40    30    100           3.33
Total                            500   500                 14.15


Answer

• d.f. = $k - 1 = 5 - 1 = 4$

• Since 0.005 < P-value < 0.010, the P-value is less than 0.01.

• Reject null, accept alternate.

• At the 1% level of significance, the evidence supports the conclusion that this year’s responses to the issues are different from last year’s, because we reject the null, which says they are the same, and accept the alternate, which says they are different.
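The goodness-of-fit calculation for the employee survey can be sketched as follows. The function name `goodness_of_fit` is hypothetical, and the P-value uses a closed form that holds only for even degrees of freedom, which is an assumption that happens to fit this example (d.f. = 4); the slides instead read the P-value from a table.

```python
import math


def goodness_of_fit(observed, proportions):
    """Return (expected counts, chi-square statistic, df = k - 1)."""
    n = sum(observed)
    expected = [n * p for p in proportions]
    chi_sq = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return expected, chi_sq, len(observed) - 1


# Order: vacation time, salary, safety regulations, benefits, overtime.
observed = [30, 290, 70, 70, 40]
proportions = [0.04, 0.65, 0.13, 0.12, 0.06]

expected, chi_sq, df = goodness_of_fit(observed, proportions)

# Right-tail area for even df: exp(-x/2) * sum_{j < df/2} (x/2)^j / j!
half = chi_sq / 2
p_value = math.exp(-half) * sum(
    half ** j / math.factorial(j) for j in range(df // 2)
)

print([round(e, 2) for e in expected])
print(round(chi_sq, 2), df, round(p_value, 4))
print("reject null at 1%:", p_value < 0.01)
```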


Group Work

• The age distribution of the Canadian population and the age distribution of a random sample of 455 residents in the Indian community (Red Lake Village) are shown below.

• Use a 5% level of significance to test the claim that the age distribution of the general Canadian population fits the age distribution of the residents of Red Lake Village.

Age       % of population   Observed in Red Lake Village
Under 5   7.2%              47
5-14      13.6%             75
15-64     67.1%             288
65 +      12.1%             45


Answer

• 0.005 < P-value < 0.01

• Reject null; accept alternate

• ***insert conclusion***


Homework Practice

• Pg 597 #1-18 even