Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and...

21
Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation

Transcript of Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and...

Page 1: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

Review for the chapter 6 test

6.1 Scatter plots & Correlation6.2 Regression, Prediction, and

Causation

Page 2: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

1. Which could be the least squares regression equation for the graph?A) y= aB) y= a + bxC) y=a – bxD) not enough information to tell

C

Page 3: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

2. Which could be the least squares regression equation for the graph?A) y= aB) y= a + bxC) y=a – bx

D) not enough information to tell

A

Page 4: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

3. The correlation between the heights of fathers and the heights of their (grownup) sons is r=0.52. This tells us that,A) taller than average fathers tend to have taller then average sonsB) taller than average fathers tend to have shorter then average sonsC) sons are, on the average, taller then their fathersD) there is a moderate to weak correlation between heights of fathers and sons

D

Page 5: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

4. There is a positive correlation between the size of a hospital (measured by number of beds) and the median number of days that patients remain in the hospital. Does this mean that you can shorten a hospital stay by choosing to go to a small hospital?(a) No -- a negative correlation would allow that conclusion, but this correlation is positive.(b) Yes -- the data show that stays are shorter in smaller hospitals.(c) No -- the positive correlation is probably explained by the fact that seriously ill peoplego to large hospitals(d) Yes -- the correlation can't just be an accident.

C

Page 6: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

5. Assuming this is true from the previous question that the positive correlation is probably explained by the fact that seriously ill people go to large hospitals, this would be an example of,

A) Common ResponseB) Voluntary ResponseC) ConfoundingD) Causation

A

Page 7: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

6. You compute the correlation coefficient between hours of TV watched and grade point average for a sample of college undergraduates and obtain r = -1.83. This means that

A) you made an arithmetic mistake

B) students who watch more TV tend to get lower grades

C) students who watch more TV tend to get higher grades

D) you can conclude that radiation from TV screens causes gradual brain damage.

E) you can conclude that students who get good grades gradually lose their ability to appreciate TV.

A

Page 8: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

7. The correlation r=0.52 in the previous question shows that the fact that fathers have different heights,A) explains about 27% of the observed variation in their sons’ heightsB) explains about 52% of the observed variation in their sons’ heightsC) explains about 73% of the observed variation in their sons’ heightsD) explains about 95% of the observed variation in their sons’ heightsE) explains why some sons look a lot like their fathers

A

Page 9: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

8. Suppose that the least squares regression line for predicting y from x is y = 100 - 1.3x.Which of the following is a possible value for the correlation between y and x?(a) 0.3 (b) -1.3 (c) 0 (d) -0.5 (e) 0.5

D

Page 10: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

B

9. The equation of the regression line for son's height in inches y versus father's height in inches x is y = 0.5x + 35. For 72 inch tall fathers, what is the mean height of their sons?(a) 69 inches (b) 71 inches (c) 72 inches (d) 74 inches(e) None of the above.

Page 11: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

10. If fathers' heights were measured in feet (one foot equals 12 inches), and sons' heights were measured in furlongs (one furlong equals 7920 inches), the correlation between heights of fathers and heights of sons would be

(a) much smaller that 0.52(b) slightly smaller than 0.52(c) unchanged: equal to 0.52(d) slightly larger than 0.52(e) much larger that 0.52 C

Page 12: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

11. Perfect correlation means all of the following except

(a) r = -1 or r = +1. (b) all points on the scatter plot lie

on a straight line(c) all variation in one variable is

explained by variation in the other variable.

(d) there is a causal relationship between the variables.

(e) each variable is a perfect predictor of the other.

D

Page 13: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

12. Which of the following are most likely to be negatively correlated?

(a) The total floor space and the price of an apartment in New York.

(b) The percentage of lean body mass and the time it takes to run a mile for male college students.

(c) The heights and yearly earnings of 35 year old U.S. adults.

(d) Gender and yearly earnings among 35 year old U.S. adults.

B

Page 14: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

13. The correlation between average monthly temperature x and monthly natural gas consumption y over a period of months at Lincoln High School is -0.86. Which of the following operations would change the value of the correlation?

(a) Measure gas consumption in cubic meters instead of cubic feet.

(b) Remove two outliers from the data before doing the calculation.

(c) Measure temperature in degrees Kelvin instead of in degrees Fahrenheit.

(d) All of (a), (b), and (c) would change the value of the correlation.

B

Page 15: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

E

14. Here is a scatter plot of the percent of games won by 11 basketball teams versus the percent of their shots that they made. What is the correlation between these two variables?

(a) about -0.7

(b) about -0.3 (c) close to 0

(d) about 0.3

(e) about 0.7

Page 16: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

15. Suppose that the correlation between the scores of students on Exam 1 and Exam 2 in a statistics class is r = 0.7. One way to interpret r is to say what percent of the variation in Exam 2 scores can be explained by the straight line relationship between Exam 2 scores and Exam 1 scores. This percent is about

(a) 84% (b) 70% (c) 49% (d) 30%

C

Page 17: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

16. A study of home heating costs collects data on the size of houses and the monthly cost to heat the houses with natural gas. Here are the data.Just by looking at the data (don't do a calculation) you can see that the correlation between house size and heating cost is

(a) close to zero.(b) clearly positive.(c) clearly negative.(d) not close to zero, but could be either positive or negative.(e) makes no sense for these data.

B

Page 18: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

17. A study gathers data on the outside temperature during the winter, in degrees Fahrenheit, and the amount of natural gas a household consumes, in cubic feet per day. Call the temperature x and gas consumption y. The house is heated with gas, so x helps explain y. The least-squares regression line for predicting y from x is y = 1344 - 19x. The next four questions concern

this line.On a day when the temperature is 20F, the regression line predicts that gas used will be about

(a) 1724 cubic feet (b) 1383 cubic feet

(c) 1325 cubic feet (d) 964 cubic feet(e) None of these. D

Page 19: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

A study gathers data on the outside temperature during the winter, in degrees Fahrenheit, and the amount of natural gas a household consumes, in cubic feet per day. Call the temperature x and gas consumption y. The house is heated with gas, so x helps explain y. The least-squares regression line for predicting y from x is y = 1344 - 19x. The next three questions concern this line.

18. We can see from the equation of the line that (a) as the temperature x goes up, gas used y goes up, because the

slope 1344 is positive(b) as the temperature x goes up, gas used y goes up, because the

slope 19 is positive(c) as the temperature x goes up, gas used y goes down because the

slope 1344 is bigger than 19. (d) as the temperature x goes up, gas used y goes down, because the

slope -19 is negative

D

Page 20: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

A study gathers data on the outside temperature during the winter, in degrees Fahrenheit, and the amount of natural gas a household consumes, in cubic feet per day. Call the temperature x and gas consumption y. The house is heated with gas, so x helps explain y. The least-squares regression line for predicting y from x is y = 1344 - 19x. The next two questions concern this line.

19. When the temperature goes up 1 degree, what happens to the gas usage predicted by the regression line?

(a) It goes up 1 cubic foot (b) It goes down 1 cubic foot (c) It goes up 19 cubic feet (d) It goes down 19 cubic feet(e) Can’t tell without seeing the data

D

Page 21: Review for the chapter 6 test 6.1 Scatter plots & Correlation 6.2 Regression, Prediction, and Causation.

A study gathers data on the outside temperature during the winter, in degrees Fahrenheit, and the amount of natural gas a household consumes, in cubic feet per day. Call the temperature x and gas consumption y. The house is heated with gas, so x helps explain y. The least-squares regression line for predicting y from x is y = 1344 - 19x. This is the last question that concerns this line.

20. The correlation between temperature x and gas usage y is r = -0.7. Which of the following would not change r?

(a) measuring temperature in degrees Celsius instead of degrees Fahrenheit (b) removing two outliers from the data used to calculate r.(c) measuring gas usage in hundreds of cubic feet, so that all values of y are

divided by 100.

(d) Both a and c. (e) All of a, b, and c. D