Topics 19 - 20 Unit 4 – Inference from Data: Principles.

of 24 /24
Topics 19 - 20 Unit 4 – Inference from Data: Principles

Embed Size (px)

Transcript of Topics 19 - 20 Unit 4 – Inference from Data: Principles.

  • Slide 1

Slide 2 Topics 19 - 20 Unit 4 Inference from Data: Principles Slide 3 TOPIC 19 CONFIDENCE INTERVALS: MEANS Slide 4 Topic 19 - Confidence Interval: Mean, is unknown The estimated standard deviation of the sample statistic X-bar is called the standard error The purpose of confidence intervals is to use the sample statistic to construct an interval of values that you can be reasonably confident contains the actual, though unknown, parameter. Confidence Interval for a population proportion : where n >= 30 t * is calculated based on level of confidence When running for example 95% Confidence Interval: 95% is called Confidence Level and we are allowing possible 5% for error, we call this alpha ( )= 5% where is the significant level Slide 5 Topic 19 - Confidence Interval: Mean, is unknown Use if the sample data is given, use the Stat, Edit and enter data in the calculator before running the Confidence Interval L 1 is where data is entered by you C-Level: is the level you are running the Confidence Interval Use if the information about sample data is given. X-Bar mean of sample data Sx is Standard deviation of the sample n is sample size C-Level: is the level you are running the Confidence Interval Slide 6 Activity 19-3: M&M Consumption Slide 7 Travel time to work. A study of commuting times reports the travel times to work of a random sample of 20 employed adults in New York State. The mean is = 31.25 minutes and the standard deviation is s = 21.88 minutes. What is the standard error of the mean?samplemeanstandard deviationstandard error s/n = 21.88/20 = 4.8925 minutes. Slide 8 Ancient air. The composition of the earths atmosphere may have changed overtime. To try to discover the nature of the atmosphere long ago, we can examine the gas in bubbles inside ancient amber. Amber is tree resin that has hardened and been trapped in rocks. The gas in bubbles within amber should be a sample of the atmosphere at the time the amber was formed. Measurements on specimens of amber from the late Cretaceous era (75 to 95 million years ago) give these percents of nitrogen: 63.465.064.463.354.864.560.849.151.0 Assume (this is not yet agreed on by experts) that these observations are an SRS from the late Cretaceous atmosphere. Use a 90% confidence interval to estimate the mean percent of nitrogen in ancient air. Slide 9 Enter data for L1. 95% confidence Interval: Using TI83, under Stat, TEST, Choose option 8:TInterval Mean of the sample = 59.6 Standard deviation = 6.26 Degree of freedom = df= 8 Confidence interval for mean percent of nitrogen is between 54.8 and 64.4. Ancient air. Slide 10 Exercise 19-15 Page 414 Exercise 19-23 Page 417 Exercise 19-24 Page 417 Slide 11 TOPIC 20 TEST OF SIGNIFICANCE: MEANS Slide 12 Topic 20 Test of Significant: Mean The purpose of Test of Significant is when we do know the population Parameter but we do not necessary agree with it or we have question about it. To do the test we need to run a sample and we use the statistic to test its validity. Step 2: we initiate hypothesis regarding the question we can not run test of significant without establishing the hypothesis Step 3: Decide what test we have to run, in case of proportion, we use t-test Step 1: Identify and define the parameter. Slide 13 Topic 20 Test of Significant: Mean Step 4: Run the test from calculator Step 5: From the calculator write down the p-value T-test Step 6: Compare your p-value with alpha Significant Level If p-value is smaller than we reject the null hypothesis, then it is statistically significant based on data. If p-value is greater than the we Fail to reject the null hypothesis, then it is not statistically significant based on data. Last step: we write conclusion based on step 6 at significant level p- value > 0.1: little or no evidence against H 0 0.05 < p- value H o : = 115 H a : 115 H o : = 115 H a : < 115 H o : = 115 H a : > 115 Few Possible cases to look at: A teacher suspects that the mean for older students is higher than 115 A teacher suspects that the mean for older students is same or more than 115 A teacher suspects that the mean for older students is also 115 Higher than means (> 115) The opposite of higher than is less than or equal to 115 ( 115) Comparing the two, null hypothesis is the comparison that includes equality (=) Same or more than means (> 115) The opposite of same or more than is less than 115 (< 115) Same means (= 115) The opposite of same is not equal to 115 ( 115) One-sided alternative Two-sided alternative Slide 15 Fuel economy. According to the Environmental Protection Agency (EPA), the Honda Civic hybrid car gets 51 miles per gallon (mpg) on the highway. The EPA ratings often overstate true fuel economy. Larry keeps careful records of the gas mileage of his new Civic hybrid for 3000 miles of highway driving. His result is x-bar= 47.2 mpg. Larry wonders whether the data show that his true long-term average highway mileage is less than 51 mpg. What are his null and alternative hypotheses? Answer Larry wonders whether the data show that his true long-term average highway mileage is less than 51 mpg. H0: = 51 mpg; Ha: < 51 mpg. Slide 16 Problem If a researcher is interested in testing whether the mean is different from some claimed value, 55, then the null and alternative are test the hypotheses H0: = 55, Ha: 55 Slide 17 Stating hypotheses. In planning a study of the birth weights of babies whose mothers did not see a doctor before delivery, a researcher states the hypotheses as H 0 : x-bar = 1000 grams H a : x-bar < 1000 grams Whats wrong with this? Hypotheses should be stated in terms of , not x-bar. Slide 18 Topic 20 Test of Significant: Mean, is unknown Use if the sample data is given, use the Stat, Edit and enter data in the calculator before running the T-test 0 is meanvalue in question List: L1 where the raw data is entered by you : is the alternative hypothesis Use if the information about sample data is given. 0 is meanvalue in question X-bar is sample mean Sx is Sample Standard deviation n is sample size : is the alternative hypothesis Slide 19 Improving your SAT score. We suspect that on the average students will score higher on their second attempt at the SAT mathematics exam than on their first attempt. Suppose we know that the changes in score (second try minus first try) follow a Normal distribution. Here are the results for 46 randomly chosen high school students:Normal distribution 302447706255413212811 43122105632302819117 57145877273351176729 94112125349 82496 12023323999 Do these data give good evidence that the mean change in the population is greater than zero?mean population Slide 20 Activity 20- 2: Sleeping Times The null hypothesis is that the mean sleep time of the population is 7 hours. In symbols, the null hypothesis is H 0 : = 7.0 hours. The alternative hypothesis is that the mean sleep time of the population is not 7 hours. In symbols, the alternative hypothesis is H a : 7.0 hours. Sample Number Sample Size Sample Mean Sample SD Test Statistic p- value 1106.60.825 2106.61.597 3306.60.825 4306.61.597 Slide 21 Exercise 20-8: UFO Sighters Personality Page 432 Exercise 20-10: Credit Card Usage - Page 433 Exercise 20-21: Pet Ownership - Page 436 Exercise 20-14: Age Guesses Page 434 Slide 22 EXTRA PROBLEMS Slide 23 Problem Assume that you are conducting a test of significance using a significance level of = 0.10. If your test yields a P-value of 0.08, what is the appropriate conclusion? P-value = 0.08 < 0.10 Reject Null, It is statistically significant Slide 24 Problem The nicotine content in cigarettes of a certain brand is normally distributed with mean (in milligrams) and standard deviation = 0.1. The brand advertises that the mean nicotine content of their cigarettes is 1.5, but measurements on a random sample of 400 cigarettes of this brand gave a mean of x = 1.52. Is this evidence that the mean nicotine content is actually higher than advertised? at significance level = 0.01. You conclude test the hypotheses H0: = 1.5, Ha: > 1.5 Is this evidence that the mean nicotine content is actually higher than advertised? State the hypothesis Slide 25 Problem A researcher wants to know if the average time in jail for robbery has increased from what it was several years ago when the average sentence was 7 years. He obtains data on 400 more recent robberies and finds an average time served of 7.5 years. If we assume the standard deviation of sample is 3 years, what is the p-value of the test? at significance level = 0.05. You conclude