Significance of the Difference Between Two Sample Means

40
Significance of the Difference Between Two Sample Means

description

Significance of the Difference Between Two Sample Means. Previously…. We learned how to test the hypothesis that one sample came from a population that we have information about, either parameters (like a known μ ) or claims (like a claimed μ ). H 0 : μ = μ 0 - PowerPoint PPT Presentation

Transcript of Significance of the Difference Between Two Sample Means

Page 1: Significance of the Difference Between Two Sample Means

Significance of the Difference Between Two Sample Means

Page 2: Significance of the Difference Between Two Sample Means

Previously…

• We learned how to test the hypothesis that one sample came from a population that we have information about, either parameters (like a known μ) or claims (like a claimed μ).

• H0: μ = μ0

• We knew what μ was (it was a constant), so we could just plug it into our formula and determine our t score.

• We used the sampling distribution of means as our reference distribution.

Page 3: Significance of the Difference Between Two Sample Means

Two-sample t test

• We are going to learn how to compare two samples, one control group and one treatment group.

• We will treat the control group as the population and the treatment group as our single sample.

Page 4: Significance of the Difference Between Two Sample Means

Draw Two Samples and Compare

Control Treatment

(Unknown) Population

Page 5: Significance of the Difference Between Two Sample Means

Our Hypothesis

• H0: μ1 = μ2

• Instead of using to estimate the significance of our treatment, we will use

.

Page 6: Significance of the Difference Between Two Sample Means

Independent Samples

• Independent just means that one group cannot influence the outcome of the other group.

• If we randomly sample from the population and randomly assign individuals to each group, we generally won’t have a problem.

Page 7: Significance of the Difference Between Two Sample Means

Sampling Distribution of the Mean Differences

• Because we are dealing with two samples and we assume that they are being drawn from the same population (remember our null hypothesis), the reference distribution we will use is the distribution of the differences between all pairs of sample means.

Page 8: Significance of the Difference Between Two Sample Means

Draw two samples:

Control Treatment

(Unknown) Population

Page 9: Significance of the Difference Between Two Sample Means

Then find the difference between their means:

Repeat

Page 10: Significance of the Difference Between Two Sample Means

Sampling Distribution of the Mean Differences

• If you plot the mean differences on a frequency distribution, the differences will vary around the “true difference.”

• The differences will be normally distributed (perfectly so with an infinite number of mean differences).

Page 11: Significance of the Difference Between Two Sample Means

If H0 is True

• You will find that the mean of the mean differences is 0.

• In other words, the true difference is 0 (there is no difference).

• Therefore, the control group and sample group “come from the same population,” which means that they do not differ with respect to the variable you are measuring.

Page 12: Significance of the Difference Between Two Sample Means

If H0 is False

• You will find that the mean of the mean differences is not 0.

• In other words, the true difference is not equal to 0 (there is a difference).

• Therefore, the control group and treatment group come from different populations, which means that they do differ with respect to the variable you are measuring.

Page 13: Significance of the Difference Between Two Sample Means

Properties

1. Here is a fancy way of saying what we just said (the mean of all sample differences is equal to the difference between the population means):

2. The larger the sample sizes, the closer the curve will be to the normal curve.

3. The larger the sample sizes, the smaller the standard deviation of sampling distribution differences.

Page 14: Significance of the Difference Between Two Sample Means

Standard Error of the Difference

• The standard deviation of the mean differences is called the standard error of the difference between means and is defined mathematically as follows:

Page 15: Significance of the Difference Between Two Sample Means

Back to Reality

• This is all theoretical, since you won’t be drawing an infinite number of samples and comparing their means, and you won’t know the population parameters.

• So what do we do with our two samples?

Page 16: Significance of the Difference Between Two Sample Means

Estimated Standard Error of the Difference

• The estimated standard error of the difference formula is the same, but with statistics substituted for the parameters:

Page 17: Significance of the Difference Between Two Sample Means

Estimated Standard Error of the Difference

• The formula that we will be using for calculation can be used on samples of different sizes (which is why we will use it):

• Notice that you know all of the necessary terms.

• The reason it looks complex is that the variance is “weighted” by sample size. Larger samples get more say in determining the standard error of the difference.

Page 18: Significance of the Difference Between Two Sample Means

Finding the t score

• Remember in the last unit, we found the t score using a formula very similar to the z score formula. We are going to do the same thing here.

• If we knew the population parameters, we would use this formula

Page 19: Significance of the Difference Between Two Sample Means

Finding the t score

• Of course, we don’t know the population parameters, so we use this formula instead:

• And since we are assuming that the samples come from the same population (μ1 – μ2 = 0), we can write it this way:

Page 20: Significance of the Difference Between Two Sample Means

Computing t

• If we substitute the computational formula for the estimated standard error of the difference in the denominator, we get this monster formula:

• But notice that it’s all stuff we’ve done before.

Page 21: Significance of the Difference Between Two Sample Means

Using the t table

• Once we have the t score, we can check the t table to see if the difference between our samples is “significant.”

• Just as before if our tobt (the value we get from our formula) is LARGER than our tcrit (the value we find in the table), we reject the null.

• One important difference: for our df we use N1 + N2 – 2.

Page 22: Significance of the Difference Between Two Sample Means

Activity #1

• An educational experiment with participants randomly sampled and assigned to two groups: traditional lecture and group discussion produces the following results.

• Lecture Group: N = 25, Mean = 81.7, s = 8.3• Discussion Group: N = 25, Mean = 74.1, s = 10.1

• Is there a “significant” difference between the two groups at the α = .05 level? At the α = .01 level?

Page 23: Significance of the Difference Between Two Sample Means

Step 1

• State the H0: μ1 = μ2

• In other words, there is no difference between the two groups (i.e., they came from the same population).

• Choose an α level (we will look at both values, just for fun).

Page 24: Significance of the Difference Between Two Sample Means

Step 2

• State your rejection rule:

• Reject H0 if |tobt| ≥ tcrit

• using df = N1 + N2 – 2 = 25 + 25 – 2 = 48 (we will use 50 in our table), tcrit = 2.0086 (5%) and 2.6778 (1%)

Page 25: Significance of the Difference Between Two Sample Means

Step 3

• Now we compute:

Page 26: Significance of the Difference Between Two Sample Means

Step 4

• Make a decision:

• We know that tcrit = 2.0086 (5%) and 2.6778 (1%)

• We know that tobt = 2.906

• We can see that tobt is greater than both tcrit values, so we would reject the null regardless of the α level we had chosen.

Page 27: Significance of the Difference Between Two Sample Means

Step 5

• Interpret your results:

• The difference between the lecture group and the discussion group was statistically significant, t = 2.906, p < .01.

• Make sure you know what this means: the probability of obtaining samples with differences this great or greater if the null hypothesis is true (there is no difference between the groups) is less than .01.

Page 28: Significance of the Difference Between Two Sample Means

Assumptions of the Independent Two-Sample t Test

1. Normal distribution: we assume that the DV in the populations are normally distributed.

2. Homogeneity of Variance: (homogeneity = sameness) we assume that the population variances are the same.

3. Independent Samples: we assume that the samples are independent

Page 29: Significance of the Difference Between Two Sample Means

Violations of the Assumption

• If any one of the assumptions is violated, the t test will often continue to produce valid results. The test is robust.

Page 30: Significance of the Difference Between Two Sample Means

However

• If homogeneity of variance is violated (i.e., the variances are dissimilar) AND the sample sizes are different, you could get wacky results.

• Also, if the third assumption, independence, is violated, you will want to use a different test.

Page 31: Significance of the Difference Between Two Sample Means

Dependent Samples

• If the samples are not independent, they are dependent.

• Actually, using dependent samples can increase the power of your analysis, which is a good thing (it means you have a greater chance of rejecting the null if there is a difference between the samples).

Page 32: Significance of the Difference Between Two Sample Means

Matched Pairs

• Finding subjects that are alike with regard to a variable that is likely to influence the experimental outcome and assigning one to each group (e.g., matching people with similar IQ scores for an academic experiment).

• To conduct a matched pairs experiment, you need information about at least one additional variable (like IQ score, bench press maximum, relation to other subjects, etc.)

Page 33: Significance of the Difference Between Two Sample Means

Within-subject Comparisons(Repeated Measures Design)

• Each subject is exposed to both treatment and control conditions.

• The order of exposure is randomized and counterbalanced.

• You can get away with smaller sample sizes.

Page 34: Significance of the Difference Between Two Sample Means

Why More Power?

• If the subjects are matched, the difference between means will be closer to the actual difference and not as heavily influenced by variation caused by the “matching variable.”

• With repeated measures, the subjects are the same in both groups, so you have even less uncontrolled variance.

Page 35: Significance of the Difference Between Two Sample Means

Computing t for Dependent Samples

• First, construct a table with each pair (matched pairs) or individual (repeated measures) in one row.

Pair/Indiv. Treatment Control D D2

1 15 12

2 13 14

3 16 13

4 22 16

5 8 7

6 13 10

7 12 14

8 9 7

9 5 4

10 6 3

Page 36: Significance of the Difference Between Two Sample Means

Computing t for Dependent Samples

• Calculate the difference (D) between each condition for each row. Then square D.

Pair/Indiv. Treatment Control D D2

1 15 12 3 9

2 13 14 -1 1

3 16 13 3 9

4 22 16 6 36

5 8 7 1 1

6 13 10 3 9

7 12 14 -2 4

8 9 7 2 4

9 5 4 1 1

10 6 3 3 9

Page 37: Significance of the Difference Between Two Sample Means

Computing t for Dependent Samples

• Find the mean of the differences:

• Find the estimated standard error the difference:

Page 38: Significance of the Difference Between Two Sample Means

Computing t for Dependent Samples

• Calculate t:

Page 39: Significance of the Difference Between Two Sample Means

Finding tcrit

• Finding the critical value is the same as before, except for an important difference:

• When finding tcrit use the number of pairs (for matched pairs) or individuals (for repeated measures) as N. df = N – 1.

Page 40: Significance of the Difference Between Two Sample Means

Homework

• Study for Chapter 10 Quiz

• Read Chapter 11

• Do Chapter 10 HW