Download - Day 12 t test for dependent samples and single samples pdf

12/31/2014

1

Wednesday, December 31, 2014 1

Compute by hand and interpret

Dependent samples t test

Single sample t test

Use SPSS to compute the same tests

and interpret the output

12/31/2014

2

The Slides discuss

• Comparing two means to ascertain which mean is of greater statistical significance.

• The statistical test used for this type of analysis is the t test, and the statistic that is computed is called a t value.

• Three research situations in which the t test can be used to analyze the data and compare the means from

a) Two paired samples (e.g., pretest and posttest scores); and

b) a sample and a population (e.g., comparing a mean of a sample to the mean of the population).

• Examples with numerical data to illustrate each of the three types of t tests.

• After t value is computed, you have decided whether it is statistically significant and whether your research hypothesis was confirmed.

STEPS IN THE PROCESS OF HYPOTHESIS TESTING

1. Formulating a research question for the study.

2. Stating a research hypothesis (i.e., an alternative hypothesis). The hypothesis should represent the

researcher’s prediction about the outcome of the study and should be testable. Note that a null

hypothesis for the study is always implied, but it is not formally stated in most cases. The null

hypothesis predicts no difference between groups or means or no relationship between variables.

3. Designing a study to test the research hypothesis. The study’s methodology should include plans

for selecting one or more samples from the population that in interest to researcher; selecting or

designing instruments to gather the numerical data; carrying out the study’s procedure (and

intervention in experimental studies); and determining the statistical test(s) to be used to analyze

the data.

4. Conducting the study and collecting numerical data.

5. Analyzing the data and calculating the appropriate test statistics (e.g., Pearson correlation

coefficient, t test value or chi square value).

6. Determining the appropriate p value.

7. Deciding whether to retain or reject the null hypothesis. A p value of 0.05 is the most commonly

used benchmark to consider the results statistically significant. If the results are statistically

significant, the researcher may also wish to calculate the effect size (EF) to determine the

practical significance of the study’s results.

8. Making a decision whether to confirm the study’s alternative hypothesis (i.e., the research

hypothesis ) and how probable it is that the results were obtained purely by chance. This decision

is based on the decision made regarding the null hypothesis.

9. Summarizing the study’s conclusion, addressing the study’s research question.

12/31/2014

3

Assumptions of t-Test

• The groups are independent of each other.

• A person (or case) may appear in only one group.

• When the two group are approximately the same size, there is no need for the homogeneity of variance. This assumption is called the assumption of the homogeneity of variance.

• We compare the two variances to determine if there is a statistically significant difference between them.

• To test for the assumption of the homogeneity of the variances, we divide the larger variance by the smaller variance and obtain a ratio, called the F value.

• When the F value is statistically significant we cannot assume that the variance are equal.


• A test for the equality of variance, such as the levene’s is used to test the significance of the F value. An F value that is not statistically significant (p>0.05) indicates that the assumption for the homogeneity of variance is not violated and, therefore, equal variances can be assumed , on the other hand, a significant F value (p< 0.05) indicates that the assumption for the homogeneity of variance was violated. The t test statistical results are adjusted for the unequal variance.

12/31/2014

4


• The independent variables are interval or ratio.

• The population from which samples are drawn is normally distributed.

• Samples are randomly selected.

• The groups have equal variance (Homogeneity of variance).

• The t-statistic is robust (it is reasonably reliable even if assumptions are not fully met). Therefore, even if the assumption of the homogeneity of variance is not fully met, the researcher can probably still use the test to analyze the data.

A t-test is used to compare two means in three different situations

T Test

t-test for independent samples

- The two groups whose means are being compared are independent of each other

Ex: a comparison of experimental and control groups.

T-test for paired samples (t-test for dependent, matched or correlation

samples)

The two means represent two sets of scores that are paired.

Ex: a comparison of pretest and posttest scores obtained from one group of people

t-test for a single sample

The t-test is used when the mean of a sample is compared to the mean of a population

12/31/2014

5

Dependent Samples

t-tests

Dependent Samples t-test

• A t test for paired samples (dependent sample t-test) is used

when the two means being compared come from two sets of

scores that are related to each other.

• It is used, for example, in experimental research to measure

the effect of an intervention (a treatment) by comparing the

posttest to the pretest scores.

• The most important requirement for conducting this t test is

that the two sets of scores are paired: they belong to the

same individuals.

• Useful to control individual differences. Can result in more

powerful test than independent samples t-test.

12/31/2014

6

Computing the paired-samples t-test

• First find for each person the difference (D) between the

two scores (e.g., between pretest and posttest) and sum up

those difference (∑ D). Usually, the lower scores (e.g.,

pretest) are subtracted from the higher scores (e.g., posttest)

so D values are always positive.

• We also need to compute the sum of the squares of the

difference (∑ D2). Finally the t value is computed using the

formula:

The example that follows demonstrates the computation of a paired-

samples t test. To simplify the computations , we use scores of eight

students only. Of course, in conducting real-life research, it is

recommended that larger samples (thirty or more) be used.

12/31/2014

7

An example of a t Test

for dependent samples or paired samples Research has documented the potential effect of students’ positive self-concept on

their self-reflections, attitudes toward self, schoolwork, and general development. A

special program is developed by school psychologists and primary grade teachers to

enhance the self-concept of young-age children. The program is implemented in

Sunny Bay School with four groups of first-and second-grade students. The

intervention lasts six weeks and involves various activities in the class and at home.

The instrument used to assess the effectiveness of the program is comprised of forty

pictures, and scores can range from 0 to 40. The program coordinator conducts a

series of workshops to train five graduate students to administer the instrument. All of

the children in the program are tested before the start of the program and then again

during the first week after the end of the program. A t test for paired samples is used

to test the hypothesis that students’ self-concept would improve significantly on the

posttest, compared with their pretest scores. The research hypothesis is:

HA: Mean POSTTEST > Mean PRETEST

Ho: Mean POSTTEST = Mean PRETEST

Review 6 Steps for Significance

Testing

1. Set alpha (p level).

2. State the Research

question

3. Set up hypotheses,

Null and Alternative.

4. Calculate the test

statistic (sample

value).

4. Find the critical

value of the statistic.

5. State the decision

rule.

6. State the conclusion.

12/31/2014

8

Pretest and Posttest Scores of Eight Students

Pretest

X1

Posttest

X2

Posttes

t –

Pretest

D

(Posttest-Pretest)2

D2

30 31 +1 1

31 32 +1 1

34 35 +1 1

32 40 +8 64

32 32 0 0

30 31 +1 1

33 35 +2 4

34 37 +3 9

D= 17 D2 =81

The table shows the numerical data we

used to compute the t value for the eight

students selected at random from the

program participants.

The table shows the pretest and posttest

scores for each students, as well as the

means on the pretest and posttest.

The third column in the table shows the

difference between each pair of scores

(D) and is created by subtracting the

pretest from the posttest for each

participant.

The gain scores are then squared and

recorded in the fourth column (D2).

The scores in these last two columns are

added up to create ∑D and ∑D2

respectively. These values are used in the

computation of the t value:

Dependent Samples t Example

1. Set alpha = .05

2. Null hypothesis: H0: Posttest = Pretest. Alternative is HA: Mean Posttest > Pretest.

3. Calculate the test statistic:

12/31/2014

9

Dependent Samples t Example 4. Determine the critical value of t. Alpha =.05, one tailed

test df = N(pairs)-1 =8-1=7 ; Critical value is 1.895

5. Decision rule: The obtained t value of 2.37 exceeds the

critical values of 1.895 for one tailed test under p=0.05

6. Conclusion. We reject the null hypothesis that states that

there is no difference between the pretest and the posttest

scores. The chance that these results were obtained purely

by chance is less than 5 percent. We confirm the research

hypothesis that predicted that the posttest mean score

would be significantly higher than the pretest mean score.

Table A section from the Table of Critical

Value for t Tests

Level of significance (p value) for one-tailed test

p values 0.10 0.05 0.025 0.01 0.005

Level of significance (p value) for two-tailed test

p values 0.20 0.10 0.05 0.02 0.01

df=7 1.415 1.895 2.365 2.998 3.499 1.895

0.05

df=7

12/31/2014

10

Interpretation

According to this study, the self-concept

enhancement program was effective in

increasing the self-concept of the first-and

second-grade students.

Our data seem to indicate that the intervention to increase the

self-concept of the primary grade students was effective.

However, those conducting the research or those reading

about it should still decide for themselves whether the

intervention is worth while. The question to be asked is

whether an increase of 2.13 (34.13-32.00) points (out of

forty possible points on the scale) is worth the investment of

time, money, and effort.

Using SPSS for dependent t-test

• Open SPSS

• Open file “SPSS Examples” (same as before)

• Go to:

– “Analyze” then “Compare Means”

– Choose “Paired samples t-test”

– Choose the two IV conditions you are

comparing. Put in “paired variables box.”

12/31/2014

11

Dependent t- SPSS output

Paired Samples Statistics

Mean N Std.

Deviation Std. Error

Mean Pair 1 Pretest 32.00 8 1.604 .567

Postest 34.13 8 3.227 1.141

Paired Samples Correlations

N Correlation Sig. Pair 1 Pretest & Postest 8 .635 .091

Paired Samples Test

Paired Differences

95% Confidence Interval of the

Difference

Mean

Std. Deviation

Std. Error Mean Lower Upper

Pair 1

Pretest - Postest

-2.125 2.532 .895 -4.242 -.008

Paired Samples Test

t df

Sig. (2-tailed)

Pair 1

Pretest - Postest

-2.374 7 .049

Task of t-test for dependent samples

The data below shows the lengths ( in arbitrary units)

of syllables containing a particular vowel in two

different environments. Each of the ten subjects was

asked to read two sentences, each containing the

vowel in one environment, and a pair of lengths was

thus obtained for each subject. The researcher predicts

greater mean length in environment 2 than in

environment 1.

12/31/2014

12

Subject no. Environment 1 Environment 2

1 22 26

2 18 22

3 26 27

4 17 15

5 19 24

6 23 27

7 15 17

8 16 20

9 19 17

10 25 30

Table of Lengths of a vowel in two

environments

Do 6 Steps for Significance Testing



question




statistic (sample

value).




rule.


12/31/2014

13

t-Tests for a Single

Sample

t-Test for a Single Sample

• A t test for single sample is used when the mean of a sample

is compared to the mean of a population.

• Occasionally, a researcher is interested in comparing a

single group (a sample) to a larger group (a population).

• For example, a high school teacher of a freshmen-

accelerated English class may want to confirm that the

students in that class had obtained higher scores on an

English placement test compared with their peers.

• In order to carry out this kind of a study the researcher must

know prior to the start of the study the mean value of the

population. In this example, the mean score of the

population is the overall mean of the scores of all freshmen

on the English placement test.

12/31/2014

14

Computing t-Test for Single sample

• The t value is computed using the formula:

An example of a t Test

for Single sample A kindergarten teacher in a school commented to her colleague that the students in

her class this year seem to be less bright than those she had in the past. Her colleague

disagrees with her, and to test whether the first-graders this year are really different

from those in previous years, they conduct a test for a single sample. The scores used

are from the Wechsler Preschool and Primary Scale of Intelligence Third Edition

(WPPSI-III), which is given every year to all kindergarten students in the district. In

this example, we consider the district to be the population to which we compare the

mean of the current kindergarten class. Although the mean IQ score of the population

at large is 100 (µ=100), this district’s mean IQ score is 110 (µ= 110), and this mean is

used in the analysis. The research hypothesis is stated as a null hypothesis and

predicts that there is no statistically significant difference in the mean IQ score of this

year’s kindergarten students (the sample) and the mean IQ score of all kindergarten

students in the district (the population) that were gathered and recorded over the last

three years.

Ho: Mean Class = Mean District

12/31/2014

15


Testing



question




statistic (sample

value).




rule.


Table IQ scores of Ten Students

Scores

115 118

135 113

105 98

107 120

112 99

∑X= 1,122

¯X=112.20

SD= 10.94

In order to test their hypothesis, the two teachers randomly select IQ scores of ten students from this year’s kindergarten class.

These IQ scores are listed in the table followed by the computation of the t value.

12/31/2014

16

Dependent Samples t Example 1. Is there any differences between this year’s

kindergarten class and the “typical kindergarten students in the district?

2. Null hypothesis: H0: Class = District : there

is no significant difference between this year’s

kindergarten class and the “typical” kindergarten

students in the district.

3. Alternative is HA: Class ≠ District. 4. Calculate the test statistic:

Single Samples t Example 4. Determine the critical value of t. Alpha =.05, two tailed

test df = N(pairs)-1 =10-1=9 ; Critical value is 1.895

5. Decision rule: The obtained t value of 0.64 does not

exceeds the critical values of 2.262 for two tailed test

under p=0.05 (because our hypothesis is non-directional)

6. Conclusion. We retain the null hypothesis that states that

there is no significant difference between this year’s

kindergarten class and the “typical” kindergarten students

in the district.

12/31/2014

17

Table A section from the Table of Critical

Value for df=9

Level of significance (p value) for one-tailed test

p values 0.10 0.05 0.025 0.01 0.005

Level of significance (p value) for two-tailed test

p values 0.20 0.10 0.05 0.02 0.01

df=9 1.383 1.833 2.262 2.821 3.250 2.262

0.05

df=9

Interpretation

The mean IQ score of this year’s

kindergarten students (mean= 112.20) is

actually slightly higher than the mean

score of the district (mean=110).

The research hypothesis that was

stated in a null form (i.e., predicting

no difference between two means) is

confirmed.

12/31/2014

18

Using SPSS for dependent t-test

• Open SPSS

• Open file “SPSS Examples” (same as before)

• Go to:

– “Analyze” then “Compare Means”

– Choose “Paired samples t-test”

– Choose the two IV conditions you are

comparing. Put in “paired variables box.”

Single Sample- SPSS output

One-Sample Statistics

N Mean Std. Deviation Std. Error Mean IQ Scores 10 112.20 10.942 3.460

One-Sample Test

Test Value = 0

95% Confidence Interval

of the Difference

t df Sig. (2-tailed)

Mean Difference Lower Upper

IQ Scores 32.425 9 .000 112.200 104.37 120.03

12/31/2014

19

Task of t-test for Single samples A report published by the Antaretic Academy of

Neurolinguistics indicates the left-handed people may be

better language learners than right-handed people. To test this

hypothesis, you did the following experiments:

In an ESL center, 250 students were randomly selected

and then assigned to two groups (left-handed vs. right-

handed). After equal amounts of instruction, you administered

a battery of language tests to all students. The information you

obtained was:

If the mean on the test for the population of ESL students is 50

a. Test whether left-handed people are better than the

population

b. Test whether right-handed people are better than the

population.


Testing



question




statistic (sample

value).




rule.


12/31/2014

20

References Main Sources

Coolidge, F. L.2000. Statistics: A gentle introduction. London: Sage.

Kranzler, G & Moursund, J .1999. Statistics for the terrified. (2nd ed.). Upper Saddle

River, NJ: Prentice Hall.

Butler Christopher.1985. Statistics in Linguistics. Oxford: Basil Blackwell.

Hatch Evelyn & Hossein Farhady.1982. Research design and Statistics for Applied Linguistics.

Massachusetts: Newbury House Publishers, Inc.

Ravid Ruth.2011. Practical Statistics for Educators, fourth Ed. New York: Rowman &

Littlefield Publisher, Inc.

Quirk Thomas. 2012. Excel 2010 for Educational and Psychological Statistics: A Guide

to Solving Practical Problem. New York: Springer.

Other relevant sources

Agresi A, & B. Finlay.1986. Statistical methods for the social sciences. San Francisco,

CA: Dellen Publishing Company.

Bachman, L.F. 2004. Statistical Analysis for Language Assessment. New York: Cambridge University

Press.

Field, A. (2005). Discovering statistics using SPSS (2nd ed.). London: Sage.

Moore, D. S. (2000). The basic practice of statistics (2nd ed.). New York: W. H.

Freeman and Company.

Wednesday, December 31, 2014

39