Http:// Sir Francis Galton Karl Pearson October 28 and 29.
-
date post
22-Dec-2015 -
Category
Documents
-
view
213 -
download
0
Transcript of Http:// Sir Francis Galton Karl Pearson October 28 and 29.
http://www.york.ac.uk/depts/maths/histstat/people/
Sir Francis Galton Karl Pearson
October 28 and 29
30 40 50 60 70 80
30
40
50
60
70
80
MATH
RD
G
A correlation coefficient is a numerical expression of the degree of relationship between two continuous variables.
30 40 50 60 70 80
30
40
50
60
70
80
MATH
RD
G
What might be some practical uses of such a statistic?
A correlation coefficient is a numerical expression of the degree of relationship between two continuous variables.
Population
SampleA XA
µ
_
SampleB XB
SampleE XE
SampleD XD
SampleC XC
_
_
_
_
sa
sb
sc
sd
se
n
n
n
n n
-1 r +1
-1 +1
Pearson’s r
Pearson’s r is a function of the sum of the cross-product of z-scores for x and y.
The familiar t distribution, at N-2 degrees of freedom, can be used to test the probability that the statistic r was drawn from a population with = 0
H0 : XY = 0
H1 : XY 0
where
r N - 2
1 - r2
t =
-1 r +1
-1 +1
Pearson’s r
Pearson’s r can also be interpreted as how far the scores of Y individuals tend to deviate from the mean of X when they are expressed in standard deviation units.
-1 r +1
-1 +1
Pearson’s r
Pearson’s r can also be interpreted as the expected value of zY given a value of zX.
tend to deviate from the mean of X when they are expressed in standard deviation units.
The expected value of zY is zX*r
If you are predicting zY from zX where there is a perfect correlation (r=1.0), thenzY=zX.. If the correlation is r=.5, then zY=.5zX.
Factors that affect r
Non-linearity
Restriction of range / variability
Outliers
Reliability of measure / measurement error
Johnson & Newport, scaled properly,with new ranges age <20 and >20.
All Subjects
0 10 20 30 40Age of Arrival
0
100
200
300
Eng l
i sh
Prof
icie
ncy
r=-.87 r=-.49