PPA 415 – Research Methods in Public Administration
Lecture 4 – Measures of Dispersion
Introduction
By themselves, measures of central tendency cannot summarize data completely.
For a full description of a distribution of scores, measures of central tendency must be paired with measures of dispersion.
Measures of dispersion assess the variability of the data. This is true even if the distributions being compared have the same measures of central tendency.
Introduction – Example, JCHA 1999
How safe do you feel in your community?
10.09.08.07.06.05.04.03.02.0
How safe is your community?
Trafford3.5
3.0
2.5
2.0
1.5
1.0
.5
0.0
Std. Dev = 2.67
Mean = 6.8
N = 14.00
How safe do you feel in your community?
10.09.08.07.06.05.04.03.02.01.00.0
How safe is your community?
Red Hollow3.5
3.0
2.5
2.0
1.5
1.0
.5
0.0
Std. Dev = 3.96
Mean = 6.8
N = 7.00
Introduction
Measures of dispersion discussed. Index of qualitative variation (IQV). The range and interquartile range. Standard deviation and variance.
Index of Qualitative Variation
Used primarily for nominal variables, but can be used with any variable with a frequency distribution.
Ratio of amount of variation actually observed in a distribution of scores to the maximum variation that could exist in that distribution.
Index of Qualitative Variation
Maximum variation in a frequency distribution occurs when all cases are evenly distributed across all categories.
The measure gives you information on how homogeneous or heterogeneous a distribution is.
Index of Qualitative Variation
sfrequencie squared theof sum thef
cases ofnumber N
categories ofnumber
:where
1
2
2
22
k
kN
fNkIQV
Index of Qualitative Variation
JCHA 1999: Ethnicity in Housing Communities Dixie Red Hickory Oak TerraceFultondale Brookside Warrior I Warrior II Bradford Manor Trafford Hollow Grove Ridge Manor
White f1 9 20 18 8 10 2 14 5 7 0 9Nonwhite f2 6 4 5 4 4 13 0 2 0 17 20
Total N 15 24 23 12 14 15 14 7 7 17 29Categories k 2 2 2 2 2 2 2 2 2 2 2
IQV 96.0% 55.6% 68.1% 88.9% 81.6% 46.2% 0.0% 81.6% 0.0% 0.0% 85.6%
Range and Interquartile Range
Range: the distance between the highest and lowest scores. Only uses two scores. Can be misleading if there are extreme values.
Interquartile range: Only examines the middle 50% of the distribution. Formally, it is the difference between the value at the 75% percentile minus the value at the 25th percentile.
Range and Interquartile Range
Problems: only based on two scores. Ignores remaining cases in the distribution.
)()( 251753 PQPQIQR
lowestHighestRange
Range and Interquartile Range: JCHA 1999 Example
StatisticsHow long have your lived at this address? N Valid 181
Missing 4Minimum 1Maximum 564Percentiles 25 24
75 108
Range = Maximum - Minimum 563IQR = P75-P25 84
The Standard Deviation
The basic limitation of both the range and the IQR is their failure to use all the scores in the distribution
A good measure of dispersion should Use all the scores in the distribution. Describe the average or typical deviation of the
scores. Increase in value as the distribution of scores
becomes more heterogeneous.
The Standard Deviation
One way to do this is to start with the distances between every point and some central value like the mean.
The distances between the scores are the mean (Xi-Mean X) are called deviation scores.
The greater the variability, the greater the deviation score.
The Standard Deviation
One course of action is to sum the deviations and divide by the number of cases, but the sum of the deviations is always equal to zero.
The next solution is to make all deviations positive. Absolute value – average deviation. Squared deviations – standard deviation.
Average and Population Standard Deviation
N
XXAD
i
Deviation Average
N
XX
N
XX
i
i
2
2
2
n)(populatioDeviation Standard
n)(populatio Variance
Sample Variance and Standard Deviation
1
deviation standard Sample1
varianceSample
2
2
2
n
XXs
n
XXs
i
i
Computational Variance and Standard Deviation - Sample
2
2
2
2
Deviation Standard Sample nalComputatio1
(Sample) Variance nalComputatio
ss
nn
xx
s
Examples – JCHA 1999
Safety (Xi) X2
10 1.9 1.9 3.61 1009 0.9 0.9 0.81 815 -3.1 3.1 9.61 255 -3.1 3.1 9.61 25
10 1.9 1.9 3.61 1007 -1.1 1.1 1.21 49
10 1.9 1.9 3.61 10010 1.9 1.9 3.61 10010 1.9 1.9 3.61 1005 -3.1 3.1 9.61 25
81 0.0 20.8 48.90 705N 10
8.1
X
)( XX i 2)( XX i XX i
Examples – Average and Standard Deviation
33.234.5
34.59
9.48
9
1.656705
91081
705
1
33.234.5
34.59
9.48
1
8.210
28
2
22
2
2
2
2
2
ss
nn
xx
s
ss
n
XXs
n
XXAD
i
i
Grouped Standard Deviation
1
2
2
nn
fxfx
s
mm
Grouped Standard Deviation Example
What is your monthly household income?
f xm x2m fxm fx2
m
Valid $500 or less 13 250 62500 3250 812500$1,000 or less 16 750 562500 12000 9000000$1,500 or less 9 1250 1562500 11250 14062500Total 38 26500 23875000
Missing Missing Values 5Total 43
Grouped Standard Deviation Example
84.381$6984.803,145
37
84.736,394,5
37
16.263,480,18000,875,23
3738500,26
000,875,23
1
22
2
s
s
nn
fxfx
s
mm
Top Related