Statistik: Numerical Descriptive Measures
description
Transcript of Statistik: Numerical Descriptive Measures
NUMERICAL DESCRIPTIVE MEASURESMeeting 3
Course : I0262 – Statistics ProbabilityYear : 2011
Topic : Numerical descriptive Measures
• Measures of central Tendency• Variation and Shape• Numerical descriptive Measures for a
Population• Application in PH Stat Excel
Bina Nusantara University 2
Learning Outcome
• Identify basic statistics (data, sample, population, symbolism, and definition)
• Interpret the result of identify and the calculation
Bina Nusantara University 3
Bina Nusantara
DEFINITION
The central tendency is the extent to which all the data values group around a typical or central value.
The variation is the amount of dispersion, or scattering, of values
The shape is the pattern of the distribution of values from the lowest value to the highest value.
Bina Nusantara
3.1 MEASURES OF CENTRAL TENDENCY
• Mean• Median• Mode• Quartiles
Bina Nusantara
3.1 MEASURES OF CENTRAL TENDENCY
A. Mean Population
Sample
n = number of sample size xi = observation
N = number of population size
N
xxx
N
x NN
i
i
...21
1
n
xxx
n
xx n
n
i
i
...21
1
Bina Nusantara
3.1 MEASURES OF CENTRAL TENDENCY
Example : cost of a fast-food hamburger meal
City HamburgerTokyo 5.99London 7.62New York 5.75Sydney 4.45Chicago 4.99San Francisco 5.29Boston 4.39Atlanta 3.70Toronto 4.62Rio de Janeiro 2.99
98,4
10
99.2...62.799.5
10
10
1
i
ixx
Bina Nusantara
3.1 MEASURES OF CENTRAL TENDENCY
B. Median Odd
Even
n = number of sample sizex= observations that have been sequenced
2/)1( nxmedian
)(2
112/2/ nn xxmedian
Bina Nusantara
3.1 MEASURES OF CENTRAL TENDENCY
Example : cost of a fast-food hamburger meal
City HamburgerTokyo 5.99London 7.62New York 5.75Sydney 4.45Chicago 4.99San Francisco 5.29Boston 4.39Atlanta 3.70Toronto 4.62Rio de Janeiro 2.99
81.4
)99.462.4(2
1
)(2
1
)(2
1
65
12/102/10
xx
xxmediani Hamburger
1 2.992 3.703 4.394 4.455 4.626 4.997 5.298 5.759 5.99
10 7.62
Data sequence
Bina Nusantara
3.1 MEASURES OF CENTRAL TENDENCY
C. Mode/Modus Is the value that occurs most often
10
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Mode = 9
0 1 2 3 4 5 6
No Mode
Bina Nusantara
3.1 MEASURES OF CENTRAL TENDENCY
D. QuartilesQuartiles split the ranked data into 4 segments with an equal number of values per segment.
The first quartile, Q1, is the value for which 25% of the observations are smaller and 75% are larger
Q2 is the same as the median (50% are smaller, 50% are larger) Only 25% of the values are greater than the third quartile
25% 25% 25% 25%
1Q 2Q 3Q
Bina Nusantara
3.1 MEASURES OF CENTRAL TENDENCY
D. Quartiles
• First quartile position: Q1 = (n+1)/4 ranked value
• Second quartile position: Q2 = median
• Third quartile position: Q3 = 3(n+1)/4 ranked value
where n is the number of observed values
Bina Nusantara
3.1 MEASURES OF CENTRAL TENDENCY
Example : cost of a fast-food hamburger meal
CityHamburg
erTokyo 5.99London 7.62New York 5.75Sydney 4.45Chicago 4.99San Francisco 5.29Boston 4.39Atlanta 3.70Toronto 4.62Rio de Janeiro 2.99
i Hamburger1 2.992 3.703 4.394 4.455 4.626 4.997 5.298 5.759 5.99
10 7.62
Data sequence
• Q1 = (n+1)/4 = 2.75 ~ 3
4.39• Q3 = 3(n+1)/4 = 8.25 ~ 9
5.99
jika Q bukan bernilai integer maka dinaikkan
jika Q bernilai integer maka p adalah rata-rata nilai di posisi Q dan Q+1
Bina Nusantara
3.2 MEASURES OF VARIATION
Variation measures the spread, or dispersion, of values in a data set. Range Interquartile Range Variance Standard Deviation Coefficient of Variation
Bina Nusantara
3.2 MEASURES OF VARIATION
A. Range Simplest measure of variation Difference between the largest and the smallest values:
Range = Xlargest – Xsmallest
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Range = 13 - 1 = 12
Bina Nusantara
3.2 MEASURES OF VARIATIONA. RangeDisantavages of range Ignores the way in which data are distributed
Sensitive to outliers
7 8 9 10 11 12
Range = 12 - 7 = 5
7 8 9 10 11 12
Range = 12 - 7 = 5
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 5 - 1 = 4
Range = 120 - 1 = 119
Bina Nusantara
3.2 MEASURES OF VARIATION
B. Interquartile Range Problems caused by outliers can be eliminated by using the
interquartile range. The IQR can eliminate some high and low values and calculate
the range from the remaining values.
Interquartile range = Q3 – Q1
Bina Nusantara
3.2 MEASURES OF VARIATIONC. Variance The variance is the average (approximately) of squared
deviations of values from the mean.
Population
Sample
= meann = sample sizeXi = ith value of the variable X
N
i
i
N
x
1
22
n
i
i
n
xxs
1
22
1x
Bina Nusantara
3.2 MEASURES OF VARIATION
D. Standard Deviation The standard deviation is the root of variance
Population
Sample
N
i
i
N
x
1
22
n
i
i
n
xxss
1
22
1
Bina Nusantara
3.2 MEASURES OF VARIATION
D. Standard DeviationExample : Computing variance and standard deviation of cost of
a fast-food hamburger meal x 2.99 -1.989 3.9561213.70 -1.279 1.6358414.39 -0.589 0.3469214.45 -0.529 0.2798414.62 -0.359 0.1288814.99 0.011 0.0001215.29 0.311 0.0967215.75 0.771 0.5944415.99 1.011 1.0221217.62 2.641 6.974881
1.671
2xxi xxi
10
98.4
n
x 671.1
11
22
n
i
i
n
xxs
293.1671.1 s
Bina Nusantara
3.2 MEASURES OF VARIATION
E. Coefficient of Variance The coefficient of variation is the standard deviation divided by
the mean, multiplied by 100. It is always expressed as a percentage. (%) It shows variation relative to mean. The CV can be used to compare two or more sets of data
measured in different units.
100%X
SCV
Bina Nusantara
3.2 MEASURES OF VARIATIONComparing mean, variance, standard deviation, and CV
Mean = 15.5 s = 3.338 Cv = 0.21511 12 13 14 15 16 17 18 19 20 21
11 12 13 14 15 16 17 18 19 20 21
Data B
Data A
Mean = 15.5 s = 0.9258Cv = 0,060
11 12 13 14 15 16 17 18 19 20 21
Mean = 15.5 s = 4.57Cv = 0,295
Data C
Bina Nusantara
3.3 APPLICATION IN PH STAT EXCELMenu : Data> Data Analysis
Bina Nusantara
3.3 APPLICATION IN PH STAT EXCELInput Range : Column Hamburger DataClick Summary Statistics
Bina Nusantara
3.3 APPLICATION IN PH STAT EXCEL
Output
Bina Nusantara
EXERCISES
1. Calculate the mean, median, quartile, and variance from this data
Motion pictureOpening
Gross SalesTotal Gross
SalesNumber of Theaters
Weeks in Top 60
Coach Carter 29.17 67.25 2574 16Ladies in Lavender 0.15 6.65 119 22Batman Begins 48.75 205.28 3858 18Unleashed 10.9 24.47 1962 8Pretty P 0.06 0.23 24 4Fever P 12.4 42.01 3275 14Harry P & the Goblet of Fire 102.69 287.18 3858 13Monster in Law 23.11 82.89 3424 16White Noise 24.11 55.85 2279 7Mr. & Mrs. Smith 50.34 186.22 3451 21
Bina Nusantara
THANK YOU