A113 6P P14 Salary
-
Upload
tan-jia-mei -
Category
Documents
-
view
221 -
download
0
Transcript of A113 6P P14 Salary
-
8/3/2019 A113 6P P14 Salary
1/14
Copyright 2009
A113 Mathematics
Problem 14: Salary
6th
Presentation
-
8/3/2019 A113 6P P14 Salary
2/14
Recap
Mean is the average of a set of numerical values.
Standard deviation
A measure of dispersion of data (how spread outyour data is) from the mean
A smaller standard deviation value will indicate thatmost of the data points are nearer to the mean.
Variance is the square of standard deviation.
-
8/3/2019 A113 6P P14 Salary
3/14
Frequency table
The frequency table shows all possible outcomes of an eventand the number of times each outcome occurs.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Salary Range ($) Frequency Relative Frequency
1000
-
8/3/2019 A113 6P P14 Salary
4/14
0
0.05
0.1
0.15
0.2
0.25
Relative frequency histogram
The distribution of the individual entries is as shown below:
Salary Range ($)
R
elativeFrequency
Mean = 1494
Standard deviation = 289.6
Variance = 83849
Distribution of the individual entries of salary
The distribution showsthat all outcomes have
about the same chanceof occurrence for ourdata.
The mean of thedistribution isapproximately $1494.
The variance andstandard deviation are83849 and 289.6respectively.
-
8/3/2019 A113 6P P14 Salary
5/14
Sample size
Sample size is the number of observations thatconstitute the sample.
For example, when we are calculating the averageof 2 entries, the number of observations is 2.Hence, the sample size will be 2.
Likewise, if we are calculating the average of 30entries, then the sample size will be 30.
-
8/3/2019 A113 6P P14 Salary
6/14
0
0.05
0.1
0.15
0.2
0.25
Distribution of the averages of two entries of salary The relative frequencyis the lowest at the twotails of the distribution,and it increases as wemove towards the mean.
The mean of thedistribution isapproximately $1494,i.e. the mean of thedistribution of the
individual entries.
The variance isreduced by a factor of 2,which is the number ofentries in each sample.
Mean = 1501
Number ofentries in a
sample
Standard deviation = 202.3
Variance = 40941( 83849 / 2 )
The distribution of the average of 2 entries is as shown below:
Average of 2 entries
RelativeFrequency
Salary Range ($)
Variance ofindividual
entriesdistribution
-
8/3/2019 A113 6P P14 Salary
7/14
0
0.05
0.1
0.15
0.2
0.25
Distribution of the averages of ten entries of salary
The distribution of the average of 10 entries is as shown below:
The spread of the
distribution is furtherreduced with the highestfrequency occurring atthe mean.
The mean of thedistribution isapproximately $1494.
The variance is reducedproportionally by thenumber of entries in eachsample.
Standard deviation = 94.0
Variance = 8834
( 83849 / 10 )
Average of 10 entries
RelativeFrequency
Salary Range ($)
Mean = 1499
-
8/3/2019 A113 6P P14 Salary
8/14
0
0.05
0.1
0.15
0.2
0.25
Distribution of the averages of thirty entries of salary
The probability ofobtaining an averagethat is approximately
$1494 increases withthe number of entriesin each sample.
The mean isapproximately $1494
and the spread of thedistribution becomeseven smaller.
Mean = 1500
Standard deviation = 52.9
Variance = 2796
( 83849 / 30 )
Average of 30 entries
The distribution of the average of 30 entries is as shown below:
RelativeFrequency
Salary Range ($)
-
8/3/2019 A113 6P P14 Salary
9/14
Central Limit Theorem
The Central Limit Theorem states that if the samplesize is large ( 30),the shape of the histogram of thesample means will resemble a bell-shaped curve,also known as the Normal distribution curve.
NormalDistribution Curve
E.g.:
RelativeFrequency
Salary Range ($)
$1501 $1499 $1500
-
8/3/2019 A113 6P P14 Salary
10/14
Normal distribution
The Normal curve is symmetrical about its mean. It is described by its mean and its standard deviation
(or variance).
The area under the Normal distribution curve represents theprobability of an event occurring where the total area is 1.
Mean
-
8/3/2019 A113 6P P14 Salary
11/14
Normal distribution
The height of a population is normally distributed. It can bemodeled by a Normal distribution curve.
For the example above, there is a 10% chance for a randomlychosen individual to have a height exceeding 172 cm.
Mean172 cm163 cm
90%
10%
-
8/3/2019 A113 6P P14 Salary
12/14
Estimating average
From the discussions made so far, the distribution of theaverage of a reasonably large sample size approaches aNormal distribution and the spread of the data will be verynarrow.
As the sample size gets larger, the average obtained from thissample is less likely to deviate too far from the actual averageof the population.
Thus, to estimate the average of a population, we can use theaverage computed from a randomly selected large sample( 30) of the population.
In conclusion, adopting Peters suggestion is an efficient and
effective way of solving Serenes problem.
-
8/3/2019 A113 6P P14 Salary
13/14
Learning points
Understand the relationship between frequency andrelative frequency
Understand the relationship between relative frequencyand probability
Understand that for large sample size ( 30), the Normal
distribution curve is a fairly accurate representation ofthe distribution of the sample means
Understand that the variance of the distribution ofsample means is inversely proportional to the samplesize
-
8/3/2019 A113 6P P14 Salary
14/14
Discussion
Suppose the distribution of the individual entries is as follows:
Suggest how the distribution of average might look like whensample size is large (30).
0
0.05
0.1
0.15
0.2
0.25
Mean = 1202.5Standard deviation = 145.7
Variance = 21230
Distribution of the individual entries of salary
Salary Range ($)
RelativeFrequency