1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at...

30
1 Sampling Distribution of the Mean

description

3 The Sample Mean is a Random Variable! Imagine selecting many different samples from one population. Sample 1 Sample 2 Sample 3 Sample j Sample k Unless there’s a miraculous coincidence, we would expect each sample to have a (slightly) different mean

Transcript of 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at...

Page 1: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

1

Sampling Distribution of the Mean

Page 2: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

2

The Sampling Distribution of the Sample Mean

Let’s start at a familiar point: the sample mean, which we know as “x-bar”

Any random sample we might select from a population will, in general, have a different sample mean, right?

In other words, the sample mean varies from sample to sample...

Page 3: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

3

The Sample Mean is a Random Variable!

Imagine selecting many different samples from one population.

Sample 1

Sample 2Sample 3

Sample jSample k

•Unless there’s a miraculous coincidence,

we would expect each sample to have a

(slightly) different mean

Page 4: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

4

The Sample Mean is a Random Variable

So, x-bar varies from sample to sample.

Since the samples are random, x-bar varies due to a random process.

That makes x-bar a random variable!

Page 5: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

5

The Sample Mean is a Random Variable

So, to sum up so far: We can think of a descriptive statistic, like the

sample mean, as a random variable, since its value depends on the luck of the draw in random sampling.

Even though one sample has just one sample mean, this one mean is one of the many possible results of random sampling.

Page 6: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

6

So what is a “Sampling Distribution?”

The distribution of an “Ordinary Random Variable” has

• Shape• Center• Spread

The distribution of a descriptive statistic (like x-bar) has

• Shape• Center• Spread

Any random variable has a probability distribution or (if it’s continuous) a probability density function

Page 7: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

7

So what is a “Sampling Distribution?”

A sampling distribution is nothing more or less than a special case of a probability

distribution.

More specifically, if the random variable happens to be a sample statistic, we call its

distribution a Sampling Distribution

Page 8: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

8

Other Terminology

In a Sampling Distribution,

The shape is called the Sampling DistributionSampling Distribution (it’s often normal)

The center is still called the MeanMean

The spread is called the Standard ErrorStandard Error

The special terms are to keepyou from getting confused!?!

Page 9: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

9

Sampling Distribution of the Sample Mean: SHAPE

As we’ll soon see, if samples are large enough, the sampling distribution of the mean is approximately normal.

Like any normal distribution, it is defined by its mean and standard deviation, or standard errorstandard error

Page 10: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

10

Sampling Distribution of Two Dice

• A sampling distribution is created by looking at

• all samples of size n=2 (i.e. two dice) and their means…

• While there are 36 possible samples of size 2, there are only 11 values for x-bar, and some (e.g. x-bar = 3.5) occur more frequently than others (e.g. x-bar = 1).

Page 11: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

11

• The sampling distribution of x-bar is shown below:

Sampling Distribution of Two Dice…

P( ) 6/36

5/36

4/36

3/36

2/36

1/36

P(

)

11

Page 12: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

12

Compare…

• Compare the distribution of X…

• …with the sampling distribution of x-bar.

• As well, note that:

Page 13: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

13

Sampling Distribution of the Sample Mean: CENTER

The mean of the sampling distribution of the sample mean is simply the population mean,

In other words, the mean of the “average” sample should be the same

as the population mean, μ

Page 14: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

14

Sampling Distribution of the Sample Mean: SPREAD

The dispersion of the sampling distribution (in most cases) is given by this formula:

We call this the STANDARD ERROR OF

Page 15: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

15

Central Limit Theorem…

The sampling distribution of the mean of a random sample drawn from any population is approximately normal for a sufficiently large sample size.

The larger the sample size, the more closely the sampling distribution of x-bar will resemble a normal distribution.

Page 16: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

16

Central Limit Theorem…

If the population is normal, then x-bar is normally distributed for all values of n (n = the number of samples that x-bar is the sample of their means).

If the population is non-normal, then x-bar is approximately normal only for larger values of n.

In many practical situations, a sample size of 30 may be sufficiently large to allow us to use the normal distribution as an approximation for the sampling distribution of x-bar.

Page 17: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

Central Limit TheoremCentral Limit Theorem

Page 18: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

18

Example 1

The foreman of a bottling plant has observed that the amount of soda in each “32-ounce” bottle is actually a normally distributed random variable, with a mean of 32.2 ounces and a standard deviation of 0.3 ounce.

If a customer buys one bottle, what is the probability that the bottle will contain more than 32 ounces?

Page 19: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

19

Example 1

We want to find P(X > 32), where x-bar is normally distributed and = 32.2 and = 0.3μ σ

“there is about a 75% chance that a single bottle of soda contains more than 32oz.”

Page 20: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

20

Example 2

The foreman of a bottling plant has observed that the amount of soda in each “32-ounce” bottle is actually a normally distributed random variable, with a mean of 32.2 ounces and a standard deviation of 0.3 ounce.

If a customer buys a carton of four bottles, what is the probability that the mean amount of the four bottles will be greater than 32 ounces?

Page 21: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

21

We want to find P(x-bar > 32), where X is normally distributed

with = 32.2 and = 0.3μ σThings we know:

1) X is normally distributed, therefore so will x-bar.

2) = 32.2 oz.

3)

Example 2

Page 22: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

22

Example 2

If a customer buys a carton of four bottles, what is the probability that the mean amount of the four bottles will be greater than 32 ounces?

“There is about a 91% chance the mean of the four bottles will exceed 32oz.”

Page 23: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

23

Graphically Speaking…

what is the probability that one bottle will contain more than 32 ounces?

what is the probability that the mean of four bottles will exceed 32 oz?

mean=32.2

Page 24: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

24

Example 3

The dean of the School of Business claims that the average salary of the school’s graduates one year after graduation is $800 per week with a standard deviation of $100.

A second-year student would like to check whether the claim about the mean is correct. He does a survey of 25 people who graduated one year ago and determines their weekly salary. He discovers the sample mean to be $750.

To interpret his finding he needs to calculate the probability that a sample of 25 graduates would have a mean of $750 or less when the population mean is $800 and the standard deviation is $100.

Page 25: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

25

• We want to compute

• Although X is likely skewed it is likely that x-bar is normally distributed. The mean of x-bar is

• The standard deviation is

Example 3

Page 26: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

26

Example 3

The probability of observing a sample mean as low as $750 when the population mean is $800 is extremely small. Because the event is quite unlikely, we would conclude that the dean’s claim is not justified.

Page 27: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

Example 4Example 4

Cola bottles are filled so that Cola bottles are filled so that contents X have a normal distribution contents X have a normal distribution with mean = 298 ml and standard with mean = 298 ml and standard deviation sigma = 3 ml.deviation sigma = 3 ml.

What proportion of bottles have less What proportion of bottles have less than 295 ml?than 295 ml?

Ans: P(X < 295) = P(Z < –1) = 0.1586Ans: P(X < 295) = P(Z < –1) = 0.1586

Page 28: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

Example 4Example 4What is the probability of the average of What is the probability of the average of a six pack of bottles being less than 295 a six pack of bottles being less than 295 ml?ml?Ans: P(x-bar < 295) = Ans: P(x-bar < 295) =

Page 29: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

Example 4Example 4

P(X < 295) = .15, but P(x-bar < 295) = 0.007P(X < 295) = .15, but P(x-bar < 295) = 0.007

As the sample size increases, the variation in the As the sample size increases, the variation in the distribution decreases so that a value like 295 ml is very distribution decreases so that a value like 295 ml is very difficult and rare to occur in an average of a six pack or difficult and rare to occur in an average of a six pack or more of bottles, but could quite easily occur in a single more of bottles, but could quite easily occur in a single bottle.bottle.

Big point: averages have less variation than individual Big point: averages have less variation than individual observations.observations.

Page 30: 1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.

Assignment

p. 247 4.43 - 4.45p. 247 4.43 - 4.45p. 248-250 4.46, 4.48 - 4.50, 4.53p. 248-250 4.46, 4.48 - 4.50, 4.53