Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf ·...

20
Thinking About Chance & Probability Models Chapters 17 and 18 November 14, 2012 Chance Processes Randomness and Probability Probability Models Probability Rules Using Probability Models for Inference How Probability Works

Transcript of Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf ·...

Page 1: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

Thinking About Chance &

Probability ModelsChapters 17 and 18

November 14, 2012

Chance Processes

Randomness and Probability

Probability Models

Probability Rules

Using Probability Models for Inference

How Probability Works

Page 2: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

1.0 Chance Processes

People talk loosely about chance all the time.

For scientific purposes, chance has a precise meaning.

Chance behavior is unpredictable in the shortrun but has a regular and predictable pattern inthe long run.

This definition pre-supposes that the process can berepeated over and over again, independently and underthe same conditions.

Many games fall into this category.

Page 3: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

1.1 Heads or tails?

One simple game of chance involves betting on the tossof a coin.

The result cannot be predicted in advance.

But there is still a regular pattern in the results:proportion of heads seen in the long run.

The following figure shows the results of tossing a coin1,000 times: HTTHHHT . . .

Page 4: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

1.2 Random Sampling

Choose a S.R.S. of size n from a population.

Suppose the unknown parameter is p.

Then p̂ converges to p for large n. (Law of largenumbers)

Then the distribution of the sample estimate p̂converges to a normal density curve in the long run.

p

Page 5: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

2.0 Randomness and Probability

A variable is called a random variable if its values areuncertain but there is some long run stability in theprocess underlying the data.

protagonist helper hinderer

Variable?

Observational units?

Page 6: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

2.1 Example

Variable X: “Whether an infant chose the helper orhinderer”.

Each infant is like one repetition of the experiment,presumably under identical conditions.

If we assume that each infant is making the planned useof chance in their selection, then X is a random variable.This is called choosing at random.

Then the proportion of infants who choose the helper(say) will stabilize in the long run.

This long run stability point is then called theprobability of choosing the helper.

Page 7: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

3.0 Probability Models

A probability model for a random variable describesthe possible outcomes and says how to assignprobabilities to them.

Random variable X: Kind of toy selected.

Table: Probability Model for X

X ProbabilityHelper p

Hinderer (1-p)

Page 8: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

3.1 Probability Model for S.R.S

Suppose we take a S.R.S. of size n from a population.Let p be the unknown proportion of individuals from thepopulation who share a certain characteristic.

Let p̂ be the proportion of individuals in our sample whoshare this characteristic.

Then the Normal density curve assigns probabilities to thepossible values of p̂ upon repeated sampling so long as nis large. The mean of this density is p. The standard

deviation is√

p(1−p)n

.

Page 9: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

4.0 Some Probability Rules

A Any probability is a number between 0 and 1.

For any outcome A, the probability P(A) satisfies0 ≤ P(A) ≤ 1.

B All possible outcomes together must have probability 1.

C The probability that an outcome does not occur is 1minus the probability that it does occur.

D The probability that one or the other outcome occursis the sum of their individual probabilities.

Page 10: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

4.1 Example

Ex 18.8: Select a first year college student at random and askwhat their academic rank was in high school. Here is theprobability model for the random variable X = academic rankof student.

Rank Top20%

Second20%

Third20%

Fourth20%

Lowest20%

Prob. 0.44 0.26 0.23 0.06 0.01

1. Check that this is valid probability model.

2. What is the probability that a randomly chosen first yearcollege student was not in the top 20%?

3. What is the probability that a randomly chosen first yearcollege student was in the top 40%?

Page 11: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

4.2 More Probability Rules

Multiplication Rule: The chance that two things willboth happen equals the chance that the first will happen,multiplied by the chance that the second will happengiven the first has happened.

Two things are independent if the chancesfor the second given the first are the same, nomatter how the first turns out. Otherwise theyare dependent.

Addition Rule: To find the chance that at least one oftwo things will happen, check to see if they are mutuallyexclusive. If they are, add the chances.

Two things are mutually exclusive whenthe occurrence of one prevents the occurrenceof the other; one excludes the other.

Page 12: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

5.0 Lotsa Distributions

(from Prof. June Morita)

= sampling unit

POPULATION SAMPLEdesired information approximation/estimate

parameter

numerical fact re. population

statistic

estimates the parameter

We want to estimate the proportion of individuals in apopulation with a certain characteristic – call it p.Draw an S.R.S. of size n individuals from the population.Compute the sample proportion p̂. Use it to estimate p.

Page 13: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

5.1 Population Distribution

DefinitionThe population distribution of a variable is thedistribution of the values of the variable among all individualsin the population.

Example: Elway Research conducted a poll of 408 randomlyselected Washingtonians and found that 64% (or 261) of theirsample said they would support a temporary sales tax hike.Write the population distribution for the variable: would yousupport a sales tax hike? (yes/no).

(All possible values for the variable and % of time it occurs inthe population.)

Page 14: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

5.2 Distribution of Sample Values

DefinitionThe distribution of the values in the sample showsall possible values seen for the variable in our sample and howoften those values occured.

Example: Elway Research conducted a poll of 408 randomlyselected Washingtonians and found that 64% (or 261) of theirsample said they would support a temporary sales tax hike.Write the distribution of the sample values for the variable:would you support a sales tax hike? (yes/no).

Page 15: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

5.3 Sampling Distribution of Statistic

DefinitionThe sampling distribution of a statistic shows thedistribution of values of the statistic in all possible samples ofsize n drawn from the population.

Example: Elway Research conducted a poll of 408 randomlyselected Washingtonians and found that 64% (or 261) of theirsample said they would support a temporary sales tax hike.Write the sampling distribution of their statistic p̂?

For large sample sizes, p̂ tends to be close to theparameter p. (law of large numbers)

And as the sample size increases, the samplingdistribution for p̂ looks more and more like a normaldistribution. The mean of this density is p. The standard

deviation is√

p(1−p)n

. (central limit theorem)

Page 16: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

5.4 Compare and Contrast

The population distribution describes the individualsthat make up the population.

versus

The distribution of the values in the sampledescribes the individuals that make up the sample;

versus

The sampling distribution describes how a statisticvaries from one sample to another, all drawn from the samepopulation.

Page 17: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

5.4 Example

You ask an S.R.S of 1,500 college students whether theyapplied for admission to any other college. Suppose 500 (33%)students in your sample admitted they had.

1. Population? Sample? Sample Size?

2. Variable?

3. Population distribution of variable?

4. Distribution of variable in sample?

5. What is the statistic (in words)? What is its’ value forthis one sample?

6. What is the sampling distribution of this statistic?

Page 18: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

6.0 Sampling Distribution As a

Probability Model

Sampling Distributions Are Probability Models.

What does this mean exactly? It meansI Imagine having a box full of tickets. Each ticket has a

value of p̂ on it.I How many there are of each p̂ will depend on the

sampling distribution.I We mix up these tickets well and draw one at random.I Then the probability of picking a ticket with a specific p̂

value will be determined by the area given to that valueof p̂ under the sampling distribution.

Check that the sampling distribution for p̂ from example5.4 is a valid probability model.

Page 19: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

6.1 Example

You ask an S.R.S. of 1,500 college students whether theyapplied for admission to any other college. Suppose, in fact,35% of all college students applied to colleges besides theones they are attending.

1 What is the sampling distribution for p̂?

2 What is the probability that if you select at random fromthis distribution, you will get a p̂ less than 0.33?

3 What is the probability that if you select at random fromthis distribution, you will get a p̂ less than 0.33 or greaterthan 0.35?

4 Give the range of the middle 95% of the p̂ values for thisdistribution. Interpret this range.

Page 20: Thinking About Chance & Probability Modelsfaculty.washington.edu/grover4/class13au12notes.pdf · 14/11/2012  · 4.2 More Probability Rules Multiplication Rule: The chance that two

7.0 How Probability Works

The probability of a head in tossing a fair coin is 0.5.

This means that as we make more tosses, theproportion of heads will eventually get close to 0.5.

This does not mean that the count of heads will getclose to half the number of tosses. Why?

Number of Proportion of Number ofof Tosses of heads heads

100 0.51 511,000 0.507 507

10,000 0.5031 5031100,000 0.50074 50,704