Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence...

187
Last Time • Binomial Distribution – Excel Computation • Political Polls – Strength of evidence • Hypothesis Testing – Yes – No Questions

Transcript of Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence...

Page 1: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Last Time

• Binomial Distribution– Excel Computation

• Political Polls– Strength of evidence

• Hypothesis Testing– Yes – No Questions

Page 2: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Administrative Matter

• Midterm I, coming Tuesday, Feb. 24

(will say more later)

Page 3: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Reading In Textbook

Approximate Reading for Today’s Material:

Pages 488-491, 317-318

Approximate Reading for Next Class:

Pages 261-262, 9-14, 270-276, 30-34

Page 4: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Haircut?

Why?

Website:

http://www.time.com/time/health/article/0,8599,1733719,00.html

Page 5: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Haircut?

Page 6: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Example: Suppose surgery cures (a certain

type of) cancer 60% of time

Q: is eating apricot pits a more effective cure?

Page 7: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

E.g. Pits vs. Surgery

Let p be “cure rate” of pits

(i.e. proportion of people cured)

Page 8: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

E.g. Pits vs. Surgery

Let p be “cure rate” of pits

(H0 & H1? New method needs to

“prove it’s worth”

so put burden of proof on it)

Page 9: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

E.g. Pits vs. Surgery

Let p be “cure rate” of pits

H0: p < 0.6 vs. H1: p ≥ 0.6

Recall cure rate of surgery

(competing treatment)

Page 10: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

E.g. Pits vs. Surgery

Let p be “cure rate” of pits

H0: p < 0.6 vs. H1: p ≥ 0.6

(OK to be sure of “at least as good”,

since pits nicer than surgery)

Page 11: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Page 12: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose observe X = 11, out of 15 were

cured by pits

Page 13: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose observe X = 11, out of 15 were

cured by pits

I.e.: “best guess about p” is:

733.0ˆ1511 n

Xp

Page 14: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose observe X = 11, out of 15 were

cured by pits

I.e.: “best guess about p” is:

6.0733.0ˆ1511 n

Xp

Page 15: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose observe X = 11, out of 15 were

cured by pits

I.e.: “best guess about p” is:

6.0733.0ˆ1511 n

Xp

Looks Better?

Page 16: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose observe X = 11, out of 15 were

cured by pits

I.e.: “best guess about p” is:

But is it conclusive?

6.0733.0ˆ1511 n

Xp

Page 17: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose observe X = 11, out of 15 were

cured by pits

I.e.: “best guess about p” is:

But is it conclusive?

6.0733.0ˆ1511 n

Xp

Or just due to sampling variation?

Page 18: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach: Define

“p-value” =

Page 19: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach: Define

“p-value” = “observed significance level”

Page 20: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach: Define

“p-value” = “observed significance level”

= “significance probability”

Page 21: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach: Define

“p-value” = “observed significance level”

= “significance probability”

= P[seeing something as

unusual as 11 | H0 is

true]

Page 22: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = “observed significance level”

= P[seeing something as

unusual as 11 | H0 is

true]

Page 23: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = “observed significance level”

= P[seeing something as

unusual as 11 | H0 is

true]

Note: for

Page 24: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = “observed significance level”

= P[seeing something as

unusual as 11 | H0 is

true]

Note: for could use “X/n = 0.733”

Page 25: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = “observed significance level”

= P[seeing something as

unusual as 11 | H0 is

true]

Note: for could use “X/n = 0.733”,

but this depends too much on n

Page 26: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = “observed significance level”

= P[seeing something as

unusual as 11 | H0 is true]

Note: for could use “X/n = 0.733”,

but this depends too much on n

(look at example illustrating this)

Page 27: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Class Example 4

For X ~ Bi(n,0.6):n P(X/n = 0.6) P(X/n >= 0.6)

5 0.346 0.31710 0.251 0.36730 0.147 0.422

100 0.081 0.457300 0.047 0.475

1000 0.026 0.4863000 0.015 0.492

10000 0.008 0.496

Page 28: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Class Example 4

For X ~ Bi(n,0.6):

Computed using

Excel:http://www.stat-or.unc.edu/webspace/courses/marron/UNCstor155-2009/ClassNotes/Stor155Eg4.xls

n P(X/n = 0.6) P(X/n >= 0.6)

5 0.346 0.31710 0.251 0.36730 0.147 0.422

100 0.081 0.457300 0.047 0.475

1000 0.026 0.4863000 0.015 0.492

10000 0.008 0.496

Page 29: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Class Example 4

For X ~ Bi(n,0.6):

Note: these go to 0,

even at “most likely

value”

n P(X/n = 0.6) P(X/n >= 0.6)

5 0.346 0.31710 0.251 0.36730 0.147 0.422

100 0.081 0.457300 0.047 0.475

1000 0.026 0.4863000 0.015 0.492

10000 0.008 0.496

Page 30: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Class Example 4

For X ~ Bi(n,0.6):

Note: these go to 0,

even at “most likely

value”

So “small” is

not conclusive

n P(X/n = 0.6) P(X/n >= 0.6)

5 0.346 0.31710 0.251 0.36730 0.147 0.422

100 0.081 0.457300 0.047 0.475

1000 0.026 0.4863000 0.015 0.492

10000 0.008 0.496

Page 31: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Class Example 4

For X ~ Bi(n,0.6):

But for these

“small”

is conclusive

n P(X/n = 0.6) P(X/n >= 0.6)

5 0.346 0.31710 0.251 0.36730 0.147 0.422

100 0.081 0.457300 0.047 0.475

1000 0.026 0.4863000 0.015 0.492

10000 0.008 0.496

Page 32: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Class Example 4

For X ~ Bi(n,0.6):

But for these

“small”

is conclusive

(so use range,

not value)

n P(X/n = 0.6) P(X/n >= 0.6)

5 0.346 0.31710 0.251 0.36730 0.147 0.422

100 0.081 0.457300 0.047 0.475

1000 0.026 0.4863000 0.015 0.492

10000 0.008 0.496

Page 33: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = “observed significance level”

= P[seeing 11 or more

unusual | H0 is true]

Page 34: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = “observed significance level”

= P[seeing 11 or more

unusual | H0 is true]

So use:

= P[X ≥ 11 | H0 is true]

Page 35: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | H0 is true]

Page 36: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | H0 is true]

What to use here?

Page 37: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | H0 is true]

What to use here?

Recall: H0: p < 0.6

Page 38: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | H0 is true]

What to use here?

Recall: H0: p < 0.6

How does P[X ≥ 11 | p] depend on p?

Page 39: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

How does P[X ≥ 11 | p] depend on p?

Page 40: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

How does P[X ≥ 11 | p] depend on p?

Calculated in Class EG 4b:

http://www.stat-or.unc.edu/webspace/courses/marron/UNCstor155-2009/ClassNotes/Stor155Eg4.xls

p P(X >= 11|p)

0.2 0.000

0.3 0.001

0.4 0.009

0.5 0.059

0.6 0.217

0.7 0.515

0.8 0.836

Page 41: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

How does P[X ≥ 11 | p] depend on p?

Bigger assumed p

goes with

Bigger Probability

i.e. less conclusive

p P(X >= 11|p)

0.2 0.000

0.3 0.001

0.4 0.009

0.5 0.059

0.6 0.217

0.7 0.515

0.8 0.836

Page 42: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | H0 is true] =

= P[X ≥ 11 | p < 0.6]

Page 43: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | H0 is true] =

= P[X ≥ 11 | p < 0.6]

So, to be “sure” of conclusion, use largest

available value of P[X ≥ 11 | p]

Page 44: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | H0 is true] =

= P[X ≥ 11 | p < 0.6]

So, to be “sure” of conclusion, use largest

available value of P[X ≥ 11 | p]

Thus, define:

“p-value” = P[X ≥ 11 | p = 0.6]

Page 45: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | H0 is true] =

= P[X ≥ 11 | p < 0.6]

So, to be “sure” of conclusion, use largest

available value of P[X ≥ 11 | p]

Thus, define:

“p-value” = P[X ≥ 11 | p = 0.6]

(since “=” gives safest result)

Page 46: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6]

Page 47: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6]

Generally: use

= P[seeing something as

unusual as X = 11 | H0 is

true]

Page 48: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6]

Generally: use

= P[seeing something as

unusual as X = 11 | H0 is

true]

Here use boundary between H0 & H1

Page 49: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6]

Generally: use

= P[seeing something as

unusual as X = 11 | H0 is true]

Here use boundary between H0 & H1

(above e.g. p = 0.6)

Page 50: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6]

Now calculate numerical value

Page 51: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6]

Now calculate numerical value

(already done above,

Class EG 4)

Page 52: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

Now calculate numerical value

(already done above,

Class EG 4)

Page 53: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

Now calculate numerical value

(already done above,

Class EG 4)

How to interpret?

Page 54: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

Intuition: p-value reflects chance of error

when H0 is rejected

Page 55: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

Intuition: p-value reflects chance of error

when H0 is rejected

(i.e. when conclusion is made)

Page 56: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

Intuition: p-value reflects chance of error

when H0 is rejected

(i.e. when conclusion is made)

(based on available evidence)

Page 57: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

Intuition: p-value reflects chance of error

when H0 is rejected

(i.e. when conclusion is made)

(based on available evidence)

When p-value is small, it is safe to make a

firm conclusion

Page 58: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

For small p-value, safe to make firm conclusion

Page 59: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

For small p-value, safe to make firm conclusion

How small?

Page 60: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

For small p-value, safe to make firm conclusion

How small?

Approach 1: Traditional (& legal) cutoff

Page 61: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

For small p-value, safe to make firm conclusion

How small?

Approach 1: Traditional (& legal) cutoff

Called here “Yes-No”:

Page 62: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

For small p-value, safe to make firm conclusion

How small?

Approach 1: Traditional (& legal) cutoff

Called here “Yes-No”:

Reject H0 when p-value < 0.05

Page 63: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

For small p-value, safe to make firm conclusion

How small?

Approach 1: Traditional (& legal) cutoff

Called here “Yes-No”:

Reject H0 when p-value < 0.05

(just an agreed upon value,

but very widely used)

Page 64: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

For small p-value, safe to make firm conclusion

How small?

Approach 1: Traditional (& legal) cutoff

Called here “Yes-No”:

Reject H0 when p-value < 0.05

(but sometimes want different values,

e.g. your airplane is safe to fly)

Page 65: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 1: “Yes-No”

Reject H0 when p-value < 0.05

Page 66: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 1: “Yes-No”

Reject H0 when p-value < 0.05

Terminology: say results are “statistically

significant”, when this happens

Page 67: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 1: “Yes-No”

Reject H0 when p-value < 0.05

Terminology: say results are “statistically

significant”, when this happens

Sometimes specify a value α

Greek letter “alpha”

Page 68: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 1: “Yes-No”

Reject H0 when p-value < 0.05

Terminology: say results are “statistically

significant”, when this happens

Sometimes specify a value α

as the cutoff (different from 0.05)

Page 69: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 2: “Gray Level”

Idea: allow “shades of conclusion”

Page 70: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 2: “Gray Level”

Idea: allow “shades of conclusion”

e.g. Do p-val = 0.049 and p-val = 0.051

represent very different levels of evidence?

Page 71: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 2: “Gray Level”

Idea: allow “shades of conclusion”

Use words describing strength of evidence:

0.1 < p-val: no evidence

0.01 < p-val < 0.1 marginal evidence

p-val < 0.01 very strong

evidence

Page 72: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 2: “Gray Level”

Use words describing strength of evidence:

0.1 < p-val: no evidence

0.01 < p-val < 0.1 marginal evidence

p-val < 0.01 very strong

evidence

Page 73: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 2: “Gray Level”

Use words describing strength of evidence:

0.1 < p-val: no evidence

0.01 < p-val < 0.1 marginal evidence

p-val < 0.01 very strong

evidence

stronger when closer to 0.01

Page 74: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Approach 2: “Gray Level”

Use words describing strength of evidence:

0.1 < p-val: no evidence

0.01 < p-val < 0.1 marginal evidence

p-val < 0.01 very strong evidence

stronger when closer to 0.01

weaker when closer to 0.1

Page 75: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

Bottom Line:

Yes-No: can not reject H0, since

0.217 > 0.05

i.e. no firm evidence pits better than

surgery

Gray level: not much indicated

Page 76: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

No firm evidence pits better than

surgery

Gray level: not much indicated

Page 77: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

No firm evidence pits better than

surgery

Gray level: not much indicated

Practical Issue: since 73% = observed rate for

pits > 60% (surgery),

Page 78: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

No firm evidence pits better than

surgery

Gray level: not much indicated

Practical Issue: since 73% = observed rate for

pits > 60% (surgery), may want to gather

more data

Page 79: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

“p-value” = P[X ≥ 11 | p = 6] = 0.217

No firm evidence pits better than

surgery

Gray level: not much indicated

Practical Issue: since 73% = observed rate for

pits > 60% (surgery), may want to gather

more data, might show value of pits

Page 80: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Medical Imaging – Another Fun ExampleMedical Imaging – Another Fun Example

Cornea DataCornea Data

Page 81: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Medical Imaging – Another Fun ExampleMedical Imaging – Another Fun Example

Cornea DataCornea Data

Cornea = Outer surface Cornea = Outer surface

of eyeof eye

Page 82: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Medical Imaging – Another Fun ExampleMedical Imaging – Another Fun Example

Cornea DataCornea Data

Cornea = Outer surface Cornea = Outer surface

of eyeof eye

““Curvature” important toCurvature” important to

visionvision

Page 83: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Medical Imaging – Another Fun ExampleMedical Imaging – Another Fun Example

Cornea DataCornea Data

Cornea = Outer surface Cornea = Outer surface

of eyeof eye

““Curvature” important toCurvature” important to

visionvision

Study Study heat map heat map showingshowing

curvaturecurvature

Page 84: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Heat map Heat map shows curvatureshows curvature

Each image is one personEach image is one person

Page 85: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Heat map Heat map shows curvatureshows curvature

Each image is one personEach image is one person

Understand “populationUnderstand “population

variation”?variation”?

Page 86: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Heat map Heat map shows curvatureshows curvature

Each image is one personEach image is one person

Understand “populationUnderstand “population

variation”?variation”?

(too messy for brain(too messy for brain

to summarize)to summarize)

Page 87: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Approach: PrincipalApproach: Principal

Component AnalysisComponent Analysis

Page 88: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Approach: PrincipalApproach: Principal

Component AnalysisComponent Analysis

Idea: follow “direction” inIdea: follow “direction” in

image space, image space,

Page 89: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Approach: PrincipalApproach: Principal

Component AnalysisComponent Analysis

Idea: follow “direction” inIdea: follow “direction” in

image space, that highlightsimage space, that highlights

population featurespopulation features

Page 90: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Population featuresPopulation features

Page 91: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Population featuresPopulation features

• Overall curvatureOverall curvature

(hot – cold)(hot – cold)

Page 92: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Population featuresPopulation features

• Overall curvatureOverall curvature

(hot – cold)(hot – cold)

• With the rule astigmatismWith the rule astigmatism

(figure 8 pattern)(figure 8 pattern)

Page 93: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Research Corner

Cornea DataCornea Data

Population featuresPopulation features

• Overall curvatureOverall curvature

(hot – cold)(hot – cold)

• With the rule astigmatismWith the rule astigmatism

(figure 8 pattern)(figure 8 pattern)

• CorrelationCorrelation

Page 94: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

(cured by pits)

Page 95: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

(cured by pits)

(recall above saw 11 / 25 not conclusive,

so now suppose stronger evidence)

Page 96: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

So 1513ˆ n

Xp

Page 97: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

So %7.86ˆ1513 n

Xp

Page 98: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

So

(more conclusive than before)

%60%7.86ˆ1513 n

Xp

Page 99: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

So

(more conclusive than before)

(how much stronger is the evidence?)

%60%7.86ˆ1513 n

Xp

Page 100: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

So

p-value = P[ X ≥ 13 | p = 0.6]

%7.86ˆ1513 n

Xp

Page 101: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

So

p-value = P[ X ≥ 13 | p = 0.6] = 0.027

%7.86ˆ1513 n

Xp

Page 102: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

So

p-value = P[ X ≥ 13 | p = 0.6] = 0.027

Calculated similar to above:http://www.stat-or.unc.edu/webspace/courses/marron/UNCstor155-2009/ClassNotes/Stor155Eg4.xls

%7.86ˆ1513 n

Xp

Page 103: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

p-value = P[ X ≥ 13 | p = 0.6] = 0.027

Page 104: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

p-value = P[ X ≥ 13 | p = 0.6] = 0.027

Conclusions:

Yes-No: 0.027 < 0.05, so can reject H0 and

make firm conclusion pits are better

Page 105: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

H0: p < 0.6 vs. H1: p ≥ 0.6

Now suppose X had been 13 out of 15

p-value = P[ X ≥ 13 | p = 0.6] = 0.027

Conclusions:

Yes-No: 0.027 < 0.05, so can reject H0 and

make firm conclusion pits are better

Gray Level: Strong case, nearly very strong that

pits are better

Page 106: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

In General:

p-value = P[what was seen,

or more conclusive | at

boundary between

H0 & H1]

Page 107: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

In General:

p-value = P[what was seen,

or more conclusive | at

boundary between

H0 & H1]

(will use this throughout the course,

well beyond Binomial distributions)

Page 108: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

HW C14: Answer from both gray-level and

yes-no viewpoints:

(a) A TV ad claims that less than 40% of

people prefer Brand X. Suppose 7 out of

10 randomly selected people prefer Brand

X. Should we dispute the claim? (p-value

= 0.055)

Page 109: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

HW C14: Answer from both gray-level and

yes-no viewpoints:

(b) 80% of the sheet metal we buy from

supplier A meets our specs. Supplier B

sends us 12 shipments, and 11 meet our

specs. Is it safe to say the quality of B is

higher? (p-value = 0.275)

Page 110: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”

Page 111: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”, E.g. C14(a)

Page 112: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”, E.g. C14(a)

Find what Excel needs:

Page 113: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”, E.g. C14(a)

Find what Excel needs:

Number_s: 7

Trials: 10

Probability_s: 0.4

Cumulative: true

(plug in)

Page 114: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”, E.g. C14(a)

Check given answer

(0.055)

Page 115: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”, E.g. C14(a)

Check given answer

(0.055)

Way off!

Page 116: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”, E.g. C14(a)

Check given answer

(0.055)

Way off! Try “1 -”

i.e. target (0.945)

Page 117: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”, E.g. C14(a)

Check given answer

(0.055)

Way off! Try “1 -”

i.e. target (0.945)

Still off, how about

the “> vs. ≥” issue?

Page 118: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”, E.g. C14(a)

Check given answer

(0.055)

Way off! Try “1 -”

i.e. target (0.945)

Still off, how about

the “> vs. ≥” issue?

try replacing 7 by 6?

Page 119: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”, E.g. C14(a)

Check given answer

(0.055)

Way off! Try “1 -”

i.e. target (0.945)

Still off, how about

the “> vs. ≥” issue?

try replacing 7 by 6? Yes!

Page 120: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”:

• Can solve HW OK

Page 121: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”:

• Can solve HW OK

• But not on exam

– No numerical answer given

– No interaction with Excel

Page 122: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Warning

Avoid the “Excel Twiddle Trap”:

• Can solve HW OK

• But not on exam

– No numerical answer given

– No interaction with Excel

• Real Goal: Understanding Principles

Page 123: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

And now for something completely different

Lateral Thinking: What is the phrase?

Page 124: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

And now for something completely different

Lateral Thinking: What is the phrase?

Card

Shark

Page 125: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

And now for something completely different

Lateral Thinking:What is the phrase?

Page 126: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

And now for something completely different

Lateral Thinking:What is the phrase?

Knight Mare

Page 127: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

And now for something completely different

Lateral Thinking:What is the phrase?

Page 128: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

And now for something completely different

Lateral Thinking:What is the phrase?

Gator Aide

Page 129: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

In General:

p-value = P[what was seen,

or more conclusive | at

boundary between

H0 & H1]

Page 130: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

In General:

p-value = P[what was seen,

or more conclusive | at

boundary between

H0 & H1]

Caution: more conclusive requires careful

interpretation

Page 131: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Caution: more conclusive requires careful

interpretation

Page 132: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Caution: more conclusive requires careful

interpretation

Reason: Need to decide between

1 - sided Hypotheses

Page 133: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Caution: more conclusive requires careful

interpretation

Reason: Need to decide between

1 - sided Hypotheses, like

H0 : p < vs. H1: p ≥

some given numerical value

Page 134: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Caution: more conclusive requires careful

interpretation

Reason: Need to decide between

1 - sided Hypotheses, like

H0 : p < vs. H1: p ≥

And 2 - sided Hypotheses

Page 135: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Caution: more conclusive requires careful

interpretation

Reason: Need to decide between

1 - sided Hypotheses, like

H0 : p < vs. H1: p ≥

And 2 - sided Hypotheses, like

H0 : p = vs. H1: p ≠

Page 136: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

2 - sided Hypotheses, like

H0 : p = vs. H1: p ≠

Note: Can never have H1: p =

Page 137: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

2 - sided Hypotheses, like

H0 : p = vs. H1: p ≠

Note: Can never have H1: p = ,

since can’t tell for sure between

and + 0.000001

Page 138: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

2 - sided Hypotheses, like

H0 : p = vs. H1: p ≠

Note: Can never have H1: p = ,

since can’t tell for sure between

and + 0.000001

(Recall: H1 has burden of proof)

Page 139: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Caution: more conclusive requires careful

interpretation

1 - sided Hypotheses & 2 - sided

Hypotheses

Page 140: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Caution: more conclusive requires careful

interpretation

1 - sided Hypotheses & 2 - sided

Hypotheses

(important choice will need to make a lot)

Page 141: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Caution: more conclusive requires careful

interpretation

1 - sided Hypotheses & 2 - sided

Hypotheses

Useful Rule: set up 2-sided when problem

uses words like “equal” or “different”

Page 142: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine

• Gambling device

Page 143: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine

• Gambling device

• Players put money in

Page 144: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine

• Gambling device

• Players put money in

• With (small) probability, win a “jackpot”

(of quite a lot more money)

Page 145: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

Page 146: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

(in real life, focus is on “return rate”)

Page 147: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

(in real life, focus is on “return rate”)

(since people enjoy fewer, but bigger jackpots)

Page 148: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

(in real life, focus is on “return rate”)

(since people enjoy fewer, but bigger jackpots)

(but usually no signs,

since return rate is < 0)

Page 149: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any.

Page 150: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any.

Can I conclude sign is false?

Page 151: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any.

Can I conclude sign is false?

(& thus have grounds for complaint,

or is this a reasonable occurrence?)

Page 152: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win]

Page 153: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win]

(usual approach: give unknowns a

name, so can work with)

Page 154: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win], let X = # wins in 10 plays

Page 155: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win], let X = # wins in 10 plays

Model: X ~ Bi(10, p)

Page 156: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win], let X = # wins in 10 plays

Model: X ~ Bi(10, p)

(set up as H0, the point want to disprove)

Page 157: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win], let X = # wins in 10 plays

Model: X ~ Bi(10, p)

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

Page 158: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says “Win

30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win], let X = # wins in 10 plays

Model: X ~ Bi(10, p)

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

(“false” means don’t win 30% of time,

so go 2-sided)

Page 159: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Aside (similar to above):

• Can never set up H0: p ≠ 0.3

Page 160: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Aside (similar to above):

• Can never set up H0: p ≠ 0.3

• And then prove that p = 0.3

Page 161: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Aside (similar to above):

• Can never set up H0: p ≠ 0.3

• And then prove that p = 0.3

• Since can’t handle gray area of hypo test

Page 162: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Aside (similar to above):

• Can never set up H0: p ≠ 0.3

• And then prove that p = 0.3

• Since can’t handle gray area of hypo test

• E.g. can’t distinguish from p = 0.30001

Page 163: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Aside (similar to above):

• Can never set up H0: p ≠ 0.3

• And then prove that p = 0.3

• Since can’t handle gray area of hypo test

• E.g. can’t distinguish from p = 0.30001

• Could always be “off a little bit”

Page 164: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win], let X = # wins in 10 plays

Model: X ~ Bi(10, p)

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

Page 165: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win], let X = # wins in 10 plays

Model: X ~ Bi(10, p)

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

(now test & see how weird X = 0 is, for p = 0.3)

Page 166: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

e.g. a slot machine bears a sign which says

“Win 30% of the time”

In 10 plays, I don’t win any. Conclude false?

Let p = P[win], let X = # wins in 10 plays

Model: X ~ Bi(10, p)

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = P[X = 0 or more conclusive | p = 0.3]

Page 167: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = P[X = 0 or more conclusive | p = 0.3]

Page 168: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = P[X = 0 or more conclusive | p = 0.3]

(understand this by visualizing # line)

Page 169: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = P[X = 0 or more conclusive | p = 0.3]

0 1 2 3 4 5 6

Page 170: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = P[X = 0 or more conclusive | p = 0.3]

0 1 2 3 4 5 6

30% of 10, most likely when p = 0.3

i.e. least conclusive

Page 171: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = P[X = 0 or more conclusive | p = 0.3]

0 1 2 3 4 5 6

so more conclusive includes

Page 172: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = P[X = 0 or more conclusive | p = 0.3]

0 1 2 3 4 5 6

so more conclusive includes

but since 2-sided, also include

Page 173: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Generally how to calculate?

0 1 2 3 4 5 6

Page 174: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Generally how to calculate?

Observed Value

0 1 2 3 4 5 6

Page 175: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Generally how to calculate?

Observed Value

Most Likely Value

0 1 2 3 4 5 6

Page 176: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Generally how to calculate?

Observed Value

Most Likely Value

0 1 2 3 4 5 6

# spaces = 3

Page 177: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Generally how to calculate?

Observed Value

Most Likely Value

0 1 2 3 4 5 6

# spaces = 3

so go 3 spaces in other

direct’n

Page 178: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Result: More conclusive means

X ≤ 0 or X ≥ 6

0 1 2 3 4 5 6

Page 179: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Result: More conclusive means

X ≤ 0 or X ≥ 6

p-value = P[X = 0 or more conclusive | p = 0.3]

Page 180: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Result: More conclusive means

X ≤ 0 or X ≥ 6

p-value = P[X = 0 or more conclusive | p = 0.3]

= P[X ≤ 0 or X ≥ 6 | p = 0.3]

Page 181: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Result: More conclusive means

X ≤ 0 or X ≥ 6

p-value = P[X = 0 or more conclusive | p = 0.3]

= P[X ≤ 0 or X ≥ 6 | p = 0.3]

= P[X ≤ 0] + (1 – P[X ≤ 5])

Page 182: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Result: More conclusive means

X ≤ 0 or X ≥ 6

p-value = P[X = 0 or more conclusive | p = 0.3]

= P[X ≤ 0 or X ≥ 6 | p = 0.3]

= P[X ≤ 0] + (1 – P[X ≤ 5])

= 0.076

Page 183: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Result: More conclusive means

X ≤ 0 or X ≥ 6

p-value = P[X = 0 or more conclusive | p = 0.3]

= P[X ≤ 0 or X ≥ 6 | p = 0.3]

= P[X ≤ 0] + (1 – P[X ≤ 5])

= 0.076

Excel result from:http://www.stat-or.unc.edu/webspace/courses/marron/UNCstor155-2009/ClassNotes/Stor155Eg4.xls

Page 184: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = 0.076

Page 185: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = 0.076

Yes-No Conclusion: 0.076 > 0.05,

so not safe to conclude “P[win] = 0.3”

sign

is wrong, at level 0.05

Page 186: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = 0.076

Yes-No Conclusion: 0.076 > 0.05,

so not safe to conclude “P[win] = 0.3”

sign

is wrong, at level 0.05

(10 straight losses is reasonably likely)

Page 187: Last Time Binomial Distribution –Excel Computation Political Polls –Strength of evidence Hypothesis Testing –Yes – No Questions.

Hypothesis Testing

Test: H0: p = 0.3 vs. H1: p ≠ 0.3

p-value = 0.076

Yes-No Conclusion: 0.076 > 0.05,

so not safe to conclude “P[win] = 0.3”

sign

is wrong, at level 0.05

Gray Level Conclusion: in “fuzzy zone”,

some evidence, but not too strong