FEC 512.05

53
1 Introduction to Hypothesis Testing Istanbul Bilgi University FEC 512 Financial Econometrics-I Asst. Prof. Dr. Orhan Erdem

description

Hypothesis Testing

Transcript of FEC 512.05

Page 1: FEC 512.05

1

Introduction to Hypothesis Testing

Istanbul Bilgi University

FEC 512 Financial Econometrics-I

Asst. Prof. Dr. Orhan Erdem

Page 2: FEC 512.05

2

What is a Hypothesis?

A hypothesis is a claim (assumption) about a population parameter:

population meanExample: The mean age of the citizens of this city is = 50.

Page 3: FEC 512.05

3

Is always about a population parameter, not about a sample statistic

Begin with the assumption that the null hypothesis is true Similar to the notion of innocent until proven guilty

Always contains “=” , “≤” or “” sign May or may not be rejected

The Null Hypothesis, H0

50μ:H 0 50x:H 0

Page 4: FEC 512.05

4

The Alternative Hypothesis, HA

Is the opposite of the null hypothesis e.g.: The mean age of this city is not 50

( HA: 50 )

Never contains the “=” , “≤” or “” sign May or may not be accepted Is generally the hypothesis that is

believed (or needs to be supported) by the researcher

Page 5: FEC 512.05

5

Formulating Hypotheses

Example: The average annual income of citizens of this city is claimed to be $65,000 per year. An analyst would like to test this claim.

What is the appropriate hypothesis test?

Page 6: FEC 512.05

6

Example 1: The average annual income of city A is claimed to be $65,000 per year. An analyst would like to test this claim.

What is the appropriate test?

H0: µ = 65,000 (income is as claimed) status quoHA: µ ≠ 65,000 (income is different than claimed)

The analyst will believe the claim unless sufficient evidence is found to discredit it.

Formulating Hypotheses

Page 7: FEC 512.05

7

A Trial as a Hypothesis Test

Hypothesis testing is very much like a court trial. The null hypothesis is that the defendant is

innocent. We then present the evidence—collect data. Then we judge the evidence—”Could these data

plausibly have happened by chance if the null hypothesis were true?” If they were very unlikely to have occurred, then the

evidence raises more than a reasonable doubt in our minds about the null hypothesis.

Page 8: FEC 512.05

8

What to Do with an “Innocent” Defendant If the evidence is not strong enough to reject the

presumption of innocent, the jury returns with a verdict of “not guilty.” The jury does not say that the defendant is innocent.All it

says is that there is not enough evidence to convict, to reject innocence.

The defendant may, in fact, be innocent, but the jury has no way to be sure.

Said statistically, we will fail to reject the null hypothesis.

We never declare the null hypothesis to be true, because we simply do not know whether it’s true or not.

Page 9: FEC 512.05

Population

Claim: thepopulationmean age is 50.(Null Hypothesis:

REJECT

Supposethe samplemean age is 20: x = 20

SampleNull Hypothesis

20 likely if = 50?Is

Hypothesis Testing Process

If not likely,

Now select a random sample

H0: = 50 )

x

Page 10: FEC 512.05

10

Sampling Distribution of x

= 50If H0 is true

If it is unlikely that we would get a sample mean of this value ...

... then we reject the null

hypothesis that = 50.

Reason for Rejecting H0

20

... if in fact this were the population mean…

x

Page 11: FEC 512.05

11

Level of Significance,

Defines unlikely values of sample statistic if null hypothesis is true Defines rejection region of the sampling

distribution

Is designated by , (level of significance)

Typical values are .01, .05, or .10 Is selected by the researcher at the beginning

Provides the critical value(s) of the test

Page 12: FEC 512.05

12

Level of Significance and the Rejection Region

H0: μ ≥ 50

H1: μ < 500

H0: μ ≤ 50

H1: μ > 50

Represents critical value

Lower-tail test

Level of significance =

0Upper-tail test

Two-tail test

Rejection region is shaded

/2

0

/2H0: μ = 50

H1: μ ≠ 50

Page 13: FEC 512.05

13

Errors in Making Decisions

Type I Error Reject a true null hypothesis Considered a serious type of error

The probability of Type I Error is Called level of significance of the test Set by researcher in advance

Page 14: FEC 512.05

14

Errors in Making Decisions

Type II Error Fail to reject a false null hypothesis

The probability of Type II Error is β

(continued)

Page 15: FEC 512.05

15

Outcomes and Probabilities

Actual SituationDecision

Do NotReject

H0

No error (1 - )

Type II Error ( β )

RejectH0

Type I Error( )

Possible Hypothesis Test Outcomes

H0 False H0 True

Key:Outcome

(Probability) No Error ( 1 - β )

Page 16: FEC 512.05

16

Power of the Test

The power of a test is the probability of rejecting a null hypothesis that is false

i.e., Power = P(Reject H0 | H1 is true)

Power of the test increases as the sample size increases

Page 17: FEC 512.05

17

Hypothesis Tests for the Mean

σ Known σ Unknown

Hypothesis Tests for

Assume first that the population standard deviation σ is known

Page 18: FEC 512.05

18

Test of Hypothesis for the Mean (σ Known)

Convert sample result ( ) to a z value

The decision rule is:

α0

0 z

n

σμx

z if H Reject

σ Known σ Unknown

Hypothesis Tests for

Consider the test

00 μμ:H

01 μμ:H

(Assume the population is normal)

x

Page 19: FEC 512.05

19

Level of Significance and the Rejection Region

H0: μ ≥ 50

HA: μ < 50

50

H0: μ ≤ 50

HA: μ > 50

H0: μ = 50

HA: μ ≠ 50

/2

Lower tail test

Level of significance =

50

/2

Upper tail test Two tailed test

50

-? ? -? ?

Reject H0 Reject H0 Reject H0 Reject H0Do not

reject H0

Do not reject H0

Do not reject H0

Example:Example: Example:

Page 20: FEC 512.05

20

Level of Significance and the Rejection Region

H0: μ ≥ 50

HA: μ < 50

0

H0: μ ≤ 50

HA: μ > 50

H0: μ = 50

HA: μ ≠ 50

/2

Lower tail test

Level of significance =

0

/2

Upper tail test Two tailed test

0

-zα zα -zα/2 zα/2

Reject H0 Reject H0 Reject H0 Reject H0Do not

reject H0

Do not reject H0

Do not reject H0

Example:Example: Example:

Page 21: FEC 512.05

21

Reject H0Do not reject H0

Upper Tail Tests

zα0

μ0

H0: μ ≤ μ0

H1: μ > μ0

Critical value

Z

α0

0 z

n

σμx

z if H Reject

nσ/ZμX if H Reject α00

n

σzμ α0

Alternate rule:

x

Page 22: FEC 512.05

22

Reject H0 Do not reject H0

The cutoff value,

or , is called

a critical value

Lower Tail Tests

-zα

-zα xα

0

μ

H0: μ ≥ μ0

HA: μ < μ0

n

σμ 0 zx

I always find a corresponding z

value to x

Page 23: FEC 512.05

23

There are two cutoff

values (critical values):

or

Do not reject H0 Reject H0Reject H0

Two Tailed Tests

/2

-zα/2

xα/2

± zα/2

xα/2

0μ0

H0: μ= μ0

HA: μ μ0

zα/2

xα/2

n

σμ /20/2 zx

Lower

Upperxα/2

Lower Upper

/2

Page 24: FEC 512.05

24

Example: Upper-Tail Z Test

for Mean ( Known) A supermarket chain manager thinks that

customer age have increased, and now average over 52. The company wishes to test this claim. (Assume = 10 is known)

H0: μ ≤ 52 the average is not over 52 per month

H1: μ > 52 the average is greater than 52 (i.e., sufficient evidence exists to support the manager’s claim)

Form hypothesis test:

Page 25: FEC 512.05

25

Reject H0Do not reject H0

Suppose that = .10 is chosen for this test

Find the rejection region:

= .10

1.280

Reject H0

Example: Find Rejection Region

(continued)

1.28nσ/

μxz if H Reject 0

0

Page 26: FEC 512.05

26

Obtain sample and compute the test statistic

Suppose a sample is taken with the following results: n = 64, x = 53.1 (=10

was assumed known)

Using the sample results,

0.88

64

105253.1

n

σμx

z 0

Example: Sample Results(continued)

Page 27: FEC 512.05

27

Reject H0Do not reject H0

Example: Decision

= .10

1.280

Reject H0

Do not reject H0 since z = 0.88 < 1.28

i.e.: there is not sufficient evidence that the mean age is over 52

z = 0.88

Reach a decision and interpret the result:(continued)

Page 28: FEC 512.05

28

Known

Large Samples

Unknown

Hypothesis Tests for

Small Samples

The test statistic is:

Calculating the Test Statistic

n

sμx

t 1n

But is sometimes approximated using a z:

n

σμx

z

(continued)

Page 29: FEC 512.05

29

Known

Large Samples

Unknown

Hypothesis Tests for

Small Samples

The test statistic is:

Calculating the Test Statistic

n

sμx

t 1n

(The population must be approximately normal)

(continued)

Page 30: FEC 512.05

30

Review: Steps in Hypothesis Testing

1. Specify the population value of interest

2. Formulate the appropriate null and alternative hypotheses

3. Specify the desired level of significance

4. Determine the rejection region

5. Obtain sample evidence and compute the test statistic

6. Reach a decision and interpret the result

Page 31: FEC 512.05

31

Example: Risk and Return of an Equity Mutual Fund (σ unknown) The ABC Fund, that has been in existence for

36 months achieved a monthly return of 1.5% with s=3.6%. This fund was expected to have earned at least 1.9% mean monthly return over that time period. Are the actual results consistent with an underlying population mean return of at least 1.9% with 5% level of significance?

Page 32: FEC 512.05

32

Solution

Test the claim that the true mean # of ABC Fund return is at least 1.9

1. Specify the population value of interest The mean return of ABC

2. Formulate the appropriate null and alternative hypotheses H0: μ 1.9 HA: μ < 1.9 (This is a lower

tail test) 3. Specify the desired level of significance

= .05 is chosen for this test

Page 33: FEC 512.05

33

Reject H0 Do not reject H0

Hypothesis Testing Example 4. Determine the

rejection region

= .05

-tα= -1.68 0

This is a one-tailed test with = .05. Since σ is not known, the cutoff value is a t value. But since we have 36 data, z-value can also be used.

Reject H0 if t < t = -1.68 ; otherwise do not reject H0

(continued)

Page 34: FEC 512.05

34

5. Obtain sample evidence and compute the test statistic sample is taken with the following results: n = 36, x = 1.5

Then the test statistic is:

Hypothesis Testing Example

66.0

36

6.39.15.1

n

sx

t

Page 35: FEC 512.05

35

Reject H0 Do not reject H0

= .05

-1.68 0

6. Reach a decision and interpret the result

-0.66

Since t = -1.68 <-0.66, we fail to reject the null hypothesis that the mean return for ABC Fund is at least 1.9%

Hypothesis Testing Example(continued)

z

Page 36: FEC 512.05

36

Smallest value of for which H0 can

be rejected

Obtain the p-value from a table or computer

Compare the p-value with

If p-value < , reject H0

If p-value , do not reject H0

p-Value Approach to Testing

Page 37: FEC 512.05

37

Reject H0

= .10

Do not reject H0 1.28

0

Reject H0

Z = .88

Calculate the p-value and compare to (assuming that μ = 52.0)

(continued)

.1894

.810610.88)P(z

6410/

52.053.1zP

52.0) μ | 53.1xP(

p-value = .1894

Example: p-Value Solution

Do not reject H0 since p-value = .1894 > = .10

Page 38: FEC 512.05

38

p-value =0.26

= .05

p-value example

Example: How likely is it to see a sample mean of 0.93 (or something further below the mean) if the true mean is = 1.9?

-1.68

-0.66

x

Page 39: FEC 512.05

39

Compare the p-value with

If p-value < , reject H0

If p-value , do not reject H0

Here: p-value = 0.26 = .05

Since .05 < 0.26, we fail to reject the null hypothesis

(continued)p-value example

Page 40: FEC 512.05

40

Reject H0: μ 52

Do not reject H0 : μ 52

Type II Error (Revisited)

Type II error is the probability of

failing to reject a false H0

5250

Suppose we fail to reject H0: μ 52 when in fact the true mean is μ = 50

Page 41: FEC 512.05

41

Reject H0: 52

Do not reject H0 : 52

Type II Error

Suppose we do not reject H0: 52 when in fact the true mean is = 50

5250

This is the true distribution of x if = 50

This is the range of x where H0 is not rejected

(continued)

Page 42: FEC 512.05

42

Reject H0: μ 52

Do not reject H0 : μ 52

Type II Error

Suppose we do not reject H0: μ 52 when in fact the true mean is μ = 50

5250

β

Here, β = P( x cutoff ) if μ = 50

(continued)

Page 43: FEC 512.05

43

Reject H0: μ 52

Do not reject H0 : μ 52

Suppose n = 64 , σ = 6 , and = .05

5250

So β = P( x 50.766 ) if μ = 50

Calculating β

50.76664

61.64552

n

σzμxcutoff

(for H0 : μ 52)

50.766

Page 44: FEC 512.05

44

Reject H0: μ 52

Do not reject H0 : μ 52

.1539.3461.51.02)P(z

646

5050.766zP50)μ|50.766xP(

Suppose n = 64 , σ = 6 , and = .05

5250

Calculating β(continued)

Probability of type II error:

β = .1539

Page 45: FEC 512.05

45

Type I & II Error Relationship

Type I and Type II errors can not happen at the same time

Type I error can only occur if H0 is true

Type II error can only occur if H0 is false

If Type I error probability ( ) , then

Type II error probability ( β )

Page 46: FEC 512.05

46

Factors Affecting Type II Error All else equal,

β when the difference between hypothesized parameter and its true value

β when

β when σ

β when n

Page 47: FEC 512.05

47

Making Errors

How often will a Type I error occur? Since a Type I error is rejecting a true null

hypothesis, the probability of a Type I error is our level.

When H0 is false and we reject it, we have done the right thing. A test’s ability to detect a false hypothesis is

called the power of the test.

Page 48: FEC 512.05

48

Making Errors

When H0 is false and we fail to reject it, we have made a Type II error. We assign the letter to the probability of this

mistake. It’s harder to assess the value of because we

don’t know what the value of the parameter really is.

There is no single value for --we can think of a whole collection of ’s, one for each incorrect parameter value.

Page 49: FEC 512.05

49

Making Errors (cont.)

One way to focus our attention on a particular is to think about the effect size. Ask “How big a difference would matter?”

We could reduce for all alternative parameter values by increasing . This would reduce but increase the chance of a Type I

error. This tension between Type I and Type II errors is

inevitable.

Page 50: FEC 512.05

50

Power

The power of a test is the probability that it correctly rejects a false null hypothesis.

When the power is high, we can be confident that we’ve looked hard enough at the situation.

The power of a test is 1 – .

Page 51: FEC 512.05

51

Reject H0: μ 52

Do not reject H0 : μ 52

Type II Error

Suppose we do not reject H0: μ 52 when in fact the true mean is μ = 50

52

50

β

(continued)

Power

Page 52: FEC 512.05

52

Power (cont.)

Whenever a study fails to reject its null hypothesis, the test’s power comes into question.

When we calculate power, we imagine that the null hypothesis is false.

The value of the power depends on how far the truth lies from the null hypothesis value. The distance between the null hypothesis value,

p0, and the truth, p, is called the effect size. Power depends directly on effect size.

Page 53: FEC 512.05

53

Reducing Both Type I and Type II Error The previous figure seems to show that if we

reduce Type I error, we must automatically increase Type II error.

But, we can reduce both types of error by making both curves narrower.

How do we make the curves narrower? Increase the sample size.