Bootstrap Conﬁdence Intervals · Bootstrap Conﬁdence Intervals ST552 Lecture 13 Charlotte...

Bootstrap Confidence Intervals

ST552 Lecture 13

Charlotte Wickham

2019-02-11

Motivation

The inferences we’ve covered so far relied on our assumption of

Normal errors:

‘ ≥ N(0, ‡2In◊n)

For example, we’ve seen under this assumption, the least squares

estimates are also Normally distributed:

— ≥ N3

—, ‡21XT X

2≠14

If the errors aren’t truly Normally distributed, what

distribution do the estimates have?

Warm-up: Your Turn

Imagine the errors are in fact t3 distributed?

With your neighbour: design a simulation to understand the

distribution of the least squares estimates.

[ aGot ichor B

sampling

Some model & - ¥ ,① Assure some things about model : y=XpptE

Decide o→pX ad B ,make something up .

② Compute f 's 100,000 times

a) Simulate y: simulate E

finely y-- X ft E

b) Using y ,fit model : f- ( * X) - '

③ MakeA Red :

We don't

¥ LA'

Example: 1. Fix n, fix X

5 10 15 20x

Response

n = 10

Xnvniffo ,20)

Example: 1. Fix —, find y

5 10 15 20x

Response

X Ely )

f- f 1,05)T

trueregressionline

, Ely )

Example: 2. Simulate errors, find y

5 10 15 20x

Response

Fsi .÷÷:D

Example: 3. Find least squares line

5 10 15 20x

Response

[Fitted

regression line

Example: 4. Repeat #2. and #3. many times

5 10 15 20x

Response

IOne fit fromOne

simulation

Example: Examine distribution of estimates

−5 0 5 10Estimate of intercept

0.00 0.25 0.50 0.75Estimate of slope

fo -- I p,

= o - 5

'siege. .

1000 Go 1000 § ,

Example: Compared to theory

−5 0 5 10Estimate of intercept

0.00 0.25 0.50 0.75Estimate of slope

If Eira Nfo ,o

' ) BN Nfp ,

E # D- ' )

E ; I ¥

Based on simulation :

If error are tz distributed I to

Normal,

We assured on n, X

When the errors aren’t Normal: CLT

Think of our estimates like linear combinations of the errors. I.e. a

sort of average of i.i.d random variables.

Some version of the Central Limit Theorem will apply.

For large samples, even when the errors aren’t Normal,

—≥N(—, ‡2(XT X )

- - + we

approximately

as h - A that approximationwill improve .

Summary so far

If we knew the error distribution and true parameters we could use

simulation to understand the sampling distribution the least

squares estimates.

Simulation can also be used to demonstrate the CLT at work in

regression.

Bootstrap confidence intervals

In practice, with data in front of us, we don’t know the distribution

of the errors (nor the true parameter values).

The bootstrap is one approach to estimate the sampling

distribution of —, by using the simulation idea, and substituting in

our best guesses for the things we don’t know.

Bootstrapping regression

(Model based resampling)

0. Fit model and find estimates, —, and residuals, ei

1. Fix X ,

2. For k = 1, . . . , B2.1 Generate errors, ‘ú

i sampled with replacement from ei2.2 Construct y , using the model, y = y + ‘ú

2.3 Use least squares to find —ú(k)

3. Examine the distribution of —úand compare to —

One confidence interval for —j is the 2.5% and 97.5% quantiles of

the distribution of —új .

(Known as the Percentile method, there are other (better?)methods).

residualsRepeat many

timesy

= x § + E*

Example: Faraway Galapagos Islands

(I’ll illustrate with simple linear regression, Faraway does multiplecase in 3.6)

−200

0 500 1000 1500Elevation

Observed data

:Eth Elevates

Bootstrap: 1. Find —, y , and ei .

−200

0 500 1000 1500Elevation

Observed data

Assume E Species ;-- fo Tf, Eteuitei

fo 'RE Levit

e i←

Bootstrap: Using fixed X , ˆbeta from observed data

−200

0 500 1000 1500Elevation

Observed data

Bootstrap: 2. Resample residuals to construct bootstrapped

response

−200

0 500 1000 1500Elevation

Bootstrapped data

appeala booth- r

yerror EEO,

eco sampledat

random

€ are from previouspage

Bootstrap: 3. Fit regression model to bootstrapped response

−200

0 500 1000 1500Elevation

Bootstrapped data

Bootstrap: 3. Repeat #2. and #3. many times

−200

0 500 1000 1500Elevation

Species

Imay regression

lines fit to many

bootstrappeddatasets

Examine distribution of estimates

−50 0 50Bootstrap estimates of intercept

0.10 0.15 0.20 0.25 0.30 0.35Bootstrap estimates of slope

telling us about2- File sapling dist

↳ / of ei

197 . site

(otI e:*° C- 25,50 ) : 95 's .CI for Po

High level: bootstrap idea

We don’t know the distribution of the errors, but our best guess is

probably the empirical c.d.f on the residuals.

Sampling from a random variable with a c.d.f. defined as the

empirical c.d.f. of the residuals, boils down to sampling with

replacement from residuals.

inn " " " i : "

Limitations

We might rely on bootstrap confidence intervals when we are

worried about the assumption of Normal errors. But, there are

limitations.

• We still rely on the assumption that the errors are

independent and identically distributed.

• Generally scaled residuals are used (residuals don’t have the

same variance, more later)

• An alternative bootstrap resamples the (yi , xi1, . . . , xip)

vectors, i.e. resamples the rows of the data, a.k.a resamplingcases bootstrap.

↳ choice should duped on

Experiment :sresapaEY.akae.de

' 'SJiang say .

Limitations

We might rely on bootstrap confidence intervals when we are

worried about the assumption of Normal errors. But, there are

limitations.

• We still rely on the assumption that the errors are

independent and identically distributed.

• Generally scaled residuals are used (residuals don’t have the

same variance, more later)

• An alternative bootstrap resamples the (yi , xi1, . . . , xip)

vectors, i.e. resamples the rows of the data, a.k.a resamplingcases bootstrap.

Bootstrap Conﬁdence Intervals · Bootstrap Conﬁdence Intervals ST552 Lecture 13 Charlotte...

Documents

Transcript of Bootstrap Conﬁdence Intervals · Bootstrap Conﬁdence Intervals ST552 Lecture 13 Charlotte...

Bootstrap prediction intervals for threshold ... · One-Sample Bootstrap Prediction Intervals for TAR Models Given the nonstandard distribution of prediction errors, we consider bootstrap

Interacting with Remote Systems - Princeton University · 2 non-parameteric bootstrap conﬁdence intervals (voter turnout) 3 parametric bootstrap conﬁdence intervals (voter turnout)

Bootstrap-After-Bootstrap Prediction Intervals for Auto Regressive Models

Conﬁdence Intervals for Predicted Outcomes in Regression ...jslsoc/stata/ci_computations/xulong-prvalue-23aug... · Conﬁdence Intervals for Predicted Outcomes in Regression Models

Estimators and Bootstrap Confidence Intervals for Ruin ...

An Invitation to the Bootstrap : Panacea for Statistical ...€¦ · Statistical Inference: bias and variance estimation, conﬁdence intervals and testing hypothesis. It shows also

On conﬁdence intervals for semiparametric …Stat Comput (2013) 23:135–148 DOI 10.1007/s11222-011-9297-1 On conﬁdence intervals for semiparametric expectile regression Fabian

Rate Optimal Estimation and Conﬁdence Intervals for High ...

A bootstrapped Malmquist index applied to Swedish district ... · 2000). In this study a bootstrap approach is used to determine the conﬁdence intervals of the different components

Bootstrap Confidence Intervals using Percentiles

Nonparametric Confidence Intervals: Nonparametric Bootstrap.

Conﬁdence intervals, t tests, P values

(Better) Bootstrap Confidence Intervals

Conﬁdence Intervals for Extreme Quantile Estimates using ...Conﬁdence Intervals for Extreme Quantile Estimates using Extreme Value Theory Valentin Erismann Supervised by Prof.

Bootstrap Confidence Intervals for Reservoir Model Selection Techniques

Lecture 3: Confidence intervalskeshet/MCB2012/SlidesDodo/DataFitLect3.pdf · Bootstrap confidence intervals: Computational technique based on resampling the errors. Asymptotic confidence

Confidence Intervals for Effect Sizes: Applying Bootstrap ...

Chapter 5 Conﬁdence Intervals and Hypothesis Testingidiom.ucsd.edu/~rlevy/pmsl_textbook/chapters/pmsl_5.pdf · Conﬁdence Intervals and Hypothesis Testing ... • Choose interval

Conﬁdence Intervals - Arkansas State Universitymyweb.astate.edu/sbounds/Statistics_AP/5 Week 5/ELA… · · 2014-03-17Calculate and interpret conﬁdence intervals for one population

Bootstrap Inference in Econometrics - Queen's …qed.econ.queensu.ca/.../papers/cea-presadd-2002.pdfMonte Carlo tests, several types of bootstrap test, and bootstrap conﬁdence intervals.