1 Nonparametric Methods III Henry Horng-Shing Lu Institute of Statistics National Chiao Tung...

Nonparametric Methods III

Henry Horng-Shing LuInstitute of Statistics

National Chiao Tung Universityhslu@stat.nctu.edu.tw

http://tigpbp.iis.sinica.edu.tw/courses.htm

PART 4: Bootstrap and Permutation Tests Introduction References Bootstrap Tests Permutation Tests Cross-validation Bootstrap Regression ANOVA

References Efron, B.; Tibshirani, R. (1993). An Introduction

to the Bootstrap. Chapman & Hall/CRC. http://cran.r-project.org/doc/contrib/Fox-Co

mpanion/appendix-bootstrapping.pdf http://cran.r-project.org/bin/macosx/2.1/chec

k/bootstrap-check.ex http://bcs.whfreeman.com/ips5e/content/cat

_080/pdf/moore14.pdf

Hypothesis Testing (1) A statistical hypothesis test is a method of m

aking statistical decisions from and about experimental data.

Null-hypothesis testing just answers the question of “how well the findings fit the possibility that chance factors alone might be responsible.”

This is done by asking and answering a hypothetical question.

http://en.wikipedia.org/wiki/Statistical_hypothesis_testing

Hypothesis Testing (2) Hypothesis testing is largely the product of Ronald Fisher,

Jerzy Neyman, Karl Pearson and (son) Egon Pearson. Fisher was an agricultural statistician who emphasized rigorous experimental design and methods to extract a result from few samples assuming Gaussian distributions. Neyman (who teamed with the younger Pearson) emphasized mathematical rigor and methods to obtain more results from many samples and a wider range of distributions. Modern hypothesis testing is an (extended) hybrid of the Fisher vs. Neyman/Pearson formulation, methods and terminology developed in the early 20th century.

Hypothesis Testing (3)

Hypothesis Testing (7) Parametric Tests:

Nonparametric Tests: Bootstrap Tests Permutation Tests

Confidence Intervals vs. Hypothesis Testing (1)

Interval estimation ("Confidence Intervals") and point estimation ("Hypothesis Testing") are two different ways of expressing the same information.

http://www.une.edu.au/WebStat/unit_materials/c5_inferential_statistics/confidence_interv_hypo.html

If the exact p-value is reported, then the relationship between confidence intervals and hypothesis testing is very close. However, the objective of the two methods is different: Hypothesis testing relates to a single

conclusion of statistical significance vs. no statistical significance.

Confidence intervals provide a range of plausible values for your population.

http://www.nedarc.org/nedarc/analyzingData/advancedStatistics/convidenceVsHypothesis.html

Which one? Use hypothesis testing when you want to do a

strict comparison with a pre-specified hypothesis and significance level.

Use confidence intervals to describe the magnitude of an effect (e.g., mean difference, odds ratio, etc.) or when you want to describe a single sample.

http://www.nedarc.org/nedarc/analyzingData/advancedStatistics/convidenceVsHypothesis.html

P-value

http://bcs.whfreeman.com/ips5e/content/cat_080/pdf/moore14.pdf

Achieved Significance Level (ASL)

Definition:

The (ASL) is defined as:

ˆ ˆASL = ( | H ).

The smaller ASL, the stronger is the evidence of false.

achieved significance

e ASL is an estimate o

the p-value by

f the test

uation and bootstrap methods.

Definition

A is a way of deciding whether or not the data decisively

reject the h

hypoth

ypothe

https://www.cs.tcd.ie/Rozenn.Dahyot/453Bootstrap/05_Permutation.pdf

Bootstrap Tests Methodology Flowchart R code

Bootstrap Tests Beran (1988) showed that bootstrap inference

is refined when the quantity bootstrapped is asymptotically pivotal.

It is often used as a robust alternative to inference based on parametric assumptions.

http://socserv.mcmaster.ca/jfox/Books/Companion/appendix-bootstrapping.pdf

Hypothesis Testing by a Pivot

- 1. : (0, 1), ( , ) , , .

-2. : = (0, 1) , ( , ),

Examples

A pivot Z N when X iid N and X is knownn

XAn asymptotic pivot T N as n when X iid N

Xwhere X is unknown and S

http://en.wikipedia.org/wiki/Pivotal_quantity

Pivot or pivotal quantity: a function of observations whose distribution does

not depend on unknown parameters.

T statistics can be regarded as a pivot or an asymptotic pivotal when the data are normally distributed.

Bootstrap T tests can be applied when the data are not normally distributed.

One Sample Bootstrap Tests

Bootstrap T tests Flowchart R code

ˆˆ ( , , ..., ) ( ),

ˆ( )ndata x x x x x s x and t

Bootstrap B times*Bx*2x*1x

· *0#{ }/Boot bASL t t B

Flowchart of Bootstrap T Tests

Bootstrap T Tests by R

An Example of Bootstrap T Tests by R

Bootstrap Tests by The “BCa” The BCa percentile method is an efficient met

hod to generate bootstrap confidence intervals.

There is a correspondence between confidence intervals and hypothesis testing.

So, we can use the BCa percentile method to test whether H0 is true.

Example: use BCa to calculate p-value

Use R package “boot.ci(boot)” Use R package “bcanon(bootstrap)” http://qualopt.eivd.ch/stats/?page=bootstrap http://www.stata.com/capabilities/boot.html

BCa Confidence Intervals:

http://finzi.psych.upenn.edu/R/library/boot/DESCRIPTION

An Example of “boot.ci(boot)” in R

http://finzi.psych.upenn.edu/R/library/bootstrap/DESCRIPTION

An example of “bcanon(bootstrap)” in R

BCa by http://qualopt.eivd.ch/stats/?page=bootstrap

Use BCa to calculate p-value by R

Two Sample Bootstrap Tests Flowchart R code

Bootstrap B times* * *1 1 1( , )d y x

1 2 1: ( , y , ..., y )nSample yy

* * *1 1 1ˆ ( ) ( )s s y x

· *ˆ ˆ (#( )) /Boot bASL B

Flowchart of Two-Sample Bootstrap Tests

1 2 2: ( , x , ..., x )mSample xx

1 2 1ˆ : ( , , ..., , , ..., ) ( , ) ( ) ( )n n n mcombined data d d d d d s s d y x y x

* * *2 2 2( , )d y x

* * *2 2 2

ˆ ( ) ( )s s y x * * *ˆ ( ) ( )B B Bs s y x

m+n=Ncombine

* * *( , )B B Bd y x

Two-Sample Bootstrap Tests by R

Output (1)

Output (2)

Permutation Tests Methodology Flowchart R code

Permutation In several fields of mathematics, the term

permutation is used with different but closely related meanings. They all relate to the notion of (re-)arranging elements from a given finite set into a sequence.

http://en.wikipedia.org/wiki/Permutation

Permutation Tests Permutation test is also called a

randomization test, re-randomization test, or an exact test.

If the labels are exchangeable under the null hypothesis, then the resulting tests yield exact significance levels.

Confidence intervals can then be derived from the tests.

The theory has evolved from the works of R.A. Fisher and E.J.G. Pitman in the 1930s.

http://en.wikipedia.org/wiki/Pitman_permutation_test

Applications of Permutation Tests (1)

We can use a permutation test only when we can see how to resample in a way that is consistent with the study design and with the null hypothesis.

Two-sample problems when the null hypothesis says that the two populations are identical. We may wish to compare population means, proportions, standard deviations, or other statistics.

Matched pairs designs when the null hypothesis says that there are only random differences within pairs. A variety of comparisons is again possible.

Relationships between two quantitative variables when the null hypothesis says that the variables are not related. The correlation is the most common measure of association, but not the only one.

Applications of Permutation Tests (2)

A tradionnal way is to consider some hypotheses: ~ ( , )

and ~ ( , ), and the null hypothesis becomes = .

ˆUnder , the statistic . = - can be modelled as a normal

distribution with mean

ˆ ˆ( )

1 1 0 and variance ( ).

The ASL is then computed by

ˆ ASL=2

when is unknown and has to be estimated from the data by

( ) ( ).

ai a bi bi i

X X X X

0ill reject if ASL > .H

Inference by Permutation Tests

Flowchart of The Permutation Test for Mean Shift in One Sample

1 2 1 2 , , ..., , , , ..., n n n n mSample x x x x x x

Partition 2 subset B times

(treatment group)

(control group) (treatment group)

(control group)

* * *1 2

ˆ ( ) ( )b b bs s x x

11G 12G

1x 2x11O 12O

1 2ˆ ( ) ( )s s x x

· *ˆ ˆ (#( )) / , and NPerm b nASL B B C

21G 22G

1BG 2BG

An Example for One Sample Permutation Test by R

http://mason.gmu.edu/~csutton/EandTCh15a.txt

An Example of Output Results

1 2 2: ( , x , ..., x )mSample xx

Partition subset B times

treatment

subgroup

control

subgroup

11G 12G

1 2 1: ( , y , ..., y )nSample yy

1 2 1ˆ : ( , , ..., , , ..., ) ( , ) ( ) ( )n n n mcombined data d d d d d s s d y x y x

combine

* * *ˆ ( ) ( )b b bs s x y

· *ˆ ˆ (#( )) / , and NPerm b nASL B B C

Flowchart of The Permutation Test for Mean Shift in Two Samples

21G 22G

1BG 2BG

treatment

subgroup

control

subgroup

Bootstrap Tests vs. Permutation Tests Very similar results between the

permutation test and the bootstrap test. is the exact probability when . is not an exact probability but is

guaranteed to be accurate as an estimate of the ASL, as the sample size B goes to infinity.

PermASL

BootASL

Cross-validation Methodology R code

Cross-validation Cross-validation, sometimes called rotation es

timation, is the statistical practice of partitioning a sample of data into subsets such that the analysis is initially performed on a single subset, while the other subset(s) are retained for subsequent use in confirming and validating the initial analysis. The initial subset of data is called the training set. the other subset(s) are called validation or testing s

http://en.wikipedia.org/wiki/Cross-validation

Overfitting Problems In statistics, overfitting is fitting a statistical model that has too

many parameters. When the degrees of freedom in parameter selection exceed the i

nformation content of the data, this leads to arbitrariness in the final (fitted) model parameters which reduces or destroys the ability of the model to generalize beyond the fitting data.

The concept of overfitting is important also in machine learning. In both statistics and machine learning, in order to avoid overfitti

ng, it is necessary to use additional techniques (e.g. cross-validation, early stopping, Bayesian priors on parameters or model comparison), that can indicate when further training is not resulting in better generalization.

http://en.wikipedia.org/wiki/Overfitting

library(bootstrap)

?crossval

An Example of Cross-validation by R

output

Bootstrap Regression Bootstrapping pairs:

Resample from the sample pairs { }. Bootstrapping residuals:

1. Fit by the original sample and obtain the residuals.2. Resample from residuals.

, i ix y

ˆi iy x

Bootstrapping Pairs by R

http://www.stat.uiuc.edu/~babailey/stat328/lab7.html

Output

Bootstrapping Residuals by R

http://www.stat.uiuc.edu/~babailey/stat328/lab7.html

Bootstrapping residuals

ANOVA When random errors follow a normal

distribution: When random errors do not follow a

Normal distribution: Bootstrap tests:Permutation tests:

An Example of ANOVA by R (1) Example

Twenty lambs are randomly assigned to three different diets. The weight gain (in two weeks) is recorded. Is there a difference among the diets?

Reference http://mcs.une.edu.au/~stat261/Bootstrap/

bootstrap.R

An Example of ANOVA by R (1)

Output (1)

Output (2)

Output (3)

Output (4)

Output (5)

Output (6)

Output (7)

The Second Example of ANOVA by R (1)

Data source http://finzi.psych.upenn.edu/R/library/rpart/html/kyp

hosis.html Reference

http://www.stat.umn.edu/geyer/5601/examp/parm.html Kyphosis is a misalignment of the spine. The data are on 8

3 laminectomy (a surgical procedure involving the spine) patients. The predictor variables are age and age^2 (that is, a quadratic function of age), number of vertebrae involved in the surgery and start the vertebra number of the first vertebra involved. The response is presence or absence of kyphosis after the surgery (and perhaps caused by it).

Output (1)

Data = kyphosis

Output (2)

Output (3)

Output (4)

Output (5)

#deviance

#p-value

Output (6)

Exercises: Write your own programs similar to those

examples presented in this talk.

Write programs for those examples mentioned at the reference web pages.

Write programs for the other examples that you know.

Practice Makes Perfect!81

1 Nonparametric Methods III Henry Horng-Shing Lu Institute of Statistics National Chiao Tung...

Documents

Transcript of 1 Nonparametric Methods III Henry Horng-Shing Lu Institute of Statistics National Chiao Tung...

Duen Horng (Polo) Chau - Georgia Institute of Technologypoloclub.gatech.edu/cse6242/2018spring/slides/CSE6242-710... · Duen Horng (Polo) Chau Assistant Professor Associate Director,

130125 hslu ad_wordspräsi

Stepping Stones: Principal Career Paths and School … STONES: PRINCIPAL CAREER PATHS AND SCHOOL OUTCOMES ... (Horng 2009; Loeb, ... and Horng 2010). 2.1 Effects of Leadership TurnoverPublished

Nonparametric Methods II 1 Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University hslu@stat.nctu.edu.tw .

Instructor: Assoc. Prof. Chung- Horng Lung Group members : Qui Nguyen, Xiaolin Li,

1 Nonparametric Methods II Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University hslu@stat.nctu.edu.tw .

Maximum Likelihood Estimates and the EM Algorithms I Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University hslu@stat.nctu.edu.tw.

Page 1 Environmental Study through NARL Synergy & CEPERC Associate Researcher Jyh-Horng Wu.

AnttiKnowles Horng-TzerYau February12,2017 arXiv:1503 ...

A Spiritual Journey through Breast Cancer Chi-Mei Horng Chi-Mei Horng Associate Professor Associate Professor Department of Clinical Psychology Department.

Horng H Chen MD on behalf of the NHLBI Heart Failure Clinical Research Network

HSLU T&A Jahrbuch 2007/08

Horng-Chyi HorngStatistics II_Five43 Inference on the Variances of Two Normal Population &5-5 (&9-5)

Introduction to Epidemiology Instructor: Guan-Hua Huang, Ph.D. E-mail: ghuang@stat.nctu.edu.twghuang@stat.nctu.edu.tw Class meetings: Wednesday 1:30-4:30.

1 Bayesian Methods with Monte Carlo Markov Chains III Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University hslu@stat.nctu.edu.tw.

Syaiful Muazir / Horng-Chang Hsieh Lagging yet strategic ...

Horng-Chyi HORNG and Shu-Pei HU

The Preferred Principal - admin.kasa.orgadmin.kasa.org/2012_Summer_Institute/Documents/Ed_Seesion_Handouts...The Preferred Principal: Leadership Traits, Style, and Gender ... (Horng

Proteasome-Related HslU and HslV Genes Typical of ...

V1 Jan11/MD · Drupal 7 Features + Possibilities Requirements & Modules Missing Functionalities HSLU – University of applied Sciences Term Paper. Properties – Goals Enterprise