Preferences of goodness-of-fit tests: A survey about the analysis of nonparametric power functions

Arnold Janssen

Mathematical Institute, Heinrich-Heine-University Duesseldorf

Ulm, September 2015


Table of contents

1 Introduction

2 Local comparison of asymptotic nonparametric power functions

3 Global nonparametric power functions are almost flat

4 Tests for hazard rates

5 Recommendation in practice

6 Making two-sample GOF tests distribution free if ties are present

7 References


Introduction


Kolmogorov-Smirnov (KS) test

asymptotics of a normalized empirical process under the null (critical regions); $B_0$: Brownian bridge

omnibus tests are typically consistent
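To make the role of the Brownian bridge concrete, here is a minimal sketch for the uniform null (data already quantile-transformed to [0, 1]). It assumes the standard series for the distribution of the supremum of |B0|; the function names `ks_statistic`, `kolmogorov_sf` and `ks_asymptotic_p_value` are illustrative and do not come from the talk.

```python
import math

def ks_statistic(xs):
    """Two-sided KS distance sup_t |F_n(t) - t| for data on [0, 1] (uniform null)."""
    xs = sorted(xs)
    n = len(xs)
    # The supremum is attained at an order statistic, from the right or the left.
    return max(max((i + 1) / n - x, x - i / n) for i, x in enumerate(xs))

def kolmogorov_sf(x, terms=100):
    """P(sup_t |B0(t)| > x) via the series 2 * sum_{k>=1} (-1)^(k-1) exp(-2 k^2 x^2)."""
    if x <= 0:
        return 1.0
    s = 2.0 * sum((-1) ** (k - 1) * math.exp(-2.0 * k * k * x * x)
                  for k in range(1, terms + 1))
    # Clamp to [0, 1]; the truncated series is inaccurate for very small x.
    return min(1.0, max(0.0, s))

def ks_asymptotic_p_value(xs):
    """Asymptotic p-value based on sqrt(n) * D_n and the Brownian bridge limit."""
    return kolmogorov_sf(math.sqrt(len(xs)) * ks_statistic(xs))
```

The p-value is only asymptotic; for small samples an exact or simulated null distribution would be more appropriate.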


What is the difference for more general critical lines?

Which type of tests can be recommended?

The statistician should analyze the goodness-of-fit tests of his computer package in order to get some knowledge and an impression about their preferences.


Discussion under local alternatives

about nonparametric power functions
classification of tests: preferences for special alternatives
meaning and statistical interpretation of preferred alternatives


History

Hájek/Šidák (1967): asymptotic power of two-sample linear rank tests

Pitman asymptotic relative efficiency (ARE)
efficiency (ARE = 1) for special alternatives
ARE = 0 also shows up

For the KS test: Hájek/Šidák (1967), Andel (1967):
one-sided KS test is close to the median test (via local asymptotic relative efficiency)
two-sided: Milbrodt and Strasser (1990)

Tool: the three Lemmata of Le Cam


Local comparison of asymptotic nonparametric power functions


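"Local parameters of order $1/\sqrt{n}$"; score functions $g : (0,1) \to \mathbb{R}$ (or tangents $g$)

$H_0 : \lambda|_{(0,1)} = F_0$ (after quantile transformation), where $\lambda$ denotes Lebesgue measure

\[
\frac{dF_{n,i,g}}{dF_0}(x) = 1 + c_{ni}\, g(x) + \dots, \qquad 1 \le i \le n
\]

$c_{ni} = 1/\sqrt{n}$ (one sample)

Nonparametrics: parametric submodels are still present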


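Under local alternatives $P^n_{g/\sqrt{n}}$ with $g \in L_2^0(0,1)$, where

\[
L_2^0(0,1) := \Big\{ h \in L_2(0,1) : \int_0^1 h(u)\,du = 0 \Big\}, \qquad \langle h, g \rangle = \int_0^1 h g \, d\lambda ,
\]

the normalized empirical process converges in distribution:

\[
\sqrt{n}\,\big(F_n(t) - F_0(t)\big) \xrightarrow[P^n_{g/\sqrt{n}}]{\ \mathcal{D}\ } X_0(t) = \underbrace{B_0(t)}_{\text{noise}} + \underbrace{\int_0^t g(u)\,du}_{\text{signal}}, \qquad t \in [0,1],
\]

$F_0$ continuous.

Limit experiment: signal detection problem (SDP) for $B_0$. Milbrodt and Strasser (1990)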
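As a small worked illustration of the signal term, consider the tangent $g(u) = \sqrt{3}(2u - 1)$. This choice is an assumption made here for concreteness and does not come from the talk. If the density under the alternative is exactly $1 + g/\sqrt{n}$ (which requires $n \ge 3$ so that it stays nonnegative), the drift of the normalized empirical process equals the signal for every finite $n$:

```latex
% Assumed illustrative tangent (not from the slides): g(u) = \sqrt{3}(2u - 1)
\[
\int_0^1 g(u)\,du = 0, \qquad \|g\|^2 = \int_0^1 3(2u-1)^2\,du = 1,
\]
\[
\int_0^t g(u)\,du = \sqrt{3}\,(t^2 - t) \le 0, \qquad 0 \le t \le 1 .
\]
% With density 1 + g/\sqrt{n} the distribution function is F_{n,g}(t) = t + n^{-1/2} \int_0^t g(u)\,du, hence
\[
E\big[\sqrt{n}\,(F_n(t) - t)\big] = \sqrt{n}\,\big(F_{n,g}(t) - t\big) = \sqrt{3}\,(t^2 - t).
\]
```

The random fluctuation around this drift is what converges to the Brownian bridge noise.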


Reparametrization

Testing problem for the limit experiment:

\[
H_0 : S_g \le 0 \quad \text{versus} \quad H_1 : S_g \ge 0,\ S_g \ne 0,
\]

where

\[
S_g(t) := \int_0^t g(u)\,du, \qquad 0 \le t \le 1,
\]

is the signal for $g \in L_2^0(0,1)$.


Le Cam theory: State of the art.

Convergence of the experiments, SDP limit experiment
Main theorem of testing
Convergence of power functions towards the power function of a limit test

Two-sided goodness-of-fit tests with convex acceptance regions on suitable function spaces are asymptotically admissible (Le Cam). Thus, there exists no overall asymptotically efficient goodness-of-fit test for all possible directions of alternatives with dominating power function.


Example: One-sided Kolmogorov-Smirnov-Type Test

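Let $\rho : [0,1] \to \mathbb{R}$ be a boundary function.

\[
\varphi_n^{KS} =
\begin{cases}
1 & \text{if } \sup_{t \in [0,1]} \big( \sqrt{n}\,(F_n(t) - t) - \rho(t) \big) > 0, \\
0 & \text{if } \sup_{t \in [0,1]} \big( \sqrt{n}\,(F_n(t) - t) - \rho(t) \big) \le 0.
\end{cases}
\]

The limit test for the SDP is a Kolmogorov-Smirnov-type test:

\[
\varphi^{KS} =
\begin{cases}
1 & \text{if } \sup_{t \in [0,1]} \big( X_0(t) - \rho(t) \big) > 0, \\
0 & \text{if } \sup_{t \in [0,1]} \big( X_0(t) - \rho(t) \big) \le 0,
\end{cases}
\]

with $X_0(t) = B_0(t) + \int_0^t g(u)\,du$ for $0 \le t \le 1$.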
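The finite-sample version of this one-sided test can be sketched as follows. This is an illustration under assumptions, not the speaker's implementation: the data are taken to lie in [0, 1], the supremum is approximated on the data points plus a uniform grid (exact only when the boundary is constant), and the names `one_sided_ks_type_test` and `grid_size` are hypothetical.

```python
import math
from bisect import bisect_right

def one_sided_ks_type_test(xs, rho, grid_size=1000):
    """Return (sup, decision) for sup_t (sqrt(n) * (F_n(t) - t) - rho(t)).

    The supremum is approximated on the data points and a uniform grid.
    decision is 1 (reject) if the supremum is > 0 and 0 otherwise.
    """
    xs_sorted = sorted(xs)
    n = len(xs_sorted)
    points = sorted(set(xs_sorted) | {k / grid_size for k in range(grid_size + 1)})
    sup = max(
        # bisect_right counts the observations <= t, i.e. n * F_n(t).
        math.sqrt(n) * (bisect_right(xs_sorted, t) / n - t) - rho(t)
        for t in points
    )
    return sup, int(sup > 0)
```

For a constant boundary `rho = lambda t: c` this reduces to the classical one-sided KS test with critical value `c`.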


Let $P_g$ be the distribution on $C[0,1]$ of

\[
X(t) = B_0(t) + \int_0^t g(u)\,du = B_0(t) + S_g(t), \qquad 0 \le t \le 1.
\]

Power of a test $\varphi$ for the SDP (Girsanov formula):

\[
g \mapsto E_{P_g}(\varphi) = \int \varphi \, \exp\Big( \int_0^1 g \, dB_0 - \frac{\|g\|^2}{2} \Big)\, dP_0
\]

Principal component decomposition:
integral test statistics: compact operators, Shorack/Wellner
Cramér-von Mises test statistics, Neuhaus (1976)
not for sup-statistics (KS test), Milbrodt/Strasser (1990)


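Comparison of power functions of tests $\varphi$ along a path $\vartheta \mapsto \nu_\vartheta$ of distributions

oracle test (benchmark): Neyman-Pearson test or best two-sided unbiased test

principle of local comparison at $H_0 : \vartheta = 0$

slope: $\frac{d}{d\vartheta} E_\vartheta(\varphi)\big|_{\vartheta=0}$, curvature: $\frac{d^2}{d\vartheta^2} E_\vartheta(\varphi)\big|_{\vartheta=0}$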


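Theorem (principal components of tests, J. (1995))

$H_0 = P_0$, tangent $h \in L_2^{(0)}$, ray $t \mapsto P_{th}$. There exist a gradient $h_0 \in L_2^{(0)}$ and a Hilbert-Schmidt operator $T : L_2^{(0)} \to L_2^{(0)}$ with

\[
E_{th}(\varphi) = \underbrace{E_0(\varphi)}_{\text{level } \alpha} + t\,\underbrace{\langle h, h_0 \rangle}_{\text{slope}} + \frac{t^2}{2}\,\underbrace{\langle h, T(h) \rangle}_{\text{curvature}} + o(t^2),
\]

\[
\langle h, T(h) \rangle = \sum_{i=1}^{\infty} \lambda_i \langle h, h_i \rangle^2, \qquad \|h_i\| = 1, \qquad \lambda_1 \ge \lambda_2 \ge \dots \ge 0,
\]

\[
\|h_0\|^2 + \frac{1}{2} \sum_{i=1}^{\infty} \lambda_i^2 < \alpha(1 - \alpha), \qquad \alpha = E_0(\varphi).
\]

one-sided tests: $h_0 / \|h_0\|$ is the direction with highest slope

two-sided unbiased tests: $h_0 = 0$, and $h_1$ is the direction with highest curvature
(= highest preference for $h_0$, $h_1$ respectively)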


Results for one- and two-sided KS-type tests

one-sided gradient $h_0^{(a,b)}$ (approximately), J./Kunz (2002), Rahnenführer (2003)

$h_0^{(a,a)} \sim \operatorname{sign}(2u - 1)$, the score function of the median test


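one-sided $\varphi$: local asymptotic relative efficiency in direction $h$

\[
\mathrm{ARE}_L(\varphi, h) = \left( \frac{\langle h, h_0 \rangle}{\|h\| \, f(u_{1-\alpha})} \right)^{2} \qquad \text{if } \langle h, h_0 \rangle \ge 0,
\]

where $f$ is the $N(0,1)$ density and $u_{1-\alpha}$ its $(1-\alpha)$-quantile.

$\mathrm{ARE}_L = 1$ iff $\varphi$ is the Neyman-Pearson test (familiar Pitman interpretation of ARE)

numerical result for $\alpha = 0.05$:

\[
\mathrm{ARE}_L\big( \varphi_{KS,\text{one-sided}},\, h^{(a,a)} \big) \approx 0.715
\]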
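The efficiency formula can be evaluated numerically once a gradient is given. The sketch below is only an illustration: the gradient used in the example, f(u_{1-α})·sign(2u − 1), is an assumed median-test-type gradient chosen here for concreteness (it is not the KS gradient from the talk), and the helper names `inner` and `arel` are hypothetical. Inner products on (0, 1) are approximated by the midpoint rule.

```python
import math
from statistics import NormalDist

def inner(f, g, m=10000):
    """Midpoint-rule approximation of the L2(0,1) inner product <f, g>."""
    return sum(f((k + 0.5) / m) * g((k + 0.5) / m) for k in range(m)) / m

def arel(h, h0, alpha, m=10000):
    """Local asymptotic relative efficiency of a one-sided test with gradient h0
    in direction h; defined only when <h, h0> >= 0."""
    ip = inner(h, h0, m)
    if ip < 0:
        raise ValueError("ARE_L is only defined for <h, h0> >= 0")
    nd = NormalDist()
    dens = nd.pdf(nd.inv_cdf(1.0 - alpha))  # f(u_{1-alpha})
    return (ip / (math.sqrt(inner(h, h, m)) * dens)) ** 2

# Assumed example gradient: f(u_{1-alpha}) * sign(2u - 1).
ALPHA = 0.05
_nd = NormalDist()
DENS = _nd.pdf(_nd.inv_cdf(1.0 - ALPHA))

def sign_direction(u):
    return 1.0 if u > 0.5 else -1.0

def h0_example(u):
    return DENS * sign_direction(u)

def linear_direction(u):
    return u - 0.5
```

With these assumptions, `arel(sign_direction, h0_example, ALPHA)` is 1 (the test is efficient in its own preferred direction), while `arel(linear_direction, h0_example, ALPHA)` is about 0.75.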


unbiased two-sided GOF tests: gradient $h_0 = 0$

first principal component $h_1$ of the Hilbert-Schmidt operator: $h_1 \approx h_0^{(a,b)}$, J. (1995), Rahnenführer (2003).


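A. Connection to the score function of location families $(x, \vartheta) \mapsto f(x - \vartheta)$

score function $h \uparrow$ (nondecreasing): there exists a location family with score function

\[
-\frac{f'(F^{-1}(u))}{f(F^{-1}(u))} = h(u), \qquad \text{Hájek/Šidák (1967)}.
\]

For $h_0^{(a,b)}$:

\[
f_{(a,b)}(x) = \frac{1}{a+b} \left[ \exp\Big(\frac{x}{a}\Big) 1_{(-\infty,0]}(x) + \exp\Big(-\frac{x}{b}\Big) 1_{(0,\infty)}(x) \right], \qquad \text{J./Ünlü (2008)}
\]

$f_{(a,a)}$: double exponential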
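The link between the density f_(a,b) and a two-valued score function can be checked numerically. The sketch below is an illustration with hypothetical names (`density`, `quantile`, `score`); it evaluates −f'(F⁻¹(u))/f(F⁻¹(u)) with a central difference quotient and assumes that u is not too close to the jump point a/(a+b).

```python
import math

def density(x, a, b):
    """Asymmetric double exponential density f_(a,b)."""
    return (math.exp(x / a) if x <= 0 else math.exp(-x / b)) / (a + b)

def quantile(u, a, b):
    """Inverse distribution function; F(0) = a / (a + b)."""
    p0 = a / (a + b)
    if u <= p0:
        return a * math.log(u * (a + b) / a)
    return -b * math.log((1.0 - u) * (a + b) / b)

def score(u, a, b, eps=1e-6):
    """Location score -f'(F^{-1}(u)) / f(F^{-1}(u)) via a central difference.

    Not reliable within eps of the kink of the density at x = 0.
    """
    x = quantile(u, a, b)
    deriv = (density(x + eps, a, b) - density(x - eps, a, b)) / (2.0 * eps)
    return -deriv / density(x, a, b)
```

For a = 1 and b = 2 the score is −1/a = −1 below u = 1/3 and 1/b = 0.5 above it; for a = b it is proportional to sign(2u − 1), the score function of the median test.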


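B. Connection to differentiable statistical functionals $\kappa$

Koshevnik and Levit (1976)
Pfanzagl, Wefelmeyer (1982)
Bickel, Klaassen, Ritov, Wellner (1993)

$\kappa : \mathcal{P} \to \mathbb{R}$ (differentiable functional)

\[
\frac{d}{d\vartheta} \kappa(P_\vartheta)\Big|_{\vartheta=0} = \langle g, \dot{\kappa} \rangle, \qquad \dot{\kappa} \text{ canonical gradient (preference of } \kappa \text{)}
\]

quantile function: $\kappa(F) = F^{-1}\big(\tfrac{a}{a+b}\big)$, $\dot{\kappa}(x) = h^{(a,b)} \circ F(x)$

Conclusion: the KS-type test is a test for $F \mapsto F^{-1}\big(\tfrac{a}{a+b}\big)$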


Global nonparametric power functions are almost flat


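SDP: $B_0(t) + \int_0^t g(u)\,du$, $0 \le t \le 1$, with distribution $P_g$

$H_0 : P_0$

Theorem J. (2000) (almost flat global power)

Let $\varphi$ be a test for the SDP $\{P_g : g \in L_2^0(0,1)\}$ with $E_{P_0}(\varphi) = \alpha$. For all $\varepsilon > 0$ and $K > 0$ there exists a subspace $V \subset L_2^0(0,1)$ of finite dimension with

\[
\sup\{ |E_{P_h}(\varphi) - \alpha| : h \in V^{\perp},\ \|h\| \le K \} \le \varepsilon.
\]

Moreover,

\[
\dim(V) - 1 \le \varepsilon^{-1} \alpha (1 - \alpha) \big( \exp(K^2) - 1 \big).
\]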
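To get a feeling for the size of the exceptional subspace V, the dimension bound can simply be evaluated. This is a minimal sketch; the function name `dimension_bound` and the example values are chosen here for illustration only.

```python
import math

def dimension_bound(eps, alpha, K):
    """Upper bound 1 + alpha * (1 - alpha) * (exp(K^2) - 1) / eps for dim(V)."""
    return 1.0 + alpha * (1.0 - alpha) * (math.exp(K * K) - 1.0) / eps

# Example: level 0.05, tolerance 0.01, alternatives with norm at most 1.
bound = dimension_bound(eps=0.01, alpha=0.05, K=1.0)
max_dim = math.floor(bound)  # dim(V) is an integer, so it is at most floor(bound)
```

In this example the bound is about 9.16, so outside a subspace of dimension at most 9 the power stays within 0.01 of the level for all alternatives with norm at most 1. The bound grows like exp(K²), so it is informative mainly for moderate K.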


already contained in the textbook Lehmann and Romano (2005), Chap. 14.6, Testing Statistical Hypotheses, third edition
also holds uniformly in the sample size $n$ (J. (2003))
trick: $C[0,1]$ and $\mathbb{R}^{\mathbb{N}}$ are Borel-isomorphic as Polish spaces

Explanation: Nonparametric asymptotic power functions of goodness-of-fit tests are almost flat. The statistician has a "total amount of power" which can be distributed along the orthonormal directions. Each test distributes the power in a different way.
Dual description via level points of GOF tests


Tests for hazard rates


Gaussian shift experiment: SDP with Brownian motion $B$

\[
X(t) := B(t) + \int_0^t h(u)\,du, \qquad 0 \le t \le 1
\]

limit model for hazard rates $\lambda(t) = \frac{f(t)}{1 - F(t)}$:

\[
\frac{\lambda_\vartheta}{\lambda}(u) - 1 = \frac{h(u)}{\sqrt{n}} + \dots
\]

the same result holds for global power functions


KS-type test for the SDP for Brownian motion:

\[
\varphi = 1\Big( \sup_{t \le 1} X_t \ge c \Big) \qquad \text{(Rényi test)}
\]

Rahnenführer (2003): $h_0 \approx$ constant, "close to the log-rank test (Savage test)"

Related to the K-transformation of Khmaladze (1981, 1983)

$F_n = M_n + A_n$, $M_n$ martingale, $A_n$ compensator

"tests based on $M_n$ instead of $F_n$" = testing hazards instead of densities

Two-sided: Drees/Milbrodt (1994)
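For the one-sided test that rejects when the supremum of X over [0, 1] reaches c, the critical value under the null (no drift) follows from the reflection principle, P(sup B(t) ≥ c) = 2(1 − Φ(c)) for c > 0. A minimal sketch with illustrative function names:

```python
from statistics import NormalDist

def renyi_null_rejection_probability(c):
    """P(sup_{t<=1} B(t) >= c) = 2 * (1 - Phi(c)) for c > 0 (reflection principle)."""
    return 2.0 * (1.0 - NormalDist().cdf(c))

def renyi_critical_value(alpha):
    """Critical value c with null rejection probability alpha, for 0 < alpha < 1."""
    return NormalDist().inv_cdf(1.0 - alpha / 2.0)
```

At level 0.05 this gives c ≈ 1.96, the two-sided normal quantile, even though the test itself is one-sided.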


Recommendation in practice


Ask the scientist about his preferences for alternatives in terms of statistical functionals (median, quantiles, interquartile distance, ...)
calculate the canonical gradients of the functionals, $g_1, \dots, g_k$ (= preferred directions of the functionals)
select a GOF test with high power on the subspace of alternatives $\operatorname{span}(g_1, \dots, g_k)$ or

one-sided: likelihood ratio test for the cone $\big\{ \sum_{i=1}^{k} \beta_i g_i : \beta_i \ge 0 \big\}$
two-sided: $h_1, \dots, h_d$ O.N. basis of $\operatorname{span}(g_1, \dots, g_k)$, Neyman's smooth test

\[
T_n = \sum_{j=1}^{d} \left( \frac{1}{\sqrt{n}} \sum_{i=1}^{n} h_j(x_i) \right)^{2}.
\]
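Neyman's smooth test statistic is straightforward to compute once an orthonormal basis is fixed. The sketch below is an illustration under assumptions: it uses the first two normalized shifted Legendre polynomials on [0, 1] as basis functions (in practice the basis would come from the canonical gradients of the chosen functionals), and the p-value uses the asymptotic chi-square distribution with d = 2 degrees of freedom, whose survival function is exp(−t/2). All function names are hypothetical.

```python
import math

def h1(u):
    """First normalized shifted Legendre polynomial on [0, 1]."""
    return math.sqrt(3.0) * (2.0 * u - 1.0)

def h2(u):
    """Second normalized shifted Legendre polynomial on [0, 1]."""
    return math.sqrt(5.0) * (6.0 * u * u - 6.0 * u + 1.0)

def neyman_smooth_statistic(xs, basis):
    """T_n = sum_j ( n^{-1/2} * sum_i h_j(x_i) )^2 for data xs on [0, 1]."""
    n = len(xs)
    return sum((sum(h(x) for x in xs) / math.sqrt(n)) ** 2 for h in basis)

def asymptotic_p_value_d2(t):
    """Survival function of the chi-square distribution with 2 degrees of freedom."""
    return math.exp(-t / 2.0)
```

A call such as `neyman_smooth_statistic(data, [h1, h2])` concentrates power on alternatives in the span of the two basis directions, at the price of low power in directions orthogonal to them.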


Making two-sample GOF tests distribution free if ties are present


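two i.i.d. samples, $n = n_1 + n_2$, $\min(n_1, n_2) \to \infty$:

$X_1, \dots, X_{n_1}$ with edf $G_n$ and $X_{n_1+1}, \dots, X_n$ with edf $H_n$

Theorem (J. 1994)
Under the null $H_0 : X_i \sim F$ i.i.d.,

\[
T_n := \Big( \frac{n_1 n_2}{n} \Big)^{1/2} \sup_{t \in \mathbb{R}} \big| G_n(t) - H_n(t) \big| \xrightarrow[n \to \infty]{\ \mathcal{D}\ } \sup_{t \in \mathbb{R}} \big| B_0(F(t)) \big|
\]

"depends on $F$ for tied data."
works also for other norms on $D[0,1]$.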


How to get critical values at finite sample size?

Fix $X_1, \dots, X_n$ and use the permutation test. Let

\[
\sigma : \{1, \dots, n\} \to \{1, \dots, n\}
\]

be a random, uniformly distributed permutation, independent of the $X$'s.
Critical values are taken from the permutation distribution

\[
\sigma \mapsto T_n\big( X_{\sigma(1)}, X_{\sigma(2)}, \dots, X_{\sigma(n)} \big).
\]

Result: The power of the permutation test is equal to the power of the unconditional test that could be used if $F$ under $H_0$ were known.
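The permutation approach can be sketched as follows. This is a Monte Carlo approximation (random permutations instead of all n! of them) written for illustration; the function names, the number of permutations and the seed are assumptions, not part of the talk. Ties are handled naturally because both empirical distribution functions are evaluated with "≤".

```python
import math
import random
from bisect import bisect_right

def two_sample_ks(xs, ys):
    """T_n = sqrt(n1 * n2 / n) * sup_t |G_n(t) - H_n(t)|; ties are allowed."""
    n1, n2 = len(xs), len(ys)
    xs_sorted, ys_sorted = sorted(xs), sorted(ys)
    # Both edfs are right-continuous step functions, so the sup is attained at a data point.
    sup = max(
        abs(bisect_right(xs_sorted, t) / n1 - bisect_right(ys_sorted, t) / n2)
        for t in set(xs) | set(ys)
    )
    return math.sqrt(n1 * n2 / (n1 + n2)) * sup

def permutation_p_value(xs, ys, num_perm=2000, seed=0):
    """Monte Carlo permutation p-value for the two-sample KS statistic."""
    rng = random.Random(seed)
    pooled = list(xs) + list(ys)
    n1 = len(xs)
    t_obs = two_sample_ks(xs, ys)
    exceed = 0
    for _ in range(num_perm):
        rng.shuffle(pooled)  # uniformly distributed random permutation of the pooled sample
        if two_sample_ks(pooled[:n1], pooled[n1:]) >= t_obs - 1e-12:
            exceed += 1
    # Counting the observed arrangement itself keeps the p-value strictly positive.
    return (1 + exceed) / (1 + num_perm)
```

Because the reference distribution is computed conditionally on the pooled sample, no knowledge of F is needed, which is the point of the slide when ties make the limit distribution depend on F.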


References


Andel, J., 1967. Local asymptotic power and efficiency of tests of Kolmogorov-Smirnov type. Ann. Math. Stat. 38, 1705–1725.

Bickel, P.J., Klaassen, C.A.J., Ritov, Y., Wellner, J.A., 1993. Efficient and Adaptive Estimation for Semiparametric Models. Johns Hopkins University Press, Baltimore.

Drees, H., Milbrodt, H., 1994. The one-sided Kolmogorov-Smirnov test in signal detection problems with Gaussian white noise. J. Stat. Plann. Inference 29, 325–335.

Hájek, J., Šidák, Z., 1967. Theory of rank tests. Academic Press, New York.

Janssen, A., 1994. Two-sample goodness-of-fit tests when ties are present. J. Stat. Plann. Inference 39, 399–424.

Janssen, A., 1995. Principal component decomposition of non-parametric tests. Probab. Theory Related Fields 101, 193–209.

Janssen, A., 1999. Testing nonparametric statistical functionals with applications to rank tests. J. Stat. Plann. Inference 81, 71–93. Erratum (2001), 92, 297.

Janssen, A., 2000. Global power functions of goodness of fit tests. Ann. Statist. 28, 239–253.

Janssen, A., 2003. Which power of goodness of fit tests can really be expected: Intermediate versus contiguous alternatives. Statist. Decisions 21, 301–325.

Janssen, A., Kunz, M., 2002. Global extrapolations for power functions of one-sided nonparametric tests. Stat. Decis. 20, 153–176.


Janssen, A., Kunz, M., 2004. Brownian Type Boundary Crossing Probability for Piecewise Linear Boundary Functions. Commun. Statist. - Theory Meth. 33, 1445–1464.

Janssen, A., Ünlü, H., 2008. Regions of alternatives with high and low power for goodness-of-fit tests. J. Stat. Planning Inference 138, 2526–2543.

Khmaladze, É.V., 1981. A martingale approach in the theory of goodness-of-fit-tests. (Russian) Teor. Veroyatnost. i Primenen. 26, 246–265.

Khmaladze, É.V., 1983. Martingale limit theorems for decomposable statistics. (Russian) Teor. Veroyatnost. i Primenen. 28, 504–520.

Koshevnik, Y.A., Levit, B.Y., 1976. On a nonparametric analogue of the information matrix. Theory Probab. Appl. 21, 738–753.

Lehmann, E.L., Romano, J.P., 2005. Testing Statistical Hypotheses. 3rd edition, Springer, New York.

Milbrodt, H., Strasser, H., 1990. On the asymptotic power of the two-sided Kolmogorov-Smirnov test. J. Statist. Plann. Inference 26, 1–23.

Neuhaus, G., 1976. Asymptotic power properties of the Cramér-von Mises test under contiguous alternatives. J. Multivariate Anal. 6, 95–110.

Pfanzagl, J., Wefelmeyer, W., 1982. Contributions to a general asymptotic statistical theory. Lecture Notes in Statistics 13, Springer, Berlin.

Rahnenführer, J., 2003. On preferences of general two-sided tests with applications to Kolmogorov-Smirnov-type tests. Statist. Decisions 21, 149–170.

Shorack, G.R., Wellner, J.A., 1986. Empirical processes with applications to statistics. John Wiley & Sons, New York.

Strasser, H., 1985. Mathematical Theory of Statistics. De Gruyter Studies in Mathematics.


Thank you for your attention!