Hypothesis Testing

Rianne de Heide

CWI & Leiden University

April 6, 2018

PhD student, Machine Learning Group

Peter Grunwald

Study coordinator, M.Sc. Statistical Science

Jacqueline Meulman

Projects

- Bayesian inference under model misspecification (SafeBayes)

- Foundations of Bayesianism

- Hypothesis testing (frequentist, Bayesian, new methods)

Peter Tom Wouter Allard

Frequentist vs. Bayesian probability

Different views on probability:

- Frequentists: frequency in the long run

- Bayesians: degree of belief

Hypothesis Testing

Example:

- Null hypothesis $H_0$: $X^n \sim N(0, \sigma)$

- Alternative hypothesis $H_1$: $X^n \sim N(\theta, \sigma)$, $\theta \neq 0$

Different types of hypothesis tests

- p-value based null hypothesis significance tests (frequentist)

- Bayes factor hypothesis tests (about the probability of $H_0$ given the data). Limitations: see De Heide and Grunwald (2018)

- Test martingales and S-tests (current work)

Optional stopping

Optional stopping: ‘peeking at the results so far to decide whether or not to gather more data’

→ reproducibility crisis in the life and behavioural sciences (far too many false positives)

Frequentist Optional Stopping and p-values (1)

- Data $X^n = X_1, X_2, \ldots, X_n$; $X_i \in \mathcal{X}$

- Hypothesis test $T_n : \mathcal{X}^n \to \{0, 1\}$

- Significance level $\alpha$

Frequentist optional stopping

A sequence of hypothesis tests $T_n : \mathcal{X}^n \to \{0, 1\}$ with significance level $\alpha$ is said to be robust under frequentist optional stopping if

$$P_{H_0}(\exists n : T_n(X^n) = 1) \le \alpha.$$

Example:

Data $X^n$ i.i.d. $\sim N(\theta, 1)$

$H_0$: $\theta = 0$

$H_1$: $\theta \neq 0$

Test statistic: $Z_n = \bar{X}\sqrt{n}$

p-value: $p = 2(1 - \Phi(|Z_n|))$

Stopping rule: continue until $|Z_n| > k$, then stop (with $\alpha = 0.05$, $k = 1.96$).
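
The test statistic and p-value in this example can be computed with the standard library alone. The following is a minimal sketch, assuming unit-variance data; the function names and the sample values are hypothetical and chosen only for illustration.

```python
import math

def z_statistic(xs):
    """Z_n = sqrt(n) * sample mean, for unit-variance data under H0: theta = 0."""
    n = len(xs)
    return math.sqrt(n) * (sum(xs) / n)

def std_normal_cdf(z):
    """Standard normal CDF Phi(z), written in terms of the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def two_sided_p_value(z):
    """p = 2 * (1 - Phi(|z|))."""
    return 2.0 * (1.0 - std_normal_cdf(abs(z)))

# Hypothetical sample, for illustration only.
sample = [0.5, -0.2, 1.1, 0.7]
z = z_statistic(sample)
p = two_sided_p_value(z)
print(z, p)
```

At the threshold k = 1.96 the two-sided p-value is approximately 0.05, which is why the stopping rule pairs k = 1.96 with α = 0.05.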

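Law of the iterated logarithm (LIL) for $X_1, X_2, \ldots$ i.i.d. standard normal:

$$\limsup_{n \to \infty} \frac{\sum_{i=1}^{n} X_i}{\sqrt{2 n \log\log n}} = 1 \quad \text{a.s.}$$

This gives us: $Z_n > \lambda \sqrt{2 \log\log n}$ infinitely often with probability 1, for $\lambda < 1$. Since $\sqrt{2 \log\log n} \to \infty$, $|Z_n|$ eventually exceeds any fixed $k$, so this stopping rule rejects a true $H_0$ with probability 1.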
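
The effect of this stopping rule can be illustrated by simulation. The sketch below draws data under $H_0$, checks $|Z_n| > 1.96$ after every observation, and records how often the rule rejects within a maximum sample size. The horizons, run count, and seed are arbitrary illustrative choices, not values from the slides.

```python
import math
import random

def rejects_with_peeking(rng, max_n, k=1.96):
    """Draw N(0,1) data one at a time; return True as soon as |Z_n| > k."""
    total = 0.0
    for n in range(1, max_n + 1):
        total += rng.gauss(0.0, 1.0)
        # Z_n = sqrt(n) * mean = sum / sqrt(n)
        if abs(total) / math.sqrt(n) > k:
            return True
    return False

def false_positive_rate(max_n, runs=1000, seed=0):
    """Fraction of simulated H0 data sets that are (wrongly) rejected."""
    rng = random.Random(seed)
    return sum(rejects_with_peeking(rng, max_n) for _ in range(runs)) / runs

# Estimated type I error for several maximum sample sizes.
rates = {max_n: false_positive_rate(max_n) for max_n in (1, 10, 100, 1000)}
for max_n, rate in rates.items():
    print(max_n, rate)
```

With a single look the estimated rate should sit near α = 0.05. As the number of looks grows it rises far above α, consistent with the LIL argument that the rule eventually rejects with probability 1.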

Frequentist Optional Stopping and Bayes Factors

Can Bayes factor hypothesis tests handle frequentist optional stopping? (Bayesian statisticians suggest they can...)

- Not always (see Sanborn and Hills (2014); De Heide and Grunwald (2018))

- Yes, if $H_0$ is simple

- Yes, with composite $H_0$ and certain group invariance structure (see Hendriksen, De Heide and Grunwald (2018)). Example: Bayesian t-test
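
The simple-$H_0$ case can be checked numerically. The sketch below assumes $H_0$: $N(0, 1)$ against $H_1$: $N(\theta, 1)$ with a $N(0, 1)$ prior on $\theta$; this prior is an illustrative assumption, not taken from the slides. Under it the Bayes factor has the closed form $(n+1)^{-1/2} \exp(S_n^2 / (2(n+1)))$ with $S_n = \sum_i X_i$. Because $H_0$ is simple, the Bayes factor is a non-negative martingale under $H_0$ with initial value 1, so Ville's inequality bounds the probability of ever reaching $1/\alpha$ by $\alpha$.

```python
import math
import random

def log_bayes_factor(total, n):
    """log BF_10 for H0: N(0,1) vs H1: N(theta,1), theta ~ N(0,1) (assumed prior).

    `total` is the running sum S_n of the first n observations.
    """
    return -0.5 * math.log(n + 1) + total * total / (2.0 * (n + 1))

def ever_rejects(rng, max_n, threshold):
    """Peek after every observation; reject once the Bayes factor reaches the threshold."""
    total = 0.0
    log_threshold = math.log(threshold)
    for n in range(1, max_n + 1):
        total += rng.gauss(0.0, 1.0)  # data generated under H0
        if log_bayes_factor(total, n) >= log_threshold:
            return True
    return False

alpha = 0.05
runs = 2000
rng = random.Random(1)
bf_rate = sum(ever_rejects(rng, 500, 1.0 / alpha) for _ in range(runs)) / runs
print(bf_rate)
```

Up to simulation noise, the estimated rejection frequency should stay at or below α however often we peek, in contrast to the p-value stopping rule.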

Test Martingales (1)

The Test Martingale (Vovk et al., 2011):

A non-negative martingale $\{M_n\}_{n \in \mathbb{N}}$ with initial value $M_0 = 1$.

Example:

$H_0$: the coin is fair, i.e. the outcomes $X_1, X_2, \ldots$ are i.i.d. Bernoulli(0.5)

Start with capital $K_0 = 1$

At each round $i$:

- We invest a fraction $q_i$ of our capital on the outcome $\{X_i = 0\}$ and $1 - q_i$ on $\{X_i = 1\}$

- The outcome is revealed

- We get a pay-off: twice our stake on the correct outcome, nothing on the other

After $N$ rounds: $K_N = 2^N \prod_{t=1}^{N} q(X_t \mid X^{t-1})$, where $q(X_t \mid X^{t-1})$ is the fraction staked on the realised outcome $X_t$ given the past $X^{t-1}$.

- If $H_0$ is true, the bet is fair, i.e. $E[K_N] \le K_0$

- If $K_N$ is large, it is an indication that $H_0$ is not true.

- Test martingales can handle frequentist optional stopping.
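
The betting game can be sketched in a few lines. The betting strategy below (Laplace's rule of succession) is a hypothetical choice for illustration; the slides do not prescribe one. The simulation estimates how often the capital ever reaches $1/\alpha = 20$, once for a fair coin and once for a biased coin. By Ville's inequality, the fair-coin frequency is bounded by $\alpha$ for any strategy.

```python
import random

def capital_path(outcomes, bet_on_zero):
    """Capital K_0, K_1, ..., K_N in the coin-betting game, starting from K_0 = 1.

    `bet_on_zero(history)` returns the fraction q of capital staked on outcome 0.
    """
    capital = 1.0
    path = [capital]
    history = []
    for x in outcomes:
        q = bet_on_zero(history)
        stake = q if x == 0 else 1.0 - q
        capital *= 2.0 * stake  # twice the stake on the realised outcome
        path.append(capital)
        history.append(x)
    return path

def laplace_bettor(history):
    """Hypothetical strategy: stake on 0 the Laplace estimate of P(X = 0)."""
    return (history.count(0) + 1) / (len(history) + 2)

def crossing_rate(p_one, rounds, runs, seed, threshold=20.0):
    """Fraction of runs in which the capital ever reaches the threshold."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(runs):
        outcomes = [1 if rng.random() < p_one else 0 for _ in range(rounds)]
        if max(capital_path(outcomes, laplace_bettor)) >= threshold:
            hits += 1
    return hits / runs

fair_rate = crossing_rate(p_one=0.5, rounds=200, runs=2000, seed=2)
biased_rate = crossing_rate(p_one=0.8, rounds=200, runs=2000, seed=2)
print(fair_rate, biased_rate)
```

For the fair coin the capital should rarely reach 20, even though we check it after every round. For the biased coin it should reach 20 in almost every run, which is the sense in which a large $K_N$ is evidence against $H_0$.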

S-test statistics

Test martingales can handle frequentist optional stopping and have additional advantages over p-values (interpretability)

Problem: how to construct test martingales for composite $H_0$.

Current work: S-value

References

P. Grunwald, R. de Heide, and W. Koolen. Safe tests. Work in progress, 2018.

R. de Heide and P. Grunwald. Why optional stopping is a problem for Bayesians. arXiv preprint arXiv:1708.08278, 2018.

A. Hendriksen, R. de Heide, and P. Grunwald. Folklore results on optional stopping of Bayesian methods made precise. Work in progress, 2018.

A. Sanborn and T. Hills. The frequentist implications of optional stopping on Bayesian hypothesis tests. Psychonomic Bulletin & Review, 21(2):283–300, 2014.

V. Vovk, N. Vereshchagin, A. Shen, and G. Shafer. Test martingales, Bayes factors and p-values. Statistical Science, 26(1):84–101, 2011.
