Type-II errors of independence tests can lead to ... fileType-II errors of independence tests can...

Type-II errors of independence tests can leadto arbitrarily large errors in estimated causal

effects: an illustrative exampleWorkshop UAI 2014

Nicholas Cornia & Joris M. Mooij

University of Amsterdam

27/07/2014

1 Problem Setting

2 Estimation of the causal effect error form the observedcovariance matrix

3 Discussion

4 Conclusions and future work

1 Problem Setting

3 Discussion

Introduction

Task: Inferring causation from observational data

Challenge: Presence of hidden confounders.

Approach: Causal discovery algorithms based onconditional independence (CIs) tests .

Simplest case: Three random variables, a single CI test(LCD-Trigger setting).

Contribution: Causal predictions are extremely unstablewhen type II errors arise.

LCD-Trigger Algorithm

Cooper (1997) and Chen et al. (2007).The following causal model

X1 X2 X3

is implied by

Prior assumptionsNo Selection BiasAcyclicityFaithfulnessX2,X3 do not cause X1

Statistical testsX1 6⊥⊥ X2

X2 6⊥⊥ X3

X1 ⊥⊥ X3|X2

Application of the LCD in biology

Example

Gene expression

SNP︸︷︷︸Single Nucleotide Polymorphism

G︸︷︷︸Gene expression level

P︸︷︷︸Phenotype

Example

Disease Treatment

X︸︷︷︸Gender

Y︸︷︷︸Disease 1

Z︸︷︷︸Disease 2

Linear Gaussian model

For simplicity: linear-Gaussian case.Structural equations:

Xi =∑i 6=j

αijXj + Ei X = AX + E

E ∼ N(0,∆

)∆ = diag

and A = {αij} is the weightedadjacency matrix of the causalgraph (αij 6= 0 ⇐⇒ Xi → Xj ).

Example

X1 X2 X3α12 α23

X1 = E1

X2 = α12X1 + E2

X3 = α23X2 + E3

Then:X ∼ N

(0,Σ) Σ = Σ(A,∆)

Causal effect estimator

Causal effect of X2 on X3:

A 3 α23 =∂

∂x2E(X3|do(X2 = x2)

) Under the LCD assumptions

E(X3|X2

is a valid estimator for thecausal effect of X2 on X3.

Example

Structural equations(observed)

X1 = E1

X2 = α12X1 + E2

X3 = α23X2 + E3

Structural equations afteran intervention

X1 = E1

X2 = x2

X3 = α23x2 + E3

Fundamental question

What happens to the error in the causal effect estimator ifin reality there is a weak dependence X1 6⊥⊥ X3|X2, but wedo not have enough data to detect it?

Type II error: Erroneously accepting the null hypotesis ofindependence in the statistical test X1 ⊥⊥ X3|X2. Can westill guarantee some kind of bound for the distance

|E(X3|X2

)− E

(X3|do(X2)

From LCD to our model

Starting from the chain

X1 X2 X3 X1 ⊥⊥ X3|X2

If we consider a possible weak dependence not detected by ourtest suddenly the causal graph gains complexity

X1 X2 X3

X1 6⊥⊥ X3|X2

where X4 is a confounding variable between X2 and X3.

True model

X1 X2 X3

Prior assumptionsNo Selection BiasAcyclicityFaithfulnessX2,X3 do not cause X1

No confounders between X1 andX2, or X3, or both (for simplicity)

Statistical testsX1 6⊥⊥ X2

X2 6⊥⊥ X3

A weak conditionaldependence X1 6⊥⊥ X3|X2

Causal effect estimation error function

Belief

X1 X2 X3α23

α23 =Σ32

True model

X1 X2 X3

α23 6=Σ32

Error in the causal effect estimation function

g(A,Σ

Σ22− α23

1 Problem Setting

3 Discussion

Constraint equations

Proposition

There exists a mapΦ : (A,∆) → Σ

from the model parameters to the observed covariance matrixthat defines a set of polynomial equations.

From a geometrical point of view, given Σ

(A,∆) ∈M ⊂ R9

Non-identification of the model parameters

In our model the map Φ is not injective. Thus, the manifoldM does not reduce to a single point.

Φ−1 =?

Nevertheless it is still an interesting question whetherthe function g is a bounded function onM or not.

Main result

TheoremThere exists a map

Ψ(Σ, δ22 , δ

23 , s1, s2) = A

where s1, s2 are two signs and the δ22 , δ

23 are the variance of the noise

sources of X2 and X3 respectively.

Corollary

It is possible to express the error in the causal effect estimation function g as

g(Σ,Ψ(Σ, δ2

2 , δ23 , s1, s2)

ϑΣ12

mΣ22︸︷︷︸small for weak dep.

+ s1s2

√det Σ−mδ2

√m − Σ11δ2

m√δ2

2︸︷︷︸arbitrarily large

where ϑ = Σ13Σ22 − Σ12Σ23 and m = Σ11Σ22 − Σ212.

Approaching the singularity

Proposition

limδ2

2→0|g| = +∞

∀ δ23 ∈ [0, det Σ/m] (s1, s2) ∈ {−1, 1}2

1 Problem Setting

3 Discussion

Probabilistic estimation of the error

(δ22 , δ

23) ∈ D(Σ) ⊂ R2

MM = {(δ22 , δ

23) : |g| ≤ M}

If we put a uniform prior on thenoise variances

Pr(|g| ≤ M) =||MM ||||D(Σ)||

What would be a reasonable prior distribution for δ22 , δ

Looking for an approximate bound

The causal effect error function g can be optimized over the δ23

parameters, giving a confidence interval for the causal weightα23

α23 ∈ [b−,b+] ⊂ R

b±(δ22) =

√det Σ

√m − Σ11δ

m√δ2

Looking for an approximate bound

Suppose we would have a lower bound

δ22 ≥ δ̂2

then this implies an upper bound on |g|.

What would be a practical example where we can assumesuch a lower bound for the variance δ2

1 Problem Setting

3 Discussion

Conclusions

The causal effect estimation error is sensible to erroneousconclusions in conditional independence tests.

The result is in accord with Robins et al. (2003), on thelack of uniform consistency of causal discovery algorithms,but through this paper we wish to emphasize this issue onthe more practical matter of type II errors.

In our case it was not possible to identify the modelparameters explicitly.

Proposal for future work

Bayesian model selection: What would be a reasonableprior distribution for the model parameters?

Bayesian Information Criterion: Will the BIC still givereasonable results even though the model parameters arenot identifiable? Could it deal with irregular or evensingular models?

Proposal for future work

Adding an “environment” variable: Might it bereasonable to assume that a part, or most, of the externalvariability is carried by the covariance between theenvironment variable W and the other measured ones,including possible confounders?

X1 X2 X3

Thanks for your attention!

Type-II errors of independence tests can lead to ... fileType-II errors of independence tests can...

Documents

Transcript of Type-II errors of independence tests can lead to ... fileType-II errors of independence tests can...

Stat 512 – Day 8 Tests of Significance (Ch. 6). Last Time Use random sampling to eliminate sampling errors Use caution to reduce nonsampling errors Use.

Analysis of Count Data Chapter 26 Goodness of fit Formulas and models for two-way tables - tests for independence - tests of homogeneity.

SOME EXACT CONDITIONAL TESTS OF …aa/articles/agresti_wackerly.pdfPSYCHOMETRIKA--VOL. 42, NO. 1 MARCH, 1977 SOME EXACT CONDITIONAL TESTS OF INDEPENDENCE FOR, R × C CROSS-CLASSIFICATION

Goodness of Fit Tests: Independence

Analysis of Count Data Chapter 24 Goodness of fit Formulas and models for two-way tables - tests for independence - tests of homogeneity.

Some exact conditional tests of independence for

Analysis of Count Data Chapter 14 Goodness of fit Formulas and models for two-way tables - tests for independence - tests of homogeneity.

Session IV: Contingency Tables Tests of Association and Independence (Zar, Chapters 23, 24.10)

Testing Independence Conditions in the Presence of Errors ...psych.fullerton.edu/mbirnbaum/papers/Birnbaum... · Testing Independence Conditions in the Presence of Errors and Splitting

1 Contingency Tables: Tests for independence and homogeneity (§10.5) How to test hypotheses of independence (association) and homogeneity (similarity)

Contingency Tables For Tests of Independence

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242.

2 Test of Independence. Hypothesis Tests Categorical Data.

The SURVEYFREQ Procedure - SAS Support · PROC SURVEYFREQ provides design-adjusted tests of independence, or no association, between the row and column variables. These tests include

IE 303 Discrete-Event SimulationTests for Uniformity Kolmogorov, Smirnov Tests Chi-square Tests Test for Statistical Independence from History ... Chi-square Tests Test for Statistical

Errors Requesting appropriate tests Writing prescription Reading prescription Sample collection Sampling times environmental factors Drug interferences.

Diagnostic Tests of Cross Section Independence for ...ftp.iza.org/dp2756.pdf · Diagnostic Tests of Cross Section Independence for Nonlinear Panel Data Models Cheng Hsiao University

Chi-Square Tests Chi-Square Tests Chapter1414 Chi-Square Test for Independence Chi-Square Tests for Goodness-of-Fit Copyright © 2010 by The McGraw-Hill.

Tests of Homogeneity and Independence

Consistent Nonparametric Tests of Independencejmlr.csail.mit.edu/papers/volume11/gretton10a/gretton10a.pdfCONSISTENT NONPARAMETRIC TESTS OF INDEPENDENCE on the notion of differences