Bonferonni correction+

Bonferonni correction+

Adapted from presentation of Рубанович А.В.

Experiments in finding people with paranormal powers:

Joseph Rhine (1950)

1000 people guessed the sequence of 10 cards: red or black?

12 persons guessed 9 of 10 cards, two of them all 10 cards

All these “physics” in further experiments did’t confirm their paranormal abilities

Problems of «multiple comparisons» ?

Genome-wide association: gene expression studies with DNA chips – 500 000 SNP. For the significance level 0.01 we can expect up to 5000 false associations

Meta-studies: joining and comparison of different results obtained by different authors

Multiple testing is dangerous: large probability tofind false association!

Let us generate two identically distributed sampleswith 100 persons with 20-locus genotypes

gencase gencontrol ORreal p1 6 7 0.85 0.7822 12 9 1.4 0.5133 8 7 1.2 0.7964 9 9 1.0 1.005 9 9 1.0 1.006 9 11 0.80 0.6557 10 10 1.0 1.008 8 11 0.70 0.4919 8 9 0.88 0.80810 7 14 0.46 0.12711 12 9 1.4 0.51312 8 10 0.78 0.63713 12 10 1.2 0.67014 10 10 1.0 1.0015 7 6 1.2 0.78216 12 11 1.1 0.83517 10 9 1.1 0.81918 11 9 1.2 0.65519 13 10 1.3 0.53220 7 5 1.4 0.564




How it happens? Appearance of false associations

OR p Gene Sample 1 Sample 2Cases ControlsOdd Ratio –w/o association OR=1

1

Should be OR=1

Significant!

234

All 3 loci are Associated with

a disease!

How to avoid false associations?

Applying m independent statistical tests with significance level a, a probability of at least one false association should be

1-(1-a)m < 0.05

Carlo Bonferroni (1935):When applying m independent statistical test, only significant

results are results with

Bonferroni correction kills the significance of certainresults:

Control (100)

Cases (100)

OR p

Mutation 1 1 8 8,61 0,044

Mutation 2 5 15 3,35 0,024

But adjusted by Bonferroni it should be:p < 0,05/2=0,025

Two mutations associated with the disease

1 against 8 with equal size samples :

case_mut1=matrix(1,8,1)case_non_mut1=matrix(0,92,1)control_mut1=matrix(1,1,1)control_non_mut1=matrix(0,99,1)data=rbind(case_mut1,case_non_mut1,control_mut1,control_non_mut1)res=rbind(matrix(1,100,1),matrix(0,100,1))mylogit<- glm(as.formula(res~data), family=binomial(link="logit"), na.action=na.pass)exp(mylogit$coefficients[2])summary(mylogit)[["coefficients"]][,"Pr(>|z|)"]

case_mut1=matrix(1,15,1)case_non_mut1=matrix(0,85,1)control_mut1=matrix(1,5,1)control_non_mut1=matrix(0,95,1)data=rbind(case_mut1,case_non_mut1,control_mut1,control_non_mut1)res=rbind(matrix(1,100,1),matrix(0,100,1))mylogit<- glm(as.formula(res~data), family=binomial(link="logit"), na.action=na.pass)exp(mylogit$coefficients[2])summary(mylogit)[["coefficients"]][,"Pr(>|z|)"]

Example to compute OR

Assessment of individual sensitivity to ionizing radiation and DNA repair efficiency in a healthy population

F. Marcona, C. Andreoli, et al. Mut. Res., 541 (2003)

Not significant! According to Bonferroni shoud be:

Genotypes

High-Throughput Detection of GST Polymorphic Alleles in a Pediatric Cancer Population P. Barnette, R. Scholl, et al. Cancer Epidemiology, Biomarkers & PreventionVol. 13, 304–313, 2004

8 diseases

13 genotypesOR=6,4 P=0,007

OR=2,3 P=0,018

Not significant! Bonferroni correction requests:

Homozygocity in GST prevents cancer!

Control

Bonferroni method creates more problems than it solves (Thomas Perneger, 1998):

Bonferroni correction leads to very high probability to miss proper association!

“Bonferroni adjustments are, at best, unnecessary and, at worst, deleterious to sound statistical inference…”

Errors by statistical testing

Type I Error Probability to reject null hypothesis=probability to find differences where there are any = Probability of false discovery

Type II ErrorProbability to accept wrong null hypothesis= Probability not to find existing differences = Probability to miss proper discovery

Test power = 1- Type II error = Probability to reject correctly null hypothesis = Probability to make a discovery

Null hypothesis – usually about absence of differences in two samples

Traditionally a biologist is trying to avoid Type I error, i.e. to guarantee avoidance of

False discoveries

… and is not taking care aboutthe possibility to miss discovery (Type II Error)

0

0,2

0,4

0,6

0,8

0 5 10 15 20Число тестов

Ош

ибк

а II

род

а

Dependence of Type II error on number of tests usingthe Bonferroni correction

Probability to miss gene with OR=2.7 with sample sizes 100 (case) and 100 (control)

With 100 comparisons to guarantee avoidance of 1 false discovery, we miss 88% proper discoveries!

For m=100 the probability of error is 0.88

1

In single test a probability tomiss the discovery is 0.2

With 5 comparisons we miss 50% of discoveries

Number of tests

New algorithm to test statistical hypothesis: FDR-control

False Discovery Rate control: Benjamini, Hochberg (1995))

Probability of false discovery < Significance level Type I Error < 0.05

Average fraction of false discoveries < Significance level chosen

Traditional principle is replaced by

>105 papers in

Algorithm of FDR control(Benjamini(Benjamini,, Hochberg, 1995) Hochberg, 1995)

Order tests according to p-value Order tests according to p-value : :

pp11 < p < p22 < … < p < … < pmm..

For For FDR control FDR control onon αα level level ( (e.g.e.g. 0.05) 0.05),,

we findwe find

Differences are assumed to be significant Differences are assumed to be significant for for j = 1, …, j*.j = 1, …, j*.

ForFor j > jj > j* * differences are assumed not to be significant

m

jpjj j:max*

Order number ofgene

Significance levelrequired

Total number of tests

(genes)

P-value for j-th test(gene)

BonferroniCorrection

0,005

0,005

0,005

0,005

0,005

0,005

0,005

0,005

0,005

0,005

Example: multiple comparisons on 10 tests

FDR correction

0,005

0,010

0,015

0,020

0,025

0,030

0,035

0,040

0,045

0,050

Test pi

1 0,001

2 0,0055

3 0,01

4 0,015

5 0,02

6 0,04

7 0,3

8 0,5

9 0,6

10 0,8

Significant p-valueswithout correction

Order tests in ascendingorder of p-value

Bonferonni correctionleaves only first value

In first cellBonferroni p-value

In secondtwo times larger

Three times largerand so on ….

For 6th testp-value is larger than FDR

Significant corrections

after FDR control

That’s it!!!

ExampleExample: : expression ofexpression of 3051 3051 genesgenes in leykomiain leykomiaGolub T.R. Molecular classification of cancer: class discovery and class Golub T.R. Molecular classification of cancer: class discovery and class

prediction by gene expression monitoring.prediction by gene expression monitoring. // Science. 2001, v.286.// Science. 2001, v.286.

t-t-testtest: 1045 : 1045 genes, for which genes, for which p<0.05p<0.05 Bonferroni correctionBonferroni correction: 98 : 98 genes with genes with p’<0.000016p’<0.000016 FDR: 681 FDR: 681 genes, for which genes, for which FDR< 0.05FDR< 0.05

t-statistics for the comparison of gene expression in healthy

and ill patients

Number of geneswith this level of t-statistics

Bonferonni correction+

Documents

Transcript of Bonferonni correction+