Nonparametric maximum likelihood estimation (MLE) for bivariate censored data

42
Nonparametric maximum likelihood estimation (MLE) for bivariate censored data Marloes H. Maathuis advisors: Piet Groeneboom and Jon A. Wellner

description

Nonparametric maximum likelihood estimation (MLE) for bivariate censored data. Marloes H. Maathuis advisors: Piet Groeneboom and Jon A. Wellner. Motivation. Estimate the distribution function of the incubation period of HIV/AIDS: Nonparametrically Based on censored data: - PowerPoint PPT Presentation

Transcript of Nonparametric maximum likelihood estimation (MLE) for bivariate censored data

Page 1: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Nonparametric maximum likelihood estimation (MLE)

for bivariate censored data

Marloes H. Maathuis

advisors:

Piet Groeneboom and Jon A. Wellner

Page 2: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Motivation

Estimate the distribution function of the

incubation period of HIV/AIDS:– Nonparametrically– Based on censored data:

• Time of HIV infection is interval censored

• Time of onset of AIDS is interval censored

or right censored

Page 3: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Approach

• Use MLE to estimate the bivariate distribution

• Integrate over diagonal strips: P(Y-X ≤ z) X (HIV)

Y (AIDS)

z

Page 4: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Main focus of the project

• MLE for bivariate censored data:– Computational aspects– (In)consistency and methods to repair the

inconsistency

Page 5: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Main focus of the project

• MLE for bivariate censored data:– Computational aspects– (In)consistency and methods to repair the

inconsistency

Page 6: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

1980

1992

1996

1980 1983 1986 X (HIV)

Y (AIDS)In

terv

al o

f on

set

of A

IDS

Interval ofHIV infection

Page 7: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

1980

1992

1996

1980 1983 1986 X (HIV)

Y (AIDS)In

terv

al o

f on

set

of A

IDS

Interval ofHIV infection

Observation rectangle Ri

Page 8: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

X (HIV)

Y (AIDS)

Observation rectangle Ri

1

max log ,n

F F i i ii

P X Y R

F

Page 9: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

X (HIV)

Y (AIDS)Maximal intersections

Observation rectangle Ri

1

max log ,n

F F i i ii

P X Y R

F

Page 10: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

X (HIV)

Y (AIDS)Maximal intersections

Observation rectangle Ri

1

max log ,n

F F i i ii

P X Y R

F

Page 11: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

X (HIV)

Y (AIDS)Maximal intersections

Observation rectangle Ri

1

max log ,n

F F i i ii

P X Y R

F

Page 12: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

X (HIV)

Y (AIDS)Maximal intersections

Observation rectangle Ri

1

max log ,n

F F i i ii

P X Y R

F

Page 13: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

X (HIV)

Y (AIDS)Maximal intersections

Observation rectangle Ri

1

max log ,n

F F i i ii

P X Y R

F

Page 14: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

α1 α2

α3 α4

X (HIV)

Y (AIDS)Maximal intersections

Observation rectangle Ri

1

max log ,n

F F i i ii

P X Y R

F

s.t. and

4 1 1 3max log( ) log( )

1 2 2 4log( ) log( )

3 4log( )

0, 1, , 4,i i 4

1

1ii

Page 15: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

3/5 0

0 25

X (HIV)

Y (AIDS)Maximal intersections

Observation rectangle Ri

The αi’s are not always uniquely determined: mixture non uniqueness

1

max log ,n

F F i i ii

P X Y R

F

s.t. and

4 1 1 3max log( ) log( )

1 2 2 4log( ) log( )

3 4log( )

0, 1, , 4,i i 4

1

1ii

Page 16: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Computation of the MLE

• Reduction step:

determine the maximal intersections

• Optimization step:

determine the amounts of mass assigned to the maximal intersections

Page 17: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Computation of the MLE

• Reduction step:

determine the maximal intersections

• Optimization step:

determine the amounts of mass assigned to the maximal intersections

Page 18: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Existing reduction algorithms

• Betensky and Finkelstein (1999, Stat. in Medicine) • Gentleman and Vandal (2001, JCGS) • Song (2001, Ph.D. thesis) • Bogaerts and Lesaffre (2003, Tech. report)

The first three algorithms are very slow,

the last algorithm is of complexity O(n3).

Page 19: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

New algorithms

• Tree algorithm

• Height map algorithm: – based on the idea of a height map of the

observation rectangles– very simple– very fast: O(n2)

Page 20: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

1

11

1000

0

11

33

2110

0

21

33

2121

0

21

22

1011

0

10

11

0011

0

00

21

1122

1

00

11

0011

0

00

00

0011

0

0

Height map algorithm: O(n2)

1

22

2110

0

2

Page 21: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data
Page 22: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Main focus of the project

• MLE of bivariate censored data:– Computational aspects – (In)consistency and methods to repair the

inconsistency

Page 23: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

HIV

AIDS

u1 u2

Time of HIV infection is interval censored case 2

Page 24: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

HIV

AIDS

u1 u2

Time of HIV infection is interval censored case 2

Page 25: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

HIV

AIDS

u1 u2

Time of HIV infection is interval censored case 2

Page 26: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

HIV

AIDS

t = min(c,y)

u1 u2

Time of onset of AIDS is right censored

Page 27: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

HIV

AIDS

t = min(c,y)

u1 u2

Time of onset of AIDS is right censored

Page 28: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

HIV

AIDS

t = min(c,y)

u1 u2

Time of onset of AIDS is right censored

Page 29: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

t = min(c,y)

HIV

AIDS

u1 u2

Page 30: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

HIV

AIDS

u1 u2

t = min(c,y)

Page 31: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

HIV

AIDS

u1 u2

t = min(c,y)

Page 32: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

HIV

AIDS

u1 u2

t = min(c,y)

Page 33: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Inconsistency of the naive MLE

Page 34: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Inconsistency of the naive MLE

Page 35: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Inconsistency of the naive MLE

Page 36: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Inconsistency of the naive MLE

Page 37: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Methods to repair inconsistency

• Transform the lines into strips

• MLE on a sieve of piecewise constant densities

• Kullback-Leibler approach

Page 38: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

• cannot be estimated

consistently

X = time of HIV infectionY = time of onset of AIDSZ = Y-X = incubation period

( )P Z z

Page 39: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

X = time of HIV infectionY = time of onset of AIDSZ = Y-X = incubation period

1 2( )P Z z x X x

• An example of a parameter we can estimate consis-tently is:

Page 40: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Conclusions (1)

• Our algorithms for the parameter reduction step are significantly faster than other existing algorithms.

• We proved that in general the naive MLE is an inconsistent estimator for our AIDS model.

Page 41: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Conclusions (2)

• We explored several methods to repair the inconsistency of the naive MLE.

• cannot be estimated consistently without additional assumptions. An alternative parameter that we can estimate consistently is:

. 1 2( )P Z z x X x

( )P Z z

Page 42: Nonparametric maximum likelihood estimation (MLE)  for bivariate censored data

Acknowledgements

• Piet Groeneboom

• Jon Wellner