542-08-#1 STATISTICS 542 Intro to Clinical Trials SURVIVAL ANALYSIS.

542-08-#1

STATISTICS 542STATISTICS 542Intro to Clinical TrialsIntro to Clinical Trials

SURVIVAL ANALYSISSURVIVAL ANALYSIS

542-08-#2

Survival Analysis Survival Analysis TerminologyTerminology

• Concerned about time to some event

• Event is often death

• Event may also be, for example

1. Cause specific death2. Non-fatal event or death,

whichever comes firstdeath or hospitalizationdeath or MI

death or tumor recurrence

542-08-#3

0 1 2 3 4 50

0.25

0.5

0.75

1Group A Group B

YEARS•At 5 years, survival rates the same•Survival experience in Group A appears more favorable, considering 1 year, 2 year, 3 year and 4 year rates together

Survival Rates at Yearly Intervals

542-08-#4

Beta-Blocker Heart Attack TrialBeta-Blocker Heart Attack Trial

LIFE-TABLE CUMULATIVE MORTALITY CURVELIFE-TABLE CUMULATIVE MORTALITY CURVE

542-08-#5

Survival AnalysisSurvival AnalysisDiscuss

1. Estimation of survival curves

2. Comparison of survival curves

I. Estimation

• Simple Case– All patients entered at the same time and followed for

the same length of time– Survival curve is estimated at various time points by

(number of deaths)/(number of patients)– As intervals become smaller and number of patients

larger, a "smooth" survival curve may be plotted

• Typical Clinical Trial Setting

542-08-#6

• Each patient has T years of follow-up• Time for follow-up taking place may be different for each patient

T years

T years

T years

T years

T0 2T

4

3

2

1

Time Since Start of Trial (T years)

Su

bje

ct

Staggered Entry

542-08-#7

•Failure time is time from entry until the time of the event•Censoring means vital status of patient is not known beyond that point

o

*

*

•

AdministrativeCensoring

Failure

CensoringLoss to Follow-up

Time Since Start of Trial (T years)

T 2T0

4

3

2

1

Subject

542-08-#8

o

*

•

*

T0

4

3

2

1

Subject

AdministrativeCensoring

CensoringLoss to Follow-up

Failure

Follow-up Time (T years)

542-08-#9

Clinical Trial with Common Termination DateClinical Trial with Common Termination Date

*

*

*

1

2

3

4

5

6

7

8

9

10

11

0 T 2TFollow-up Time (T years) Trial

Terminated

Subject

*

••• •

•••••

o

o

•

o o

o

o o

•

542-08-#10

Years of Cohort

Follow-Up Patients I IITotalEntered 100 100 200

1Died 20 25 45

Entered 80 75 1552

Died 20

Survived 60

Reduced Sample Estimate (1)

542-08-#11

– Suppose we estimate the 1 year survival rate

a. P(1 yr) = 155/200 = .775

b. P(1 yr, cohort I) = 80/100 = .80

c. P(1 yr, cohort II) = 75/100 = .75

– Now estimate 2 year survival

Reduced sample estimate = 60/100 = 0.60

Estimate is based on cohort I only

Loss of information

Reduced Sample Estimate (2)

542-08-#12

Ref: Berkson & Gage (1950) Proc of Mayo ClinicCutler & Ederer (1958) JCDElveback (1958) JASAKaplan & Meier (1958) JASA

- Note that we can express P(2 yr survival asP(2 yrs) = P(2 yrs survival|survived 1st yr)

P(1st yr survival) = (60/80) (155/200) = (0.75) (0.775) = 0.58

• This estimate used all the available data

Actuarial EstimateActuarial Estimate (1)

542-08-#13

• In general, divide the follow-up time into a series of intervals


• Let pi = prob of surviving Ii given patient alive at beginning of Ii (i.e. survived through Ii -1)

• Then prob of surviving through tk, P(tk)

iP PP P P P(t tP(Sk

1-ik321kk

...))

I1 I2 I3 I4 I5

t0 t1 t2 t3 t4 t5

542-08-#14

- Define the following

ni= number of subjects alive at beginning of Ii (i.e. at ti-1)di= number of deaths during interval Ii

li = number of losses during interval Ii (either administrative or lost to follow-up)

- We know only that di deaths and losses occurred inInterval Ii


Ii

ti-1 ti

i

542-08-#15

a. All deaths precede all losses

b. All losses precede all deaths

c. Deaths and losses uniform, (1/2 deaths before 1/2 losses)

Actuarial Estimate/Cutler-Ederer

- Problem is that P(t) is a function of the interval choice.

- For some applications, we have no choice, but if weknow the exact date of deaths and losses, theKaplan‑Meier method is preferred.

ii

iii

n

dn

iP

2/

2/P * i

ii

iii

n

dn

i

ii

n

dn

iP

Estimation of PEstimation of Pii

542-08-#16

Actuarial Lifetime Method (1)Actuarial Lifetime Method (1)

• Used when exact times of death are not known

• Vital status is known at the end of an interval period (e.g. 6 months or 1 year)

• Assume losses uniform over the interval

542-08-#17

Lifetable

At Number Number Adjusted Prop Prop. Surv. Up toInterval Risk Died Lost No. At Risk Surviving End of Interval

(ni) (di) (li)

0-1 50 9 0 50 41/50-0.82 0.82

1-2 41 6 1 41-1/2=40.5 34.5/40.5=0.852 0.852 x 0.82=0.699

2-3 34 2 4 34-4/2=32 30/32=0.937 0.937 x 0.699=0.655

3-4 28 1 5 28-5/2=25.5 24.5/25.5=0.961 0.961 x 0.655=0.629

4-5 22 2 3 22-3/2=20.5 18.5/20.5=0.902 0.902 x 0.629=0.567

2i

iinn

ip

ipII

Actuarial Lifetime Method (2)Actuarial Lifetime Method (2)

542-08-#18

Actuarial Survival CurveActuarial Survival Curve100

80

60

40

20

0

X ___

X___X___

X___X___

X___

1 2 3 4 5

542-08-#19

Kaplan-Meier Estimate (1)Kaplan-Meier Estimate (1)((JASAJASA, 1958), 1958)

• Assumptions

1. "Exact" time of event is known

Failure = uncensored event

Loss = censored event

2. For a "tie", failure always before loss

3. Divide follow-up time into intervals such that

a. Each event defines left side of an interval

b. No interval has both deaths & losses

542-08-#20

Kaplan-Meier Estimate (2)Kaplan-Meier Estimate (2)((JASAJASA, 1958), 1958)

• Then

ni = # at risk just prior to death at ti

• Note if interval contains only losses, Pi = 1.0

• Because of this, we may combine intervals with only losses with the previous interval containing

only deaths, for convenience

X———o—o—o——

i

ii

i n

dnP

542-08-#21

Estimate of S(t) or P(t)Estimate of S(t) or P(t)

Suppose that for N patients, there are K distinct failure (death) times. The Kaplan-Meier estimate of survival curves becomes P(t)=P (Survival t)

K-M or Product Limit Estimate

ti t i = 1,2,…,k

where ni = ni-1 - li-1 - di-1

li-1 = # censored events since death at ti-1

di-1 = # deaths at ti-1

i

ii

n

dni

tP)(

)(ˆ

542-08-#22

Estimate of S(t) or P(t)Estimate of S(t) or P(t)

• Variance of P(t)

Greenwood’s Formula

tt

dnn

d

itPtPV

i

iii

i

)( )](ˆ[)](ˆ[ 2

542-08-#23

Example (see Table 14-2 in FFD)

Suppose we follow 20 patients and observe the event time, either failure (death) or censored (+), as

[0.5, 0.6+), [1.5, 1.5, 2.0+), [3.0, 3.5+, 4.0+), [4.8],

[6.2, 8.5+, 9.0+), [10.5, 12.0+ (7 pts)]

There are 6 distinct failure or death times

0.5, 1.5, 3.0, 4.8, 6.2, 10.5

KM Estimate (1)

542-08-#24

1. failure at t1 = 0.5 [.5, 1.5)n1 = 20d1 = 1l1 = 1 (i.e. 0.6+)

If t [.5, 1.5), p(t) = p1 = 0.95

V [ P(t1) ] = [.95]2 {1/20(19)} = 0.0024

9520

120

n

dnp

1

1 1

1.

KM Estimate (2)KM Estimate (2)

^

^

542-08-#25

Data [0.5, 0.6+), [1.5, 1.5, 2.0+), 3.0 etc.

2. failure at t2 = 1.5 n2 = n1 - d1 - 1

[1.5, 3.0) = 20 - 1 - 1= 18

d2 = 22 = 1 (i.e. 2.0+)

If t [1.5, 3.0), then P(t) = (0.95)(0.89) = 0.84

V [P(t2)] = [0.84]2 { 1/20(19) + 2/18(18-2) } = 0.0068

89.18

218

2

22

2

n

dnp

KM Estimate (4)KM Estimate (4)

542-08-#26

Life Table 14.2Kaplan-Meier Life Table for 20 Subjects

Followed for One Year

Interval Interval Time

Number of death nj dj j

[.5,1.5) 1 .5 20 1 1 0.95 0.95 0.0024

[1.5,3.0) 2 1.5 18 2 1 0.89 0.84 0.0068

[3.0,4.8) 3 3.0 15 1 2 0.93 0.79 0.0089

[4.8,6.2) 4 4.8 12 1 0 0.92 0.72 0.0114

[6.2,10.5) 5 6.2 11 1 2 0.91 0.66 0.0135

[10.5, ) 6 10.5 8 1 7* 0.88 0.58 0.0164

jp̂ )(p̂

jt )](p̂V[

jt

nj : number of subjects alive at the beginning of the jth intervaldj : number of subjects who died during the jth intervalj : number of subjects who were lost or censored during the jth interval : estimate for pj, the probability of surviving the jth interval

given that the subject has survived the previous intervals : estimated survival curve : variance of

* Censored due to termination of study

jp̂

(t)pj

ˆ

(t)]pV[j

ˆ )(ˆ tP

542-08-#27

0

0.5

0.6

0.7

0.8

0.9

1.0

2 4 6 8 10 12

*

* *

*

*

*

*

Survival Time t (Months)

Est

imat

ed S

urvi

val C

ure

[P(t

)]Survival Curve

Kaplan-Meier Estimate

^

o

oo o

o o

oooo

ooo

542-08-#28

Comparison of Comparison of Two Survival CurvesTwo Survival Curves

• Assume that we now have a treatment group and a control group and we wish to make a comparison between their survival experience

• 20 patients in each group

(all patients censored at 12 months)

Control 0.5, 0.6+, 1.5, 1.5, 2.0+, 3.0, 3.5+, 4.0+,

4.8, 6.2, 8.5+, 9.0+, 10.5, 12+'s

Trt 1.0, 1.6+, 2.4+, 4.2+, 4.5, 5.8+, 7.0+, 11.0+, 12+'S

542-08-#29

1. t1 = 1.0 n1 = 20 p1 = 20 - 1 = 0.95

d1 = 1 20

l1 = 3

p(t)= .95

2. t2 = 4.5 n2 = 20 - 1 - 3 p2 = 16 - 1 =0 .94

= 16 16

d2 = 1

0024.0)]t(P̂V[1

89.0)94.0)(95.0()t(P̂ 0530.)](ˆ 2

tPV[

Kaplan-Meier Estimate for Kaplan-Meier Estimate for TreatmentTreatment

^

542-08-#30

Kaplan-Meier EstimateKaplan-Meier Estimate

0

0.5

0.6

0.7

0.8

0.9

1.0

2 4 6 8 10 12

*

* *

*

*

*

Survival Time t (Months)

Est

imat

ed S

urvi

val C

ure

[P(t

)]^

TRT

CONTROL

o

* *

o o

ooooo

oo o

542-08-#31

Comparison of Comparison of Two Survival CurvesTwo Survival Curves

• Comparison of Point Estimates

– Suppose at some time t* we want to compare PC(t*) for the control and PT(t*) for treatment

– The statistic

has approximately, a normal distribution under H0

– Example:

2/1]*)(ˆ*)(ˆ[

*)(ˆ*)(ˆ

tPVtPV

tPtPZ

CT

CT

)(ˆ)(ˆ 6P vs6P CompareCT

3211290

170

0114000530

7208921

..

.

]..[

../

Z

542-08-#32

• Comparison of Overall Survival Curve

H0: Pc(t) = PT(t)

A. Mantel-Haenszel Test

Ref: Mantel & Haenszel (1959) J Natl Cancer InstMantel (1966) Cancer Chemotherapy Reports

- Mantel and Haenszel (1959) showed that a series of 2 x 2

tables could be combined into a summary statistic(Note also: Cochran (1954) Biometrics)

- Mantel (1966) applied this procedure to the comparison of

two survival curves

- Basic idea is to form a 2 x 2 table at each distinct deathtime, determining the number in each group who were

atrisk and number who died

542-08-#33

Suppose we have K distinct times for a death occurring

ti i = 1,2, .., K. For each death time,

Died At Risk at ti Alive (prior to ti)

Treatment ai bi ai + bi

Control ci di ci + di

ai + ci bi + di Ni

• Consider ai, the observed number ofdeaths in the TRT group, under H0

Comparison of Comparison of Two Survival Curves (1)Two Survival Curves (1)

542-08-#34

E(ai) = (ai + bi)(ai + ci)/Ni

Mantel-Haenszel Statistic

1)(NN

)d(c)b(a)db)(c(a )V(a

i

2

i

iiiiiiii

i

21iii χ~)V(a Σ/)]}a(E[a{MH 2

N(0,1) ~ MHZ

Comparison of Comparison of Two Survival Curves(2)Two Survival Curves(2)

542-08-#35

Table 14.3Table 14.3Comparison of Survival Data for a Control Group and an Comparison of Survival Data for a Control Group and an

Intervention Group Using the Mantel-Haenszel ProcedureIntervention Group Using the Mantel-Haenszel ProcedureRank Event Intervention Control Total

Times

j tj aj + bj aj j cj + dj cj j aj + cj bj + dj

1 0.5 200 0 201 1 1 39

2 1.0 201 0 180 0 1 37

3 1.5 190 2 182 1 2 35

4 3.0 170 1 151 2 1 31

5 4.5 161 0 120 0 1 27

6 4.8 150 1 121 0 1 26

7 6.2 140 1 111 2 1 24

8 10.5 130 1 8 1 1 20

aj + bj = number of subjects at risk in the intervention group prior to the death at time t j

cj + cj = number of subjects at risk in the control group prior to the death at time t j aj = number of subjects in the intervention group who died at time t j

cj = number of subjects in the control group who died at time t j

j = number of subjects who were lost or censored between time t j and time tj+1

aj + cj = number of subjects in both groups who died at time tj

bj + dj = number of subjects in both groups who are at risk minus the number who died at time t j

542-08-#36

• Operationally

1. Rank event times for both groups combined2. For each failure, form the 2 x 2 table

a. Number at risk (ai + bi, ci + di)b. Number of deaths (ai, ci)c. Losses (lTi, lCi)

• Example (See table 14-3 FFD) - Use previous data set

Trt: 1.0, 1.6+, 2.4+, 4.2+, 4.5, 5.8+, 7.0+, 11.0+, 12.0+'s

Control: 0.5, 0.6+, 1.5, 1.5, 2.0+, 3.0, 3.5+, 4.0+, 4.8, 6.2, 8.5+, 9.0+, 10.5, 12.0+'s

Mantel-Haenszel TestMantel-Haenszel Test

542-08-#37

1. Ranked Failure Times - Both groups combined

0.5, 1.0, 1.5, 3.0, 4.5, 4.8, 6.2, 10.5 C T C C T C C C

8 distinct times for death (k = 8)

2. At t1 = 0.5 (k = 1) [.5, .6+, 1.0)

T: a1 + b1 = 20 a1 = 0 lT1 = 0 c1 + d1 = 20 c1 = 1 lC1 = 1 1 loss @ .6+

D A R

T 0 20 20

C 1 19 20

1 39 40

E(a1)= 1•20/40 = 0.5

V(a1) = 1•39 • 20 • 20 402 •39

542-08-#38

3. At t2 = 1.0 (k = 2) [1.0, 1.5)

T: a2 + b2 = (a1 + b1) - a1 - lT1 a2 = 1.0= 20 - 0 - 0

= 20 lT2 = 0

C. c2 + d2 = (c1 + d1) - c1 - lC1 c2 = 0 = 20 - 1 - 1

= 18 lC2 = 0

so

D A R

T 1 19 20

C 0 18 18

1 37 38

E(a2)= 1•20 38V(a2) = 1•37 • 20 • 18

382 •37

542-08-#39

Eight 2x2 Tables Corresponding to the Event TimesEight 2x2 Tables Corresponding to the Event TimesUsed in the Mantel-Haenszel Statistic in Survival Used in the Mantel-Haenszel Statistic in Survival

Comparison of Treatment (T) and Control (C) GroupsComparison of Treatment (T) and Control (C) Groups

1. (0.5 mo.)* D† A‡ R§ 5. (4.5 mo.)* D A RT 0 20 20 T 1 15 16C 1 19 20 C 0 12 12

1 39 40 1 27 28

2. (1.0 mo) D A R 6. (4.8 mo.) D A RT 1 19 20 T 0 15 15C 0 18 18 C 1 11 12

1 37 38 1 26 27

3. (1.5 mo.) D A R 7. (6.2 mo.) D A RT 0 19 19 T 0 14 14C 2 16 18 C 1 10 11

2 35 37 1 24 25

4. (3.0 mo.) D A R 8. (10.5 mo.) D A RT 0 17 17 T 0 13 13C 1 14 15 C 1 7 8

1 31 32 1 20 21

* Number in parentheses indicates time, tj, of a death in either group† Number of subjects who died at time tj

‡ Number of subjects who are alive between time tj and time tj+1

§ Number of subjects who were at risk before the death at time tj R=D+A)

542-08-#40

Compute MH StatisticsCompute MH Statistics

Recall K = 1 K = 2 K = 3t1 = 0.5 t2 = 1.0 t3 = 1.5

D A0 20 201 19 201 39 40

D A1 19 200 18 181 37 38

D A0 19 192 16 182 35 37

a. ai = 2 (only two treatment deaths)

b. E(ai ) = 20(1)/40 + 20(1)/38 + 19(2)/37 + . . .= 4.89

c. V(ai) =

= 2.22d. MH = (2 - 4.89)2/2.22 = 3.76 or ZMH =

K

i 1

K

i 1

K

i 1

...)(

))()((

)(

))()((

3738

1820371

3940

202039122

941.MH

542-08-#41

B. Gehan Test (Wilcoxon)Ref: Gehan, Biometrika (1965)

Mantel, Biometrics (1966)Gehan (1965) first proposed a modified Wilcoxon rankstatistic for survival data with censoring. Mantel (1967) showed asimpler computational version of Gehan’s proposed test.

1. Combine all observations XT’s and XC’s into a single sample Y1, Y2, . . ., YNC + NT

2. Define Uij wherei = 1, NC + NT j = 1, NC + NT

-1 Yi < Yj and death at Yi Uij = 1 Yi > Yj and death at Yj

0 elsewhere

3. Define Ui

i = 1, … , NC + NT ji

ij

NTNC

1ji

U U

542-08-#42

Note:

Ui = {number of observed times definitely less than i}{number of observed times definitely greater}

4. Define W = Ui (controls)

5. V[W] = NCNT

Variance due to Mantel

6.

• Example (Table 14-5 FFD)Using previous data set, rank all observations

Gehan TestGehan Test

NTNC

1i

N(NN(N

U

TCTC

2

i

)) 1

~V(W)

WZ

or

(0,1) N ~V(W)

WZ

2

1

2

2

G

G

542-08-#43

The Gehan Statistics, Gi involves the scores Ui and is defined as

G = W2/V(W)

where W = Ui (Uis in control group only)

and

iC NN

ii

iCiC

iC UNNNN

NNWV

1

2

1)(

))(()(

542-08-#44

Example of Gehan Statistics Scores UExample of Gehan Statistics Scores Uii for for

Intervention and Control (C) GroupsIntervention and Control (C) GroupsObservation Ranked Definitely Definitely = Ui

i Observed Time Group Less More

1 0.5 C 0 39 -392 (0.6)* C 1 0 13 1.0 I 1 37 -364 1.5 C 2 35 -335 1.5 C 2 35 -336 (1.6) I 4 0 47 (2.0) C 4 0 48 (2.4) I 4 0 49 3.0 C 4 31 -27

10 (3.5) C 5 0 511 (4.0) C 5 0 512 (4.2) I 5 0 513 4.5 I 5 27 -2214 4.8 C 6 26 -2015 (5.8) I 7 0 716 6.2 C 7 24 -1717 (7.0) I 8 0 818 (8.5) C 8 0 819 (9.0) C 8 0 820 10.5 C 8 20 -1221 (11.0) I 9 0 9

22-40 (12.0) 12I, 7C 9 0 9

*Censored observations

542-08-#45

Thus W = (-39) + (1) + (-36) + (-33) + (4) + . . . .

= -87

and V[W] = (20)(20) {(-39)2 +12 + (-36)2 + . . . } (40)(39)

= 2314.35

so

• Note MH and Gehan not equal

811352314

87.

.

GZ

Gehan TestGehan Test

542-08-#46

Cox Proportional Hazards ModelCox Proportional Hazards ModelRef: Cox (1972) Journal of the Royal Statistical Association

• Recall simple exponential

S(t) = e-t

• More complicated

If (s) = , get simple model

• Adjust for covariates

• Cox PHM

(t,x) =0(t) ex

})(exp{)( dsstS t 0

}),(exp{),( dsxsxtS t 0

542-08-#47

So

S(t1,X) =

=

=

• Estimate regression coefficients (non-linear estimation) , SE()

• Examplex1 = 1 Trt

2 Controlx2 = Covariate 1

indicator of treatment effect, adjusted for x2, x3 , . . .

• If no covariates, except for treatment group (x1),PHM = logrank

}exp ds(s)e{- xB'

0t

0

xB'00 e(s)ds ]e t[

xB'e

0(t)][S

,ˆ1

Cox Proportional Hazards ModelCox Proportional Hazards Model

542-08-#48

Homework ProblemHomework Problem

PatientNumber

Treatmenta Length ofObservationb Statusc

1 D 6 D2 P 16 A3 P 2 D4 D 1 D5 D 1 D6 P 4 A7 D 16 A8 D 3 A9 P 16 A

10 D 8 D11 P 16 A12 P 2 D13 D 4 D14 P 16 A15 P 5 A16 D 1 D17 D 10 A18 P 2 A19 P 2 A20 D 10 D21 P 16 A22 D 1 A23 P 16 A24 D 15 A

1. Kaplan-Meier2. Gehan-Wilcoxon3. Mantel-Haenszel

a D = drug; P = placebob In weeksc A = alive; D = dead

Source: P.B. Gregory (1974)

542-08-#49

Survival Analysis SummarySurvival Analysis Summary• Time to event methodology very useful in

multiple settings• Can estimate time to event probabilities or

survival curves• Methods can compare survival curves

– Can stratify for subgroups– Can adjust for baseline covariates using

regression model

• Need to plan for this in sample size estimation & overall design

542-08-#1 STATISTICS 542 Intro to Clinical Trials SURVIVAL ANALYSIS.

Documents

Transcript of 542-08-#1 STATISTICS 542 Intro to Clinical Trials SURVIVAL ANALYSIS.