Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf ·...

43
Statistics in Cancer Trials Statistics in Cancer Trials Dr. Avijit Hazra Department of Pharmacology, IPGME&R Dr. Avijit Hazra Department of Pharmacology, IPGME&R

Transcript of Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf ·...

Page 1: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Statistics inCancer TrialsStatistics in

Cancer Trials

Dr. Avijit HazraDepartment of Pharmacology, IPGME&R

Dr. Avijit HazraDepartment of Pharmacology, IPGME&R

Page 2: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Phase II studies - Therapeutic exploratory trials

Small studies early in the development of a regimen.Typically focus on toxicity and response data.Require precise definition of response & toxicity criteria.

Phase II studies - Therapeutic exploratory trials

Small studies early in the development of a regimen.Typically focus on toxicity and response data.Require precise definition of response & toxicity criteria.

Phase III studies - Therapeutic confirmatory trials

Larger comparative studies.Also assess survival and disease-free survival.Require survival analysis.

Phase III studies - Therapeutic confirmatory trials

Larger comparative studies.Also assess survival and disease-free survival.Require survival analysis.

Cancer clinical trialsCancer clinical trials

Phase IV - Postmarketing surveillance / trials

May or may not be comparative studies.May be done to assess benefit, harm, QOL, economics.

Phase IV - Postmarketing surveillance / trials

May or may not be comparative studies.May be done to assess benefit, harm, QOL, economics.

2

Page 3: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

“… statisticians apply, to problems in which we are interested, a technique which we do not understand. It is exasperating, when we have studied a problem by methods that we have spent laborious years in mastering, to find our conclusions questioned, and perhaps refuted, by someone who could not have made the observations himself.”

“… statisticians apply, to problems in which we are interested, a technique which we do not understand. It is exasperating, when we have studied a problem by methods that we have spent laborious years in mastering, to find our conclusions questioned, and perhaps refuted, by someone who could not have made the observations himself.”

- Sir Austin Bradford HillFather of the Randomized Controlled Trial

Medical statisticsMedical statistics

Page 4: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Statistics is the science of data analysis.Statistics is the science of data analysis.

“If your experiment needs statistics, then you ought to have done a better experiment.”

- Ernst Rutherford

“If your experiment needs statistics, then you ought to have done a better experiment.”

- Ernst Rutherford

Can data be analyzed without statistics?Can data be analyzed without statistics?

Most biological researchers are not so lucky• Enormous biological variability• Inability to control all relevant variables• Interest in small effects

Most biological researchers are not so lucky• Enormous biological variability• Inability to control all relevant variables• Interest in small effects

Medical statisticsMedical statistics

4

Page 5: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

evolved out of a felt needevolved out of a felt need

5

Medical statisticsMedical statistics

Page 6: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Clinical trial designsClinical trial designs

� Crossover

� Parallel groups

R*

R*

* R = Randomization Point

� Single treatmentbaseline endpoint

Page 7: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� FactorialTreatment I

ac ad

bdbc

Dose a

Dose b

Dose c Dose d

� Survivalmorbidityendpoint

baseline

mortalityendpoint

subjects withoutmorbidity ormortality

� WithdrawalR*

non-responders

Clinical trial designsClinical trial designs

* R = Randomization Point

Page 8: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� No treatment

Placebo

� Dose-response 5 mg 10 mg

� External(including Historical)

� Placebo

� Active

vs.

vs.

vs.

vs.

vs.

Controls in clinical trialsControls in clinical trials

5 mg

Nil

Page 9: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

The goal of medical statisticsThe goal of medical statistics

Page 10: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Categorical / (Ordinal) / Qualitative dataThat which can be classified into mutually exclusive

categories based on a predetermined set of criteria.e.g. Tumor response - Complete response, Partial response,

Stable disease, Increasing

Categorical / (Ordinal) / Qualitative dataThat which can be classified into mutually exclusive

categories based on a predetermined set of criteria.e.g. Tumor response - Complete response, Partial response,

Stable disease, Increasing

Data typesData types

Numerical / Measurement / Quantitative dataThat which comes from measurement.e.g. CA-125 level in ovarian cancer.

Numerical / Measurement / Quantitative dataThat which comes from measurement.e.g. CA-125 level in ovarian cancer.

Time-to-event data is a special case of quantitative data.

Time-to-event data is a special case of quantitative data.

10

Page 11: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Process by which we acquire information about populations from samples.

� Three procedures for making inferences.

� Process by which we acquire information about populations from samples.

� Three procedures for making inferences.

Statistical Inference

Sample Population

Statistical Inference

Sample Population

Point estimatesPoint estimates Interval estimatesInterval estimates

Hypothesis testingHypothesis testing

Inferential statisticsInferential statistics

11

Page 12: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Point estimatorPoint estimator

Sample distributionSample distribution

ParameterParameter

??

Population distributionPopulation distribution

A point estimator draws inference about a population by estimating the value of an unknown parameter using a single value.

A point estimator draws inference about a population by estimating the value of an unknown parameter using a single value.

Point estimationPoint estimation

12

Page 13: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Interval estimatorInterval estimator

Sample distributionSample distribution

An interval estimator draws inferences about a population by estimating the value of an unknown parameter using an interval.

An interval estimator draws inferences about a population by estimating the value of an unknown parameter using an interval.

Population distributionPopulation distribution ParameterParameter

Interval estimationInterval estimation

13

Page 14: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� A confidence interval has the form:

point estimate ±±±± margin of error

� The point estimate is our guess for the value of the unknown parameter.

� The margin of error shows how accurate we believe our guess is, based on the sampling distribution of the estimate.

� A confidence interval has the form:

point estimate ±±±± margin of error

� The point estimate is our guess for the value of the unknown parameter.

� The margin of error shows how accurate we believe our guess is, based on the sampling distribution of the estimate.

Confidence interval 1Confidence interval 1

14

Page 15: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Confidence interval 2Confidence interval 2

� A confidence interval has the form:

point estimate ±±±± margin of error

� The confidence interval is therefore likely to include the true population parameter with the given confidence level.

� The given confidence level (e.g. 95% or 99%) shows how confident we are that the procedure will catch the true population parameter.

� Sample size influences the confidence interval.

� A confidence interval has the form:

point estimate ±±±± margin of error

� The confidence interval is therefore likely to include the true population parameter with the given confidence level.

� The given confidence level (e.g. 95% or 99%) shows how confident we are that the procedure will catch the true population parameter.

� Sample size influences the confidence interval.

15

Page 16: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

16

Consider Hazard ratio:

Ratio of death rates in two study arms.

Consider Hazard ratio:

Ratio of death rates in two study arms.

Estimated Hazard Ratio of 2.1 in favor of a new Rx seems appealing.

Estimated Hazard Ratio of 2.1 in favor of a new Rx seems appealing.

However if 95% CI is 0.4 to 3.8 , the Hazard Ratio is no longer so appealing.

However if 95% CI is 0.4 to 3.8 , the Hazard Ratio is no longer so appealing.

Confidence interval 3Confidence interval 3

Page 17: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

17

n = 100

Chi-square p value 0.3149 ⇒⇒⇒⇒ Non-significant

n = 100

Chi-square p value 0.3149 ⇒⇒⇒⇒ Non-significant

Importance of a large sample 1Importance of a large sample 1

2525 2525

3030 2020

+ –+ –

A

B

A

B

ResponseResponse

Rx armRx arm

Page 18: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

18

n = 1000

Chi-square p value 0.0015 ⇒⇒⇒⇒ Significant

n = 1000

Chi-square p value 0.0015 ⇒⇒⇒⇒ Significant

Importance of a large sample 2Importance of a large sample 2

250250 250250

300300 200200

+ –+ –

A

B

A

B

ResponseResponse

Rx armRx arm

Page 19: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

An RCT is planned to compare surgery and radiotherapy in the Rx of operable lung cancer.

However, during conduct of the trial it was found that some patients randomized for surgery were

inoperable and had to be Rx by radiotherapy.

An RCT is planned to compare surgery and radiotherapy in the Rx of operable lung cancer.

However, during conduct of the trial it was found that some patients randomized for surgery were

inoperable and had to be Rx by radiotherapy.

Continued ...Continued ...

What is to be done for analysis in this situation?

Analysis strategy 1Analysis strategy 1

19

Page 20: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Inoperable patients apparent only in

surgery arm

Inoperable patients apparent only in

surgery arm

SurgerySurgeryRadiotherapyRadiotherapy

There are likely to be inoperable patients in both groups but they cannot be identified in the radiotherapy group.

Patients with inoperable tumors are likely to have more advanced disease.

There are likely to be inoperable patients in both groups but they cannot be identified in the radiotherapy group.

Patients with inoperable tumors are likely to have more advanced disease.

Analysis strategy 2Analysis strategy 2

Intention-to-treat analysis is the best approach.Intention-to-treat analysis is the best approach.

20

Page 21: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

The outcome of interest is time to an event - this does not necessarily have to be death. Studying time to an event poses two problems:

� Time interval can vary from one subject to another. Also at the end of the study the event may not have occurred for many subjects.

So time to event is not Normally distributed.

� In a long period of follow-up many subjects drop out. The information we have about them is only till last follow-up. So data is Censored.

The outcome of interest is time to an event - this does not necessarily have to be death. Studying time to an event poses two problems:

� Time interval can vary from one subject to another. Also at the end of the study the event may not have occurred for many subjects.

So time to event is not Normally distributed.

� In a long period of follow-up many subjects drop out. The information we have about them is only till last follow-up. So data is Censored.

Survival analysisSurvival analysis

21

Page 22: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Model time to death or time to event.

• Unlike linear regression, survival analysis has a dichotomous (binary) outcome.

• Unlike logistic regression, survival analysis analyzes the time to an event.

� Able to account for censoring.

� Can compare survival between 2+ groups.

� Can assess relationship between covariates and survival time.

� Model time to death or time to event.

• Unlike linear regression, survival analysis has a dichotomous (binary) outcome.

• Unlike logistic regression, survival analysis analyzes the time to an event.

� Able to account for censoring.

� Can compare survival between 2+ groups.

� Can assess relationship between covariates and survival time.

Survival analysisSurvival analysis

22

Page 23: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

TechniquePredictorVariables

OutcomeVariable

Censoringpermitted?

Linear

Regression

Categorical orcontinuous

Normallydistributed

No

Logistic

Regression

Categorical orcontinuous

Binary (except in

polytomous logisticregression)

No

Survival

Analyses

Time andcategorical orcontinuous

Binary Yes

Regression vs. survival analysisRegression vs. survival analysis

23

Page 24: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Technique Mathematical model Yields

Linear

Regression

Y=B1X + Bo(linear)

Linear changes

Logistic

Regression

Ln(P/1-P) = B1X+Bo(sigmoidal prob.)

Odds ratios

Survival

Analyses

h(t) = ho(t)exp(B1X+Bo) Hazard rates

Regression vs. survival analysisRegression vs. survival analysis

24

Page 25: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Examples

• Time to death or clinical endpoint

• Time in remission after Rx of disease

• Time to wean from ventilator support

• Time to relapse in alcoholism Rx

� When one believes that 1+ explanatory variable(s) explains the differences in time to an event

� Especially when follow-up is incomplete or variable

� Examples

• Time to death or clinical endpoint

• Time in remission after Rx of disease

• Time to wean from ventilator support

• Time to relapse in alcoholism Rx

� When one believes that 1+ explanatory variable(s) explains the differences in time to an event

� Especially when follow-up is incomplete or variable

When to use survival analysisWhen to use survival analysis

25

Page 26: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Survival prospects are assumed to stay the same throughout the study.

� Subjects lost to follow-up have the same prognosis as those who continue with the study.

� The probability that an individual is censored is unrelated to the probability that the individual suffers an event.

� Anything which affects the hazard does so by the same proportion at all time.

� Survival prospects are assumed to stay the same throughout the study.

� Subjects lost to follow-up have the same prognosis as those who continue with the study.

� The probability that an individual is censored is unrelated to the probability that the individual suffers an event.

� Anything which affects the hazard does so by the same proportion at all time.

Assumptions in survival analysisAssumptions in survival analysis

26

Page 27: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Subject does not experience event of interest

� Incomplete follow-up• Lost to follow-up• Withdraws from study• Dies (if death not a

study outcome)

� Left or right censored

� Subject does not experience event of interest

� Incomplete follow-up• Lost to follow-up• Withdraws from study• Dies (if death not a

study outcome)

� Left or right censored

Censored data in survival analysisCensored data in survival analysis

27

Page 28: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Like other statistical analysis, survival type data can be dealt with by:

• Descriptive statistics

• Univariate statistics

• Multivariate statistics

Like other statistical analysis, survival type data can be dealt with by:

• Descriptive statistics

• Univariate statistics

• Multivariate statistics

Approaches to survival analysisApproaches to survival analysis

28

Page 29: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Average survival

• When can this be calculated?

• What test should be used to compare average survival between 2 groups?

� Average hazard rate

• Total number of failures divided by observed survival time (units are therefore 1/t or 1/pt-yrs)

• An incidence rate, with a higher value indicating more events per time

� Average survival

• When can this be calculated?

• What test should be used to compare average survival between 2 groups?

� Average hazard rate

• Total number of failures divided by observed survival time (units are therefore 1/t or 1/pt-yrs)

• An incidence rate, with a higher value indicating more events per time

Descriptive statistics in survival analysisDescriptive statistics in survival analysis

29

Page 30: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Also known as product-limit method

� Accounts for censoring

� Generates the characteristic ‘crazy stairs’ survival curves

� Does not account for confounding or effect modification by other covariates

� Also known as product-limit method

� Accounts for censoring

� Generates the characteristic ‘crazy stairs’ survival curves

� Does not account for confounding or effect modification by other covariates

Univariate methods:Kaplan-Meier survival plots

Univariate methods:Kaplan-Meier survival plots

30

Page 31: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Typical Kaplan-Meier survival plotsTypical Kaplan-Meier survival plots

Page 32: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Survivor function, S(t) defines the probability of surviving longer than time t

• This is what the Kaplan-Meier curves show.

• Hazard function is the derivative of the survivor function over time h(t)=dS(t)/dt

– instantaneous risk of event at time t

� Survivor and hazard functions can be converted into each other

� Survivor function, S(t) defines the probability of surviving longer than time t

• This is what the Kaplan-Meier curves show.

• Hazard function is the derivative of the survivor function over time h(t)=dS(t)/dt

– instantaneous risk of event at time t

� Survivor and hazard functions can be converted into each other

Relationship between Survivor Function and Hazard Function

Relationship between Survivor Function and Hazard Function

32

Page 33: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Log-rank test can be used to compare survival curves

� Compares ‘observed’ and ‘expected’ number of deaths in the two samples

� ‘Expected’ calculated by sharing the next death between the groups in proportion to numbers at risk in each group

� Assumption is no difference in risk factors between groups

� Test statistic is compared to χ2 distribution

� Log-rank test can be used to compare survival curves

� Compares ‘observed’ and ‘expected’ number of deaths in the two samples

� ‘Expected’ calculated by sharing the next death between the groups in proportion to numbers at risk in each group

� Assumption is no difference in risk factors between groups

� Test statistic is compared to χ2 distribution

Comparing Kaplan-Meier plots 1Comparing Kaplan-Meier plots 1

33

Page 34: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Hypothesis test (test of significance)

• H0: the curves are statistically the same

• H1: the curves are statistically different

� Can be generalized to more than two groups

� Then use judicious pair-wise testing

� Other non-parametric tests available

• Mantel-Cox, Mantel-Haenszel, Peto & Peto, Gehan, etc.

� Hypothesis test (test of significance)

• H0: the curves are statistically the same

• H1: the curves are statistically different

� Can be generalized to more than two groups

� Then use judicious pair-wise testing

� Other non-parametric tests available

• Mantel-Cox, Mantel-Haenszel, Peto & Peto, Gehan, etc.

Comparing Kaplan-Meier plots 2Comparing Kaplan-Meier plots 2

34

Page 35: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Multiple pair-wise comparisons produce cumulative Type I error – multiple comparison problem

� Instead, compare all curves at once

• Analogous to using ANOVA to compare more than two groups

• Then use judicious pair-wise testing

� Multiple pair-wise comparisons produce cumulative Type I error – multiple comparison problem

� Instead, compare all curves at once

• Analogous to using ANOVA to compare more than two groups

• Then use judicious pair-wise testing

Comparing multiple Kaplan-Meier plots Comparing multiple Kaplan-Meier plots

35

Page 36: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� What happens when you have several covariates that you believe contribute to survival?• e.g. Tumor grade at presentation, nodal metastasis,

histological type, family history, time to start of Rx

� Can use stratified K-M curves – for 2 or maybe 3 covariates

� A multivariate approach needed for many covariates• Think multivariate regression or logistic regression

rather than a Student’s t-test or the Odds ratio from a 2 x 2 table

� What happens when you have several covariates that you believe contribute to survival?• e.g. Tumor grade at presentation, nodal metastasis,

histological type, family history, time to start of Rx

� Can use stratified K-M curves – for 2 or maybe 3 covariates

� A multivariate approach needed for many covariates• Think multivariate regression or logistic regression

rather than a Student’s t-test or the Odds ratio from a 2 x 2 table

Limit of Kaplan-Meier plots Limit of Kaplan-Meier plots

36

Page 37: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Used to assess effect of multiple covariates on survival

� The the most commonly used multivariate survival method

• Relatively easy to implement in standard statistical software

• Parametric approaches are an alternative, but they require stronger assumptions about h(t).

� Used to assess effect of multiple covariates on survival

� The the most commonly used multivariate survival method

• Relatively easy to implement in standard statistical software

• Parametric approaches are an alternative, but they require stronger assumptions about h(t).

Multivariate methods:Cox proportional hazards model

Multivariate methods:Cox proportional hazards model

37

Page 38: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Conveniently separates baseline hazard function from covariates

• Baseline hazard function over time

–h(t) = ho(t)exp(B1X+Bo)

• Covariates are time independent

• B1 is used to calculate the hazard ratio, which is similar to the relative risk

� Nonparametric

� Quasi-likelihood function

� Conveniently separates baseline hazard function from covariates

• Baseline hazard function over time

–h(t) = ho(t)exp(B1X+Bo)

• Covariates are time independent

• B1 is used to calculate the hazard ratio, which is similar to the relative risk

� Nonparametric

� Quasi-likelihood function

Cox proportional hazards model 1 Cox proportional hazards model 1

38

Page 39: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Can handle both continuous and

categorical predictor variables

� Without knowing baseline hazard ho(t),

can still calculate coefficients for each

covariate, and therefore hazard ratio

� Assumes multiplicative risk — this is the proportional hazard assumption

� Can handle both continuous and

categorical predictor variables

� Without knowing baseline hazard ho(t),

can still calculate coefficients for each

covariate, and therefore hazard ratio

� Assumes multiplicative risk — this is the proportional hazard assumption

Cox proportional hazards model 2 Cox proportional hazards model 2

39

Page 40: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Does not accommodate variables that change over time

• Luckily most variables (e.g. gender, ethnicity, congenital condition) are constant.

� Baseline hazard function, ho(t), is never specified

• You can estimate ho(t) accurately if you need to estimate S(t).

� Does not accommodate variables that change over time

• Luckily most variables (e.g. gender, ethnicity, congenital condition) are constant.

� Baseline hazard function, ho(t), is never specified

• You can estimate ho(t) accurately if you need to estimate S(t).

Limitations of theCox proportional hazards model

Limitations of theCox proportional hazards model

40

Page 41: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Survival analyses quantifies time to a single, dichotomous event.

� Handles censored data well.

� Survival and hazard can be mathematically converted to each other.

� Kaplan-Meier survival curves can be compared mathematically and graphically.

� Cox proportional hazards models help distinguish individual contributions of covariates on survival, provided certain assumptions are met.

� Survival analyses quantifies time to a single, dichotomous event.

� Handles censored data well.

� Survival and hazard can be mathematically converted to each other.

� Kaplan-Meier survival curves can be compared mathematically and graphically.

� Cox proportional hazards models help distinguish individual contributions of covariates on survival, provided certain assumptions are met.

Summing up survival analysis Summing up survival analysis

41

Page 42: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

� Cancer is a heterogenous disease - even within a given site, stage and histological type. Little understood disease trends, large response variability, and unquantifiable physician & patient biases are all indications for carefully controlled RCTs.

� Careful attention to design is needed for success of such RCTs. Population, intervention, endpoints all need to be carefully defined and potential benefits and harms carefully quantified with defined limits of precision.

� Cancer is a heterogenous disease - even within a given site, stage and histological type. Little understood disease trends, large response variability, and unquantifiable physician & patient biases are all indications for carefully controlled RCTs.

� Careful attention to design is needed for success of such RCTs. Population, intervention, endpoints all need to be carefully defined and potential benefits and harms carefully quantified with defined limits of precision.

And finally ... And finally ...

42

Page 43: Statistics in Cancer Trialsaroi.org/aroi-cms/uploads/media/1583577008Dr.-Avijit-Hazra.pdf · Statistics is the science of data analysis. “If your experiment needs statistics, then

Statistics in Cancer TrialsStatistics in Cancer Trials

“… it is an error to argue in front of your data. You find yourself insensibly twisting them round to fit your theories.”

- Sherlock Holmes- Sherlock Holmes

Statistical principles, and thus statisticians, have a large role - but

certainly not the sole role - in the design, analysis and conduct of cancer trials.

Statistical principles, and thus statisticians, have a large role - but

certainly not the sole role - in the design, analysis and conduct of cancer trials.