Survival Analysis On Kidney Failure of Kidney Tranplant Patients

45
“SURVIVAL ANALYSIS ON KIDNEY FAILURE FOR KIDNEY TRANSPLANT PATIENTS” Summary The Accelerated Failure Time model presents a way to easily describe and interpret survival regression data. It approaches the data differently than the widely used and well described Cox proportional hazard model, by assuming proportional effect of the covariates on the log-failure time rather than on the hazard function. In this report, we present semiparametric methods (the Cox PH model) and parametric methods (AFT model) for analyzing survival data. We have the data set for 469 patients with kidney transplants along with the graft survival and failure time. Eight variates have been recognized which might have a relation with the survival experience. Introduction Survival analysis is a statistical method for data analysis where the outcome variable of interest is the time to the occurrence of an event. Hence, survival analysis is also referred to as "time to event analysis", which is applied in a number of applied fields such as medicine, public health, social science, and engineering. The Cox proportional hazards (PH) model is now the most widely used for the analysis of survival data in the presence of covariates or prognostic factors. This is the most popular model for survival analysis because of its simplicity, and not being based on any assumptions about the survival distribution. The model assumes that the underlying hazard rate is a function of the independent covariates, but no assumptions are made about the nature or shape of the hazard function. The accelerated failure time (AFT) model is another alternative method for the analysis of survival data. The AFT model assumes a certain parametric distribution for the failure times and that the effect of the covariates on the failure time is multiplicative. The appeal of the AFT model lies in the ease of interpreting the results, because the AFT models the effect of predictors and covariates directly on the survival time instead of through the hazard function.

Transcript of Survival Analysis On Kidney Failure of Kidney Tranplant Patients

“SURVIVAL ANALYSIS ON KIDNEY FAILURE FOR KIDNEY TRANSPLANT PATIENTS”

Summary

The Accelerated Failure Time model presents a way to easily describe and interpret survival regression data. It approaches the data differently than the widely used and well described Cox proportional hazard model, by assuming proportional effect of the covariates on the log-failure time rather than on the hazard function. In this report, we present semiparametric methods (the Cox PH model) and parametric methods (AFT model) for analyzing survival data. We have the data set for 469 patients with kidney transplants along with the graft survival and failure time. Eight variates have been recognized which might have a relation with the survival experience.

Introduction

Survival analysis is a statistical method for data analysis where the outcome variable of interest is the time to the occurrence of an event. Hence, survival analysis is also referred to as "time to event analysis", which is applied in a number of applied fields such as medicine, public health, social science, and engineering.

The Cox proportional hazards (PH) model is now the most widely used for the analysis of survival data in the presence of covariates or prognostic factors. This is the most popular model for survival analysis because of its simplicity, and not being based on any assumptions about the survival distribution. The model assumes that the underlying hazard rate is a function of the independent covariates, but no assumptions are made about the nature or shape of the hazard function.

The accelerated failure time (AFT) model is another alternative method for the analysis of survival data. The AFT model assumes a certain parametric distribution for the failure times and that the effect of the covariates on the failure time is multiplicative. The appeal of the AFT model lies in the ease of interpreting the results, because the AFT models the effect of predictors and covariates directly on the survival time instead of through the hazard function.

Purpose and Methods

The purpose of this report is to analyze the data using the Cox models and the AFT models. This will be studied by means of real dataset which is from a randomized data set for 469 patients with kidney transplants.

We start with the AFT models checking the AIC and BIC (goodness of fit tests) for each of the distributions including all the covariates. We then discuss briefly possible types of response and prognostic variables. We will select the covariate based on the backward selection procedure.The final model will be chosen based on this result.

Second part of the report consists of the Cox’s PH model. We are going to try to fit the data based on the Cox’s PH model based on different ties and compare with the parametric model exponential and Weibull. This will also be followed by selection of significant covariates and the final model will be chosen.

SAS software package was used and proc lifereg, proc phreg are used for estimating the results for AFT and Cox PH model.

Conclusion

We apply these methods to a randomized data of 495 kidney transplants. Our conclusion is that Age, Diabetes (DIBT) and ALG (an immune drug) are the most significant covariates, thus being the interacting variables which is possibly predictive of the outcome under study. The major goal of this report is also to support an argument for the consideration of the AFT model as an alternative to the PH model in the analysis of some survival data by means of this real dataset.

In conclusion, although the Cox proportional hazards model tends to be more popular in the literature, the AFT model should also be considered when planning a survival analysis. It should go without saying that the choice should be driven by the desired outcome or the fit to the data, and never by which gives a significant P value for the predictor of interest. The choice should be dictated only by the research hypothesis and by which assumptions of the model are valid for the data being analyzed.

Analysis,Procedure along with computation and output with interpretation

Data set

Following is the data for 469 patients with kidney transplants. The primary interest was graft survival, and time to graft failure was recorded in months (which was subject to right censoring). This study included measurements of many covariates that may be related to survival experience. Use both Cox's PH model and the accelerated failure time model to analyze the data and write a report.

The 10 covariates included are:

AGE: Age at transplant in years

SEX: 1=female, 0=male

DIALY: Duration of hemodialysis prior to transplant in days

DBT: Diabetes; 1=yes, 0=no

PTX: Number of prior transplants

BLOOD: Amount of blood transfusion, in blood units

MIS: Mismatch score

ALG: Use of ALG, an immune suppression drug; 1=yes, 0=no

MONTH: Duration time starting from transplant , in months

FAIL: status of the new kidney; 1=new kidney failed, 0=functioning

A. AFT Models - Under AFT models we measure the direct effect of the explanatory variables on the survival time instead of hazard, as we do in the PH model. This characteristic allows for an easier interpretation of the results because the parameters measure the effect of the correspondent covariate on the mean survival time. Currently, the AFT model is not commonly used for the analysis of clinical trial data, although it is fairly common in the field of manufacturing. Similar to the PH model, the AFT model describes the relationship between survival probabilities and a set of covariates

Procedure-

Using proc lifereg (SAS code) we perform goodness of fit tests for each distribution (Exponential, Weibull, Lognormal, Gamma and Log-logistic).This process is done by taking all the covariates into consideration.

SAS output-

Exponential-

Fit Statistics

-2 Log Likelihood1266.50

9

AIC (smaller is better)1284.50

9AICC (smaller is better)

1284.901

BIC (smaller is better)1321.86

4

Weibull-

Fit Statistics

-2 Log Likelihood1189.10

3

Fit Statistics

AIC (smaller is better)1209.10

3AICC (smaller is better)

1209.583

BIC (smaller is better)1250.60

9

Lognormal-

Fit Statistics

-2 Log Likelihood1171.51

5

AIC (smaller is better)1191.51

5AICC (smaller is better)

1191.996

BIC (smaller is better)1233.02

1

Gamma-

Fit Statistics

-2 Log Likelihood1166.62

5

AIC (smaller is better)1188.62

5AICC (smaller is better)

1189.202

BIC (smaller is better)1234.28

1

Loglogistic-

Fit Statistics

-2 Log Likelihood1182.57

8

AIC (smaller is better)1202.57

8AICC (smaller is better)

1203.059

BIC (smaller is better)1244.08

5

Interpretation-

From the above tables it is quite evident that gamma distribution being the chosen one with the lowest AIC amongst the others.

Choice of Covariates-

In this section we will be selecting the most significant covariates among the eight covariates that are given in the data set. Covariates are selected by using the backward selection procedure.

The backward selection procedure is an elimination process in which all the covariates are included in the model at the beginning and are removed one by one according to a significance criterion. The specific parameters that define the parametric model and the coefficients of all the covariates are estimated first. Then the Wald test is used to examine each covariate.

We delete the predictor with the highest p-value and re run the model deleting the predictors with highest p-value until all of them satisfies our constraints. The predictors left will be our significant variable.

To be thorough with our selection procedure of the covariates we have done the backward selection procedure for each distributions.

Age, Diabetes (DBT) and ALG(an immune suppression drug) are the three significant covariates.

Following tables shows in details the selection process of the covariates for each and every distribution.

SAS Tables-

Exponential distribution-

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.4088 0.3362 4.7500 6.0676 258.90 <.0001AGE 1 -0.0224 0.0062 -0.0345 -0.0102 13.07 0.0003SEX 1 0.0393 0.1494 -0.2535 0.3321 0.07 0.7924DIALY 1 -0.0002 0.0002 -0.0005 0.0001 1.31 0.2516DBT 1 -0.5263 0.1897 -0.8981 -0.1544 7.70 0.0055PTX 1 0.0234 0.2159 -0.3998 0.4466 0.01 0.9137BLOOD 1 -0.0018 0.0049 -0.0113 0.0078 0.13 0.7162MIS 1 -0.1278 0.0906 -0.3053 0.0497 1.99 0.1582

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqALG 1 0.5028 0.1688 0.1720 0.8335 8.87 0.0029Scale 0 1.0000 0.0000 1.0000 1.0000    Weibull Shape 0 1.0000 0.0000 1.0000 1.0000    

We reject PTX

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.4182 0.3252 4.7807 6.0556 277.55 <.0001AGE 1 -0.0224 0.0061 -0.0345 -0.0104 13.30 0.0003SEX 1 0.0401 0.1492 -0.2523 0.3325 0.07 0.7880DIALY 1 -0.0002 0.0002 -0.0005 0.0001 1.35 0.2444DBT 1 -0.5270 0.1896 -0.8986 -0.1554 7.73 0.0054BLOOD 1 -0.0016 0.0047 -0.0109 0.0076 0.12 0.7287MIS 1 -0.1291 0.0897 -0.3050 0.0467 2.07 0.1500ALG 1 0.4996 0.1662 0.1738 0.8254 9.03 0.0027Scale 0 1.0000 0.0000 1.0000 1.0000    Weibull Shape 0 1.0000 0.0000 1.0000 1.0000    

We reject sex

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.4301 0.3225 4.7981 6.0621 283.56 <.0001AGE 1 -0.0223 0.0061 -0.0344 -0.0103 13.24 0.0003DIALY 1 -0.0002 0.0002 -0.0005 0.0001 1.34 0.2471DBT 1 -0.5277 0.1896 -0.8992 -0.1561 7.75 0.0054BLOOD 1 -0.0014 0.0047 -0.0106 0.0077 0.09 0.7596

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqMIS 1 -0.1310 0.0894 -0.3063 0.0442 2.15 0.1428ALG 1 0.5021 0.1659 0.1770 0.8272 9.17 0.0025Scale 0 1.0000 0.0000 1.0000 1.0000    Weibull Shape 0 1.0000 0.0000 1.0000 1.0000    

We reject blood

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.4090 0.3147 4.7922 6.0259 295.39 <.0001AGE 1 -0.0221 0.0061 -0.0340 -0.0102 13.20 0.0003DIALY 1 -0.0002 0.0002 -0.0005 0.0001 1.80 0.1792DBT 1 -0.5232 0.1890 -0.8935 -0.1528 7.67 0.0056MIS 1 -0.1303 0.0893 -0.3052 0.0447 2.13 0.1444ALG 1 0.4991 0.1655 0.1747 0.8235 9.09 0.0026Scale 0 1.0000 0.0000 1.0000 1.0000    Weibull Shape 0 1.0000 0.0000 1.0000 1.0000    

We reject DIALY

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.3699 0.3150 4.7524 5.9874 290.52 <.0001AGE 1 -0.0236 0.0060 -0.0353 -0.0119 15.59 <.0001DBT 1 -0.5050 0.1884 -0.8742 -0.1357 7.19 0.0074MIS 1 -0.1357 0.0904 -0.3129 0.0416 2.25 0.1336ALG 1 0.5012 0.1655 0.1768 0.8256 9.17 0.0025Scale 0 1.0000 0.0000 1.0000 1.0000    Weibull 0 1.0000 0.0000 1.0000 1.0000    

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqShape

We reject MIS

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.1775 0.2850 4.6190 5.7361 330.11 <.0001AGE 1 -0.0234 0.0060 -0.0351 -0.0117 15.40 <.0001DBT 1 -0.4895 0.1883 -0.8585 -0.1204 6.76 0.0093ALG 1 0.4760 0.1648 0.1530 0.7989 8.34 0.0039Scale 0 1.0000 0.0000 1.0000 1.0000    Weibull Shape 0 1.0000 0.0000 1.0000 1.0000    

Interpretation-The above is the final table showing all the covariates that are significant and will be used in fitting the model.

AGE, DBT and ALG are chosen.

Weibull distribution-

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.7554 0.5405 4.6961 6.8147 113.39 <.0001AGE 1 -0.0287 0.0099 -0.0482 -0.0093 8.37 0.0038SEX 1 0.0449 0.2424 -0.4301 0.5200 0.03 0.8529

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqDIALY 1 -0.0002 0.0003 -0.0007 0.0003 0.50 0.4814DBT 1 -0.5754 0.3101 -1.1831 0.0323 3.44 0.0635PTX 1 0.1449 0.3570 -0.5548 0.8447 0.16 0.6848BLOOD 1 -0.0073 0.0081 -0.0232 0.0085 0.82 0.3663MIS 1 -0.1537 0.1495 -0.4467 0.1393 1.06 0.3038ALG 1 0.9736 0.2852 0.4146 1.5327 11.65 0.0006Scale 1 1.6326 0.1013 1.4457 1.8436    Weibull Shape 1 0.6125 0.0380 0.5424 0.6917    

We reject sex

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.7695 0.5356 4.7198 6.8193 116.05 <.0001AGE 1 -0.0287 0.0099 -0.0481 -0.0092 8.36 0.0038DIALY 1 -0.0002 0.0003 -0.0007 0.0003 0.48 0.4873DBT 1 -0.5760 0.3101 -1.1838 0.0317 3.45 0.0632PTX 1 0.1474 0.3566 -0.5515 0.8463 0.17 0.6793BLOOD 1 -0.0071 0.0080 -0.0228 0.0086 0.79 0.3750MIS 1 -0.1551 0.1493 -0.4477 0.1375 1.08 0.2988ALG 1 0.9761 0.2849 0.4177 1.5345 11.74 0.0006Scale 1 1.6327 0.1013 1.4458 1.8438    Weibull Shape 1 0.6125 0.0380 0.5424 0.6917    

We reject PTX-

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.8259 0.5200 4.8068 6.8450 125.54 <.0001AGE 1 -0.0291 0.0099 -0.0485 -0.0097 8.63 0.0033DIALY 1 -0.0002 0.0003 -0.0007 0.0003 0.54 0.4626DBT 1 -0.5814 0.3096 -1.1883 0.0255 3.53 0.0604BLOOD 1 -0.0061 0.0077 -0.0213 0.0090 0.63 0.4265MIS 1 -0.1644 0.1476 -0.4537 0.1249 1.24 0.2653ALG 1 0.9553 0.2801 0.4064 1.5042 11.64 0.0006Scale 1 1.6315 0.1011 1.4448 1.8422    Weibull Shape 1 0.6129 0.0380 0.5428 0.6921    

We reject DIALY-

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.8205 0.5225 4.7964 6.8446 124.09 <.0001AGE 1 -0.0307 0.0097 -0.0496 -0.0118 10.12 0.0015DBT 1 -0.5696 0.3097 -1.1765 0.0373 3.38 0.0658BLOOD 1 -0.0078 0.0074 -0.0223 0.0067 1.11 0.2918MIS 1 -0.1692 0.1488 -0.4608 0.1224 1.29 0.2554ALG 1 0.9634 0.2805 0.4137 1.5132 11.80 0.0006Scale 1 1.6346 0.1013 1.4477 1.8457    Weibull Shape 1 0.6118 0.0379 0.5418 0.6908    

We reject Blood-

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.6919 0.5056 4.7010 6.6828 126.75 <.0001AGE 1 -0.0303 0.0096 -0.0491 -0.0115 9.93 0.0016DBT 1 -0.5447 0.3081 -1.1485 0.0592 3.13 0.0771MIS 1 -0.1651 0.1486 -0.4563 0.1261 1.24 0.2664ALG 1 0.9531 0.2796 0.4050 1.5011 11.62 0.0007Scale 1 1.6321 0.1011 1.4455 1.8428    Weibull Shape 1 0.6127 0.0380 0.5426 0.6918    

We reject MIS-

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.4626 0.4582 4.5647 6.3606 142.16 <.0001AGE 1 -0.0303 0.0096 -0.0492 -0.0114 9.91 0.0016DBT 1 -0.5236 0.3086 -1.1285 0.0813 2.88 0.0898ALG 1 0.9318 0.2798 0.3835 1.4801 11.09 0.0009Scale 1 1.6373 0.1015 1.4500 1.8488    Weibull Shape 1 0.6107 0.0379 0.5409 0.6896    

Interpretation-The above is the final table showing all the covariates that are significant and will be used in fitting the model.

AGE, DBT and ALG are chosen.

Log Normal Distribution-

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.7366 0.5776 3.6046 5.8686 67.26 <.0001AGE 1 -0.0269 0.0109 -0.0482 -0.0057 6.15 0.0131SEX 1 0.0886 0.2639 -0.4288 0.6059 0.11 0.7372DIALY 1 -0.0001 0.0003 -0.0006 0.0005 0.06 0.8096DBT 1 -0.6132 0.3323 -1.2645 0.0381 3.41 0.0650PTX 1 0.0198 0.3720 -0.7092 0.7489 0.00 0.9574BLOOD 1 -0.0064 0.0098 -0.0256 0.0127 0.44 0.5088MIS 1 -0.1226 0.1617 -0.4396 0.1943 0.58 0.4482ALG 1 1.2503 0.3281 0.6073 1.8933 14.52 0.0001

We reject PTX

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.7419 0.5689 3.6268 5.8570 69.47 <.0001AGE 1 -0.0270 0.0108 -0.0482 -0.0057 6.19 0.0128SEX 1 0.0883 0.2638 -0.4289 0.6054 0.11 0.7380DIALY 1 -0.0001 0.0003 -0.0006 0.0005 0.06 0.8117DBT 1 -0.6139 0.3320 -1.2646 0.0368 3.42 0.0644BLOOD 1 -0.0063 0.0093 -0.0246 0.0120 0.46 0.4995MIS 1 -0.1239 0.1598 -0.4372 0.1893 0.60 0.4380ALG 1 1.2484 0.3260 0.6094 1.8874 14.66 0.0001Scale 1 2.3609 0.1312 2.1173 2.6327    

We reject Dialy

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.7419 0.5691 3.6265 5.8574 69.42 <.0001AGE 1 -0.0276 0.0105 -0.0482 -0.0071 6.94 0.0084SEX 1 0.0834 0.2631 -0.4322 0.5990 0.10 0.7513DBT 1 -0.6098 0.3315 -1.2596 0.0400 3.38 0.0659BLOOD 1 -0.0067 0.0092 -0.0247 0.0113 0.54 0.4644MIS 1 -0.1251 0.1598 -0.4383 0.1881 0.61 0.4338ALG 1 1.2490 0.3260 0.6099 1.8880 14.67 0.0001Scale 1 2.3611 0.1312 2.1174 2.6328    

We reject sex

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.7819 0.5555 3.6931 5.8706 74.10 <.0001AGE 1 -0.0279 0.0105 -0.0484 -0.0073 7.08 0.0078DBT 1 -0.6148 0.3312 -1.2639 0.0343 3.45 0.0634BLOOD 1 -0.0063 0.0091 -0.0241 0.0115 0.48 0.4882MIS 1 -0.1256 0.1598 -0.4388 0.1876 0.62 0.4319ALG 1 1.2508 0.3261 0.6116 1.8899 14.71 0.0001Scale 1 2.3617 0.1313 2.1179 2.6335    

We reject blood

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.6899 0.5380 3.6355 5.7443 76.00 <.0001AGE 1 -0.0279 0.0105 -0.0483 -0.0074 7.10 0.0077DBT 1 -0.5998 0.3300 -1.2466 0.0470 3.30 0.0691MIS 1 -0.1216 0.1596 -0.4343 0.1912 0.58 0.4461ALG 1 1.2481 0.3257 0.6096 1.8865 14.68 0.0001Scale 1 2.3597 0.1311 2.1163 2.6311    

We reject MIS

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.5301 0.4947 3.5604 5.4998 83.84 <.0001AGE 1 -0.0284 0.0105 -0.0489 -0.0079 7.38 0.0066DBT 1 -0.5833 0.3298 -1.2297 0.0632 3.13 0.0770ALG 1 1.2441 0.3262 0.6048 1.8835 14.55 0.0001Scale 1 2.3636 0.1313 2.1198 2.6354    

Interpretation-The above is the final table showing all the covariates that are significant and will be used in fitting the model.

AGE,DBT and ALG are chosen.

Gamma distribution-

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 3.2686 0.6703 1.9549 4.5824 23.78 <.0001AGE 1 -0.0186 0.0113 -0.0409 0.0036 2.69 0.1007SEX 1 0.0996 0.2613 -0.4126 0.6118 0.15 0.7031DIALY 1 0.0001 0.0003 -0.0005 0.0006 0.05 0.8292DBT 1 -0.5701 0.3240 -1.2052 0.0650 3.10 0.0785PTX 1 -0.1666 0.3615 -0.8752 0.5419 0.21 0.6448BLOOD 1 -0.0008 0.0102 -0.0208 0.0193 0.01 0.9408MIS 1 -0.0631 0.1533 -0.3636 0.2374 0.17 0.6805ALG 1 1.1667 0.3428 0.4949 1.8386 11.58 0.0007

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqScale 1 2.6270 0.1461 2.3556 2.9296    Shape 1 -1.0848 0.2450 -1.5649 -0.6046    

We reject Blood

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 3.2418 0.6617 1.9449 4.5387 24.00 <.0001AGE 1 -0.0185 0.0114 -0.0407 0.0038 2.65 0.1037SEX 1 0.0971 0.2585 -0.4096 0.6038 0.14 0.7072DIALY 1 0.0001 0.0003 -0.0005 0.0006 0.05 0.8286DBT 1 -0.5682 0.3235 -1.2022 0.0658 3.09 0.0790PTX 1 -0.1764 0.3420 -0.8467 0.4939 0.27 0.6060MIS 1 -0.0625 0.1531 -0.3626 0.2375 0.17 0.6830ALG 1 1.1628 0.3427 0.4912 1.8344 11.52 0.0007Scale 1 2.6256 0.1460 2.3544 2.9280    Shape 1 -1.0998 0.2490 -1.5878 -0.6119    

We reject dialy

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 3.2802 0.6535 1.9993 4.5611 25.19 <.0001AGE 1 -0.0182 0.0110 -0.0397 0.0033 2.74 0.0976SEX 1 0.1028 0.2572 -0.4014 0.6069 0.16 0.6895DBT 1 -0.5736 0.3238 -1.2082 0.0609 3.14 0.0764PTX 1 -0.1584 0.3342 -0.8133 0.4966 0.22 0.6355MIS 1 -0.0618 0.1533 -0.3622 0.2386 0.16 0.6866ALG 1 1.1756 0.3406 0.5079 1.8432 11.91 0.0006Scale 1 2.6282 0.1455 2.3580 2.9293    Shape 1 -1.0678 0.2335 -1.5255 -0.6101    

We reject sex

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 3.3702 0.6187 2.1576 4.5828 29.68 <.0001AGE 1 -0.0190 0.0108 -0.0402 0.0022 3.10 0.0785

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqDBT 1 -0.5885 0.3222 -1.2200 0.0431 3.34 0.0678PTX 1 -0.1638 0.3343 -0.8191 0.4915 0.24 0.6242MIS 1 -0.0595 0.1536 -0.3606 0.2416 0.15 0.6986ALG 1 1.1823 0.3404 0.5152 1.8494 12.07 0.0005Scale 1 2.6304 0.1453 2.3604 2.9311    Shape 1 -1.0502 0.2174 -1.4763 -0.6242    

We reject MIS

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 3.2396 0.5946 2.0742 4.4050 29.68 <.0001AGE 1 -0.0191 0.0108 -0.0403 0.0022 3.10 0.0785DBT 1 -0.5815 0.3212 -1.2111 0.0481 3.28 0.0702PTX 1 -0.1444 0.3281 -0.7875 0.4986 0.19 0.6598ALG 1 1.1802 0.3408 0.5123 1.8481 11.99 0.0005Scale 1 2.6291 0.1460 2.3580 2.9313    Shape 1 -1.0905 0.2471 -1.5748 -0.6062    

We reject PTX

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 3.2692 0.5712 2.1498 4.3886 32.76 <.0001AGE 1 -0.0197 0.0107 -0.0407 0.0014 3.36 0.0669DBT 1 -0.5758 0.3218 -1.2065 0.0549 3.20 0.0736ALG 1 1.1947 0.3403 0.5276 1.8618 12.32 0.0004Scale 1 2.6330 0.1455 2.3628 2.9341    Shape 1 -1.0523 0.2170 -1.4777 -0.6269    

Interpretation-The above is the final table showing all the covariates that are significant and will be used in fitting the model.

AGE,DBT and ALG are chosen.

Log logistic distribution-

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.8836 0.5809 3.7451 6.0221 70.68 <.0001

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqAGE 1 -0.0300 0.0108 -0.0512 -0.0089 7.76 0.0053SEX 1 0.1009 0.2663 -0.4210 0.6227 0.14 0.7048DIALY 1 -0.0001 0.0003 -0.0006 0.0005 0.11 0.7400DBT 1 -0.6366 0.3362 -1.2955 0.0224 3.58 0.0583PTX 1 0.0587 0.3906 -0.7069 0.8242 0.02 0.8806BLOOD 1 -0.0085 0.0094 -0.0269 0.0100 0.81 0.3694MIS 1 -0.1580 0.1652 -0.4818 0.1657 0.92 0.3387ALG 1 1.3047 0.3317 0.6545 1.9549 15.47 <.0001Scale 1 1.3746 0.0832 1.2209 1.5476    

We reject PTX

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.9011 0.5690 3.7860 6.0163 74.20 <.0001AGE 1 -0.0302 0.0107 -0.0512 -0.0092 7.92 0.0049SEX 1 0.1018 0.2662 -0.4199 0.6234 0.15 0.7022DIALY 1 -0.0001 0.0003 -0.0006 0.0005 0.11 0.7419DBT 1 -0.6387 0.3359 -1.2970 0.0196 3.62 0.0572BLOOD 1 -0.0080 0.0090 -0.0257 0.0096 0.80 0.3719MIS 1 -0.1611 0.1638 -0.4822 0.1599 0.97 0.3253ALG 1 1.2983 0.3289 0.6537 1.9428 15.58 <.0001Scale 1 1.3744 0.0831 1.2207 1.5473    

We reject Dialy

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.8988 0.5693 3.7829 6.0147 74.04 <.0001AGE 1 -0.0311 0.0104 -0.0514 -0.0108 8.98 0.0027SEX 1 0.0930 0.2649 -0.4262 0.6123 0.12 0.7255DBT 1 -0.6319 0.3353 -1.2890 0.0253 3.55 0.0595BLOOD 1 -0.0085 0.0089 -0.0259 0.0088 0.93 0.3357MIS 1 -0.1635 0.1638 -0.4846 0.1576 1.00 0.3183ALG 1 1.3038 0.3284 0.6602 1.9475 15.76 <.0001Scale 1 1.3748 0.0831 1.2212 1.5478    

We reject sex

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.9432 0.5556 3.8542 6.0321 79.16 <.0001AGE 1 -0.0313 0.0104 -0.0516 -0.0109 9.08 0.0026DBT 1 -0.6363 0.3351 -1.2931 0.0206 3.60 0.0576BLOOD 1 -0.0080 0.0088 -0.0252 0.0091 0.84 0.3582MIS 1 -0.1657 0.1635 -0.4862 0.1548 1.03 0.3110ALG 1 1.3044 0.3285 0.6606 1.9483 15.77 <.0001Scale 1 1.3752 0.0832 1.2215 1.5482    

We reject blood

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.8255 0.5407 3.7658 5.8852 79.66 <.0001AGE 1 -0.0313 0.0104 -0.0516 -0.0109 9.09 0.0026DBT 1 -0.6190 0.3343 -1.2743 0.0362 3.43 0.0641MIS 1 -0.1598 0.1632 -0.4797 0.1601 0.96 0.3275ALG 1 1.2998 0.3283 0.6563 1.9433 15.67 <.0001Scale 1 1.3751 0.0831 1.2215 1.5480    

We reject MIS

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 4.6074 0.4917 3.6437 5.5711 87.81 <.0001AGE 1 -0.0318 0.0104 -0.0521 -0.0114 9.35 0.0022DBT 1 -0.5949 0.3343 -1.2501 0.0604 3.17 0.0752ALG 1 1.2934 0.3294 0.6477 1.9390 15.41 <.0001Scale 1 1.3795 0.0833 1.2255 1.5528    

Interpretation-The above is the final table showing all the covariates that are significant and will be used in fitting the model.

AGE,DBT and ALG are chosen.

Goodness of fit test for the selected covariates and selction of model

In this section we perform the goodness of fit test once again for the three covariates selected in the above section (AGE,DBT and ALG).

We select the Gamma distribution as our appropriate model with the lowest AIC value compared to the others.

Following tables shows the detailed AIC and BIC values for each and every distributions.

SAS output-

Exponential distribution-

Fit Statistics

-2 Log Likelihood1270.64

6

AIC (smaller is better)1278.64

6AICC (smaller is better)

1278.732

BIC (smaller is better)1295.24

9

Weibull distribution-

Fit Statistics

-2 Log Likelihood1192.09

8

AIC (smaller is better)1202.09

8AICC (smaller is better)

1202.227

BIC (smaller is better)1222.85

1

Lognormal distribution-

Fit Statistics

-2 Log Likelihood1172.73

5

AIC (smaller is better)1182.73

5AICC (smaller is better)

1182.865

BIC (smaller is better)1203.48

8

Gamma distribution-

Fit Statistics

-2 Log Likelihood1167.17

1

AIC (smaller is better)1179.17

1AICC (smaller is better)

1179.353

BIC (smaller is better)1204.07

5

Log logistic distribution-

Fit Statistics

-2 Log Likelihood1184.61

5

AIC (smaller is better)1194.61

5AICC (smaller is better)

1194.745

BIC (smaller is better)1215.36

8

Interpretation-Gamma distribution is selected with the lowest AIC value of 1179.171 compared to the others.

Fitting the appropriate model

Gamma AFT model has been selected as the appropriate model for the given data set.

For the gamma distribution we have the following table-

Analysis of Maximum Likelihood Parameter Estimates

ParameterDFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 3.2692 0.5712 2.1498 4.3886 32.76 <.0001AGE 1 -0.0197 0.0107 -0.0407 0.0014 3.36 0.0669DBT 1 -0.5758 0.3218 -1.2065 0.0549 3.20 0.0736ALG 1 1.1947 0.3403 0.5276 1.8618 12.32 0.0004Scale 1 2.6330 0.1455 2.3628 2.9341    Shape 1 -1.0523 0.2170 -1.4777 -0.6269    

Hence if T be defined as the survival time we can write the appropriate model as-

Log (Ti)= 3.2692 - .0197(AGE) - .5758(DBT) + 1.1947(ALG) + 2.6330(Error)

B. Cox’s PH Model

The non-parametric method does not control for covariates and it requires categorical predictors. When we have several prognostic variables, we must use multivariate approaches.But we cannot use multiple linear regression or logistic regression because they cannot deal with censored observations. We need another method to model survival data with the presence of censoring. One very popular model in survival data is the Cox proportionalhazards model.

Procedure-

We are going to use proc phreg to estimate the regression coefficient (parameter estimate) based on different methods for ties on the given data.Then we are going to compare those results with the two parametric models, exponential and Weibull for further clarification.

Conclusion-From the test results we can say that the signs of regression coefficient of Age, Duration of hemodialysis (DIALY), Diabetes (DBT), Blood are all positive and thus have a higher hazard risk in the kidney transplants.The coefficients of SEX, PTX (number of prior transplants) and ALG (an immune suppression drug) are all negative indicating low hazard risk in the kidney transplantation.

It is also to be noted that if we compare these value with that of exponential and Weibull they give the opposite results. According to Weibull and exponential-SEX, PTX and ALG have higher hazard risk whereas the Age,DIALY,DBT,Blood and MIS have low hazard risk.

The following SAS tabular values explains our outcome in details.

Breslow-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01732 0.00611 8.0468 0.0046 1.017SEX 1 -0.02983 0.14882 0.0402 0.8411 0.971

DIALY 10.000093

10.000155

6 0.3581 0.5495 1.000DBT 1 0.35004 0.19236 3.3115 0.0688 1.419PTX 1 -0.07341 0.21763 0.1138 0.7359 0.929BLOOD 1 0.00372 0.00497 0.5592 0.4546 1.004MIS 1 0.08376 0.09196 0.8295 0.3624 1.087ALG 1 -0.60697 0.17033 12.6983 0.0004 0.545

Discrete-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01776 0.00619 8.2292 0.0041 1.018SEX 1 -0.03055 0.15076 0.0411 0.8394 0.970

DIALY 10.000096

20.000158

3 0.3688 0.5437 1.000DBT 1 0.35882 0.19528 3.3763 0.0661 1.432PTX 1 -0.07548 0.22026 0.1174 0.7318 0.927BLOOD 1 0.00378 0.00503 0.5652 0.4522 1.004MIS 1 0.08623 0.09312 0.8576 0.3544 1.090ALG 1 -0.62313 0.17320 12.9439 0.0003 0.536

Efron-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01741 0.00610 8.1433 0.0043 1.018SEX 1 -0.03178 0.14888 0.0456 0.8310 0.969

DIALY 10.000090

30.000155

7 0.3364 0.5619 1.000

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioDBT 1 0.35510 0.19234 3.4086 0.0649 1.426PTX 1 -0.07098 0.21839 0.1056 0.7452 0.931BLOOD 1 0.00373 0.00497 0.5617 0.4536 1.004MIS 1 0.08553 0.09205 0.8633 0.3528 1.089ALG 1 -0.61692 0.17051 13.0902 0.0003 0.540

Exact-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01741 0.00610 8.1440 0.0043 1.018SEX 1 -0.03178 0.14889 0.0456 0.8310 0.969

DIALY 10.000090

30.000155

7 0.3362 0.5620 1.000DBT 1 0.35513 0.19235 3.4086 0.0649 1.426PTX 1 -0.07099 0.21842 0.1056 0.7452 0.931BLOOD 1 0.00373 0.00497 0.5617 0.4536 1.004MIS 1 0.08554 0.09205 0.8636 0.3527 1.089ALG 1 -0.61703 0.17054 13.0913 0.0003 0.540

Parameter models-

Exponential-

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.4088 0.3362 4.7500 6.0676 258.90 <.0001AGE 1 -0.0224 0.0062 -0.0345 -0.0102 13.07 0.0003SEX 1 0.0393 0.1494 -0.2535 0.3321 0.07 0.7924DIALY 1 -0.0002 0.0002 -0.0005 0.0001 1.31 0.2516DBT 1 -0.5263 0.1897 -0.8981 -0.1544 7.70 0.0055PTX 1 0.0234 0.2159 -0.3998 0.4466 0.01 0.9137BLOOD 1 -0.0018 0.0049 -0.0113 0.0078 0.13 0.7162MIS 1 -0.1278 0.0906 -0.3053 0.0497 1.99 0.1582ALG 1 0.5028 0.1688 0.1720 0.8335 8.87 0.0029Scale 0 1.0000 0.0000 1.0000 1.0000    Weibull Shape 0 1.0000 0.0000 1.0000 1.0000    

Weibull-

Analysis of Maximum Likelihood Parameter Estimates

Parameter DFEstimateStandard

Error95% Confidence LimitsChi-SquarePr > ChiSqIntercept 1 5.7554 0.5405 4.6961 6.8147 113.39 <.0001AGE 1 -0.0287 0.0099 -0.0482 -0.0093 8.37 0.0038SEX 1 0.0449 0.2424 -0.4301 0.5200 0.03 0.8529DIALY 1 -0.0002 0.0003 -0.0007 0.0003 0.50 0.4814DBT 1 -0.5754 0.3101 -1.1831 0.0323 3.44 0.0635PTX 1 0.1449 0.3570 -0.5548 0.8447 0.16 0.6848BLOOD 1 -0.0073 0.0081 -0.0232 0.0085 0.82 0.3663MIS 1 -0.1537 0.1495 -0.4467 0.1393 1.06 0.3038ALG 1 0.9736 0.2852 0.4146 1.5327 11.65 0.0006Scale 1 1.6326 0.1013 1.4457 1.8436    Weibull Shape 1 0.6125 0.0380 0.5424 0.6917    

Choice of Covariates

Like we did in the AFT model in this section we will be selecting the significant covariates only, using Breslow’s method (default procedure) for ties, the backward selection method, and the SAS proc phreg.

Conclusion- AGE, DBT and ALG are the three covariates that are significant from our test results.

Following SAS tables give us the detailed process of the backward selection procedure.

Using Breslow approximation of ties and backward selection-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01732 0.00611 8.0468 0.0046 1.017SEX 1 -0.02983 0.14882 0.0402 0.8411 0.971

DIALY 10.000093

10.000155

6 0.3581 0.5495 1.000DBT 1 0.35004 0.19236 3.3115 0.0688 1.419PTX 1 -0.07341 0.21763 0.1138 0.7359 0.929BLOOD 1 0.00372 0.00497 0.5592 0.4546 1.004MIS 1 0.08376 0.09196 0.8295 0.3624 1.087ALG 1 -0.60697 0.17033 12.6983 0.0004 0.545

We reject sex-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01731 0.00611 8.0406 0.0046 1.017

DIALY 10.000091

00.000155

1 0.3441 0.5574 1.000DBT 1 0.35031 0.19234 3.3171 0.0686 1.420PTX 1 -0.07537 0.21728 0.1203 0.7287 0.927BLOOD 1 0.00359 0.00493 0.5305 0.4664 1.004MIS 1 0.08476 0.09181 0.8524 0.3559 1.088ALG 1 -0.60890 0.17001 12.8275 0.0003 0.544

We reject PTX-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01753 0.00609 8.2854 0.0040 1.018

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

Ratio

DIALY 10.000095

00.000154

2 0.3799 0.5376 1.000DBT 1 0.35394 0.19211 3.3945 0.0654 1.425BLOOD 1 0.00309 0.00475 0.4232 0.5154 1.003MIS 1 0.08957 0.09078 0.9735 0.3238 1.094ALG 1 -0.59912 0.16774 12.7568 0.0004 0.549

We reject Dialy-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01835 0.00593 9.5785 0.0020 1.019DBT 1 0.34675 0.19167 3.2728 0.0704 1.414BLOOD 1 0.00393 0.00454 0.7474 0.3873 1.004MIS 1 0.09187 0.09122 1.0142 0.3139 1.096ALG 1 -0.60177 0.16772 12.8728 0.0003 0.548

We reject blood-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01824 0.00592 9.4984 0.0021 1.018DBT 1 0.33636 0.19123 3.0940 0.0786 1.400MIS 1 0.09052 0.09121 0.9848 0.3210 1.095ALG 1 -0.59750 0.16753 12.7202 0.0004 0.550

We reject MIS-

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01827 0.00591 9.5362 0.0020 1.018DBT 1 0.32304 0.19083 2.8656 0.0905 1.381ALG 1 -0.58475 0.16708 12.2492 0.0005 0.557

This is the table for final model with significant covariates.

Fitting of the Model

Analysis of Maximum Likelihood Estimates

ParameterDFParameter

EstimateStandard

ErrorChi-SquarePr > ChiSqHazard

RatioAGE 1 0.01827 0.00591 9.5362 0.0020 1.018DBT 1 0.32304 0.19083 2.8656 0.0905 1.381ALG 1 -0.58475 0.16708 12.2492 0.0005 0.557

Form the above table we can get our significant covariates and hence we can construct our final model

The final model with significant (p<.10) covariates is-

Log[h(t)/ho(t)]=.01827 AGE + .32304 DBT - .58475 ALG

Interpretation- The positive sign of the regression coefficient of AGE and DBT implies high hazard risk rate in kidney transplant whereas the negative coefficient of ALG drug implies low hazard risk rate.