Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… ·...

33
Risk Stra*fica*on for Health‐Care Associated C. diff Jenna Wiens, John V. Gu?ag and Eric Horvitz* Massachuse?s Ins*tute of Technology * MicrosoI Research Learning Evolving Pa*ent Risk Processes for C. diff Coloniza*on 1

Transcript of Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… ·...

Page 1: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

RiskStra*fica*onforHealth‐CareAssociatedC.diff

JennaWiens,JohnV.Gu?agandEricHorvitz*Massachuse?sIns*tuteofTechnology

*MicrosoIResearch

LearningEvolvingPa*entRiskProcessesforC.diffColoniza*on

1

Page 2: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Clostridiumdifficile(C.diff)

•  Bacteriatakesoverthegutwhennormalfloragetswipedout

•  Transmi?edthroughthemouth•  Causesseverediarrhea,intes*naldiseases•  Treatment:metronidazole,oralvancomycin

•  20%ofcasesrelapsewithin60‐days

Source:h?p://www.helladelicious.com/wp‐content/uploads/2012/05/Gut‐Flora‐Bacteria.jpgh?p://newsimg.bbc.co.uk/media/images/41223000/jpg/_41223637_clostridium203spl.jpg

(PepinJetal.,2005)

2

Page 3: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Prevalence

•  Hospital‐acquired:178,000/year

•  Onparwithnumber

ofnewcasesofinvasivebreastcancerintheUSeachyear

(McDonaldetal.,2006)

(CDC,2011)(AmericanCancerSociety,2009)

3

Page 4: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Represen*ngandreasoningabouttemporalprocessespromisestoenhancetheaccuracyofinferencesaboutrisk.

PATIENT RISK

•  Collectedatthe*meofadmission

•  e.g.,admissioncomplaint,previousadmissions,homemeds

•  Changesduringthe

hospitaliza*on•  e.g.,currentmeds,current

procedures,currentloca8on,hospitalcondi8ons

RiskFactors

4

Page 5: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

TypicalApproachinClinicalLiterature

Time(days)

IndexEvent

Es*m

ated

Risk

Riskbasedonpa*ent’sstateat*meofadmission

Riskbasedonpa*ent’sstatexdaysbeforeindexevent

(Dubberkeetal.,2011)

(Tanneretal.,2009)

5

Page 6: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

OurApproach

Time(days)

IndexEvent

P(IndexEvent)

EvolvingRiskProfile

Es*m

ated

Risk

6

Page 7: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Es*m

ated

risk High

Low

High

Low

RiskProcessesHypothesis:extrac*ngandanalyzingevolvingpa*entriskcanleadtoamoreaccuratemodelforpredic*nginfec*ons

RiskProcess:describestheevolu*onofriskoverthecourseofahospitaladmission

Posi*vePa*ents

7

Page 8: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

InferringRiskProcesses

•  Challenges:•  Nogroundtruthaboutrisk

o Retrospec*vedatanotallpa*entsgettestedo Actualriskonanydayisunknowable

•  Thousandsofcorrelatedvariables

Time(days)

Risk

8

Page 9: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

TheData

•  DatabasefromalargeurbanhospitalintheUS•  In‐pa*entstaysfromasingleyear•  Inclusioncriteria(seepaperfordetails)– Eliminateeasilyiden*fiablecases

Popula*on:•  ~10,000hospitaladmissions•  ~200Posi*veC.diffcases

9

Page 10: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

ExperimentalSetup

•  Training&Tes*ng– Randomlysubsampledthenega*veclass– Splitdataintostra*fiedtrainingandtestsets70/30.– Trainingset1,251admissions(127posi*ve)– Tes*ngset532admissions(50posi*ve)

10

Page 11: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

TimeInvariant TimeVarying•  prev.ICD9codes •  pa*ent’sage •  labresults•  homemedica*ons •  pa*ent’smaritalstatus •  procedures•  prev.admissionmedica*ons •  pa*ent’ssex •  loca*onroom•  pa*ent’scity •  expectedsurgery •  loca*onunit•  a?endingMD •  ERadmission •  medica*ons•  Hospitalservice •  dialysis •  vitals•  admissionsource •  diabe*c •  dayofadmission•  financialclasscode •  historyofC.diff •  unitCP•  admissioncomplaint •  num.hospitalvisits(90days) •  hospitalwideCP•  admissionprocedure •  avg.,max.,totallos(90days)

•  pa*ent’srace

Features

11

Page 12: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Features:>10,000variablesforeachdayofeveryhospitaladmission

TimeInvariant TimeVarying•  prev.ICD9codes •  pa*ent’sage •  labresults•  homemedica*ons •  pa*ent’smaritalstatus •  procedures•  prev.admissionmedica*ons •  pa*ent’ssex •  loca*onroom•  pa*ent’scity •  expectedsurgery •  loca*onunit•  a?endingMD •  ERadmission •  medica*ons•  Hospitalservice •  dialysis •  vitals•  admissionsource •  diabe*c •  dayofadmission•  financialclasscode •  historyofC.diff •  unitCP•  admissioncomplaint •  num.hospitalvisits(90days) •  hospitalwideCP•  admissionprocedure •  avg.,max.,totallos(90days)

•  pa*ent’srace

12

Page 13: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Represen*ngahospitalstay: pn

13

Page 14: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

OurApproachtoRiskStra*fica*on

RiskStra*fy

SVM

… …

p1p2

pn

r1

rn

r2

r = [r1, r2,..., r

n]

y

P

14

Page 15: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

OurApproachtoRiskStra*fica*on

RiskStra*fy

SVM

… …

p1p2

pn

r1

rn

r2

r = [r1, r2,..., r

n]

ySVMrequireslabels.

P

15

Page 16: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

LabelingtheData

DaysfromAdmission

IndexEvent

GroundTruth

Risk

16

Page 17: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

LabelingtheData

Posi*veTest

GroundTruth

Risk

DaysfromAdmission

17

Page 18: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

LabelingtheData

Posi*veTest

Risk

X

DaysfromAdmission

+1

‐1

18

Page 19: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

LabelingtheData

Posi*veTest

Wehopetoiden*fyhighriskpa*entsasearlyaspossible.

Risk

DaysfromAdmission

+1

‐1

Weassigneachdayofadmissioninwhichapa*enteventuallytestsposi*veasposi*ve(highrisk)…

…andnega*ve(lowrisk)otherwise.

19

Page 20: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

LearningtheDecisionBoundary

Weexpectapa*ent’sriskfluctuatesnoiseinthetraininglabels

20

Page 21: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

LearningtheDecisionBoundary

Note:Simplifiedillustra*on.Welearnalinearhyperplaneinthehighdimensionalfeaturespace. 21

Page 22: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

DailyRisk‐>SVMCon*nuousPredic*ons

WeconsiderthedistanceeachfeaturevectorliesfromtheSVMdecisionboundarythisresultsinacon7nuouspredic*onforeachday.

!3 !2 !1 0 1 2 30

500

1000

1500

SVM Predictions

Fre

quency

NegativePositive

(Trainingdata)

rd = w• pd ! b

22

Page 23: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

OurApproachtoRiskStra*fica*on

RiskStra*fy

SVM

… …

p1p2

pn

r�

rn

r2

r = [r1, r2,..., r

n]

y

P

23

Page 24: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

ExampleRiskProcesses

24

Page 25: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

UsingRiskProcessesforRiskStra*fica*on

•  Instantaneousapproach:– Analogoustotypicalriskstra*fica*onapproaches– Considersvalueofriskprocessonlyondayofpredic*on

•  Cumula*veapproach:–  Combinees*matesfromallpreviousdays

E.g.,constant,linear,andquadra*cweightedaverages

(Dubberkeetal.,2011)

25

Page 26: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Evalua*ngInstantaneousApproach

•  Considerinstantaneouses*mateforpa*entriskataconstantdistancebeforetheindexevente.g.,2days

Computeclassifierperformancebysweepingthedecisionthresholdfrommintomax.

1 2 3 4 5 6−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

Days from Admission

Ris

k

True LabelPredicted RiskThreshold

Pa*enttestsposi*veforC.diffonday8

Es*m

ated

Risk

(r)

DaysfromAdmissiontoHospital

PatientRisk = rindex!2

26

Page 27: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Evalua*ngCumula*veApproach

•  Combinees*matesforpa*entriskfromthe*meofadmissionuptoaconstantdistancefromtheindexevente.g.,2days

Computeclassifierperformancebysweepingthedecisionthresholdfrommintomax.

1 2 3 4 5 6−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

Days from Admission

Ris

k

True LabelPredicted RiskThreshold

Pa*enttestsposi*veforC.diffonday8

Es*m

ated

Risk

(r)

DaysfromAdmissiontoHospital

PatientRisk ' = f ([r1, r2,..., rindex!2 ])

27

Page 28: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

DefiningtheIndexEvent

•  Posi*veExamplesdayofposi*vetestresult– Weconsideronlydatacollecteduptotwodaysbeforeaposi*vetestresult

•  Nega*veExamplesmidpointofadmission– Consideringdischargeastheindexeventcanleadtodecep*velygoodresults

28

Page 29: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Results

Approach Tes7ngAUROC(95%CI)

Constantweightedavg. 0.7518(0.69‐0.81)

Linearweightedavg. 0.7444(0.67‐0.80)

Quadra*cweightedavg. 0.7360(0.67‐0.80)

Instantaneous 0.6870(0.61‐0.77)

Cumula*

ve

29

Page 30: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Results

1st 2nd 3rd 4th 5th0

0.025

0.05

0.075

0.1

0.125

0.15

0.175

0.2

0.225

0.25

Quintile

Pro

babili

ty o

f P

osi

tive T

est

Pa*entsinthe5thquin*leareat>20‐foldgreaterriskthanthoseinthe1stquin*le!

30

Page 31: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Conclusion•  Firststepinanalyzinghowpa*entriskforacquiringC.diffmayevolveduringahospitaliza*on–  Improvementoverexis*ngmethods

•  Nextsteps:–  Findpa?ernsofriskthatleadtoworse/be?eroutcomes–  Inves*gateapplica*oninothercontexts(e.g.,otherHAIs,in‐hospitalmortality,LOS)

31

Page 32: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

Acknowledgements

•  NaturalSciencesandEngineeringResearchCouncilofCanada(NSERC)

•  Na*onalScienceFounda*on(NSF)•  QuantaComputerInc.

32

Page 33: Risk Straficaon for Health‐Care Associated C. diffwiensj/talks/ICML_workshop_presentatio… · o Retrospecve data not all paents get tested o Actual risk on any day is unknowable

WorksCited•  McDonaldLC,etal.Clostridiumdifficileinfec*oninpa*ents

dischargedfromUSshort‐stayhospitals,1996‐2003.EmergInfectDis2006;12:409‐15.

•  PepinJ,etal.IncreasingriskofrelapseaIertreatmentofClostridiumdifficilecoli*esinQuebec,Canada.ClinInfectDis.2005Jun1;40(11):1591‐7.

•  AmericanCancerSociety.BreastCancerFacts&Figures2009‐2010.Atlanta:AmericanCancerSociety,Inc.

•  Tanner,J.etal.Waterlowscoretopredictpa*entsatriskofdevelopingclostridiumdifficile‐associateddisease.Journalofhospitalinfec*on,71(3):239‐244,2009.

•  Dubberke,E.etal.,Developmentandvalida*onofaclostridiumdifficileinfec*onriskpredic*onmodel.InfectControlHospEpidemiol32(4):360‐366,2011.

33