1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH...

60
1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH [email protected] *The findings and conclusions in this report are those of the authors and do not necessarily represent the views of the National Institute for Occupational Safety and Health

Transcript of 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH...

Page 1: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

1

Model Averaging: Beyond Model Uncertainty in Risk Analysis *

Matthew W. Wheeler

[email protected]

*The findings and conclusions in this report are those of the authors and do not necessarily represent the views of the National Institute for Occupational Safety and Health

Page 2: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

2

Acknowledgements

• A. John Bailer

Miami University/NIOSH

Page 3: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

3

Outline

• Introduction/Motivation.

• Model Averaging 101.

• Validation Simulation Study.

• MA Software for dichotomous response.

Page 4: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

4

Introduction/Motivation

• Model choice is frequently often a point of contention among risk assessors/risk managers.

• Frequently multiple models describe the data “equivalently.”

• Two model’s estimates of risk, especially at the lower bound, can differ dramatically.

• Model uncertainty is inherent in most risk estimation, though practically ignored in most situations.

Page 5: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

5

• Consider the problem of estimating a benchmark dose (BMD) from dichotomous dose response data.

• Here we seek to estimate the BMD from a “plausible” model, given experimental data.

• In these experiments: – Animals are exposed to some potential hazard. – The adverse response is assumed to be distributed binomially.– Risk (i.e, probability of adverse response) is estimated using regression

modeling.– Multiple dose-response models can be used to estimate risk.

Introduction/Motivation (cont)

Page 6: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

6

logistic model: (1)

log-logistic model: (2)

gamma: (3)

multistage (4)

probit (5) dad )(5

...)exp(1)1()( 2214 ddd

dtetd td

0

13

1)1()(

dd

exp1

1)(1

dd

lnexp1

)1()(2

Common Dose-Response Models Used:

Page 7: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

7

log-probit (6)

quantal-linear (7)

quantal-quadratic (8)

Weibull (9)

where Γ(α)= gamma function evaluated at α, for Ф(x) = CDF N(0,1) and πi = γ when di=0 for models (2) and (7).

)exp(1)1()(9 dd

)exp(1)1()( 28 dd

)exp(1)1()(7 dd

Common Dose-Response Models Used:

)ln()1()(6 dad

Page 8: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

8

Benchmark dose estimation• BMD is the dose associated with the a specified increase in

response relative to the control response (BMR); e.g.,:

dose d such that: BMR=[πd- π0]/[1- π0] or BMR= πd- π0

• The BMR is commonly set at values of 1%, 5%, 10%.

• BMDL = 100(1-α)% lower confidence limit on the BMD.

NOTE: As πd is dependent on a model thus the BMD is model dependent!!

Page 9: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

9

• Given data (in absence of mechanistic information), a typical analyst will:– Estimate the regression coefficients for

models (1)-(9).

– Estimate the BMD/BMDL given the model.

– Pick the “best model.”

Typical Risk Estimation Process

Page 10: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

10

• Consider TiO2 lung tumor data which has been combined from the studies of Heinrich et al. (1995) Muhle et al. (1991) and Lee et al. (1985).

• Here the benchmark dose (BMD) as well as its lower bound (BMDL) are estimated at BMRs of 10 and 1%.

Example of this:

Page 11: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

11

Model *

Extra Risk

BMD(BMDL)

AIC‡X2 GOF P-value 10% 1%

Quantal-Quadratic 364.95 0.570.96 0.30

(0.84) (0.26)

Logistic 365.39 0.501.01 0.25

(0.92) (0.21)

Probit 365.66 0.480.97 0.22

(0.87) (0.18)

Log-Probit 365.74 0.551.01 0.51

(0.79) (0.20)

Gamma 365.90 0.531.03 0.52

(0.83) (0.18)

Log-Logistic 366.03 0.521.04 0.50

(0.82) (0.18)

Weibull 366.11 0.521.04 0.49

(0.83) (0.17)

Multistage (3-Degree Polynomial) 368.11 0.611.04 0.47

(0.85) (0.14)

Quantal-Linear 370.35 0.260.80 0.08

(0.62) (0.06)

** All fits were obtained using the US EPA’s BMDS‡ Calculated using the number of parameters vs. the number of non-bounded parameters.

Page 12: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

12

Page 13: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

13

• If we pick best AIC the quantal-quadratic model risk estimates would be chosen.

• If we pick the best Pearson Χ2 test statistic the 3-degree multistage model would be chosen.

• All of the models are reasonable based on these statistics.

Model Choice

Page 14: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

14

• Estimates are “reasonably similar” at the 10% estimate.

• Estimates vary by a factor of 5 for the 1% estimate.

• If a BMR of 0.1% is used (results not shown) the Ti02 BMD/BMDL estimates differ by a factor of 35.

• This heterogeneity in model estimates exists even though the model fit statistics are very similar.

• Model uncertainty (and heated debate) result when any one of the above models is chosen.

Model Choice (cont)

Page 15: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

15

• A better way would be to find an adequate way to combine all estimates, and thus describe/account for model uncertainty.

• Model Averaging (MA) is one such method that may satisfactorily account for model uncertainty.

• Instead of focusing on a single model it allows researchers to focus on “plausible behavior.”

Model Averaging

Page 16: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

16

• We can think of any model contributing information (including possible bias) to an analysis.

• Picking any one model ignores other plausible information, and possibly introduces bias into the analysis.

• Model averaging is a method that attempts to synthesize all of the information available.

Model Averaging (cont.)

Page 17: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

17

• Kang et al. (Regulatory Toxicology and Pharmacology, 2000) and Bailer et al. (Risk Analysis,2005) proposed model averaging for risk assessment.

• They used an “Average-BMD” methodology.

Model Averaging (cont)

Page 18: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

18

• In this situation the benchmark dose is computed as :

• Here the BMDk is estimated from one of K models individually• The BMDL is computed as:

• Weights are based upon the formula:

• Where Ii=AIC, Ii=KIC , or Ii=BIC. Other weights are possible.

k

kkMA DMBwDMB ˆˆ

k

kkMA DLMBwDLMB ˆˆ

K

ii

jj

I

Iw

1

)2/exp(

)2/exp(

“Average Dose” MA

Page 19: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

19

• This method was found to perform unreliably in many situations in terms of nominal coverage and bias. (Wheeler and Bailer, Environmental and Ecological Statistics, In Press).

• The results suggested a different BMD estimation technique may perform better. We call this the “average model” MA.

“Average Dose” MA

Page 20: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

20

“Average-Model” MA

• Given the fits of models (1)-(9) MA:– Calculates the dose-response based upon a weighted

average of dose-responses Raftery et al. (1997), Buckland et al. (1997)

– Estimates the MA dose-response curve as:

– Weights are formed as above.

K

iii wdd

1MA ),()( θ

Page 21: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

21

0.0 0.2 0.4 0.6 0.8 1.0

Dose

0.05

0.10

0.15

0.20

Pro

babi

lity

of R

espo

nse

Model Average

Quantal Linear (wi=0.25)

Logistic (wi=0.45)

Gamma (wi=0.3)

Page 22: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

22

• Given this “Average-model,” the benchmark dose is then computed by finding the dose that satisfies the equation

BMR = [πMA(d)i- π MA(0)]/[1- π MA(0)].

• The BMDL is computed through a parametric bootstrap. Here the 5th percentile of the bootstrap distribution is used to compute the 95% lower tailed confidence limit estimate on the BMD.

“Average-Model” MA Benchmark Dose

Page 23: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

23

• This is substantially different from the “average-dose” method.– “Average-Dose:”

Model Fits → Individual BMD estimation → BMD MA estimate

– “Average-Model:”

Model Fits → MA Model Estimate → MA-BMD estimation

“Average-Model” vs. “Average-Dose”

Page 24: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

24

TiO2 Analysis RevisitedExtra Risk

BMD (BMDL)

AIC

Weights AIC

X2 GOF P-value

10% 1%

0.96 0.30 Quantal-Quadratic 0.192 364.95 0.57 (0.84) (0.26)

1.01 0.25 Logistic 0.154 365.39 0.50 (0.92) (0.21)

0.97 0.22 Probit 0.134 365.66 0.48 (0.87) (0.18)

1.01 0.51 Log-Probit 0.129 365.74 0.55 (0.79) (0.20)

1.03 0.52 Gamma 0.119 365.90 0.53

(0.83) (0.18)

1.04 0.50 Log-Logistic 0.112 366.03 0.52 (0.82) (0.18)

1.04 0.49 Weibull 0.108 366.11 0.52 (0.83) (0.17)

1.04 0.47 Multistage 0.039 368.11 0.61 (0.85) (0.14)

0.80 0.08 Quantal-Linear 0.013 370.35 0.26

(0.62) (0.06)

“average-model” Model Averaging BMD 1.01 0.36

(BMDL) n/a n/a n/a

(0.82) (0.12)

“average dose” Model Averaging BMD 1.00 0.38

(BMDL) n/a n/a n/a

(0.84) (0.20)

Page 25: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

25

• MA seems like a good idea, however we need to know if it works well in practice.

• A simulation study was conducted to investigate the behavior of MA.

Validation Study

Page 26: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

26

Validation Study

• A Monte Carlo simulation study was conducted to investigate the behavior of “average model” model averaging.

• 54 true model conditions, using models (1) – (9) were used in the simulation.

• Full study described in Wheeler and Bailer (Risk Analysis, in Press)

Page 27: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

27

Truth was determined by using various response patterns. These were chosen to reflect shallow, medium, and steep curvature as well as low and high background response rates. In all six response patterns were chosen.

Response Pattern

Responses at doses of 0, 0.5, 1

1 (2%,7%,20%) 2 (2%,14%,34%) 3 (2%,25%,50%) 4 (7%,12%,25%) 5 (7%,19%,39%) 6 (7%,30%,55%)

-0.1 0.1 0.3 0.5 0.7 0.9 1.1

0.0

0.1

0.2

0.3

0.4

0.5

Step 1: Decide upon “TRUTH”

Page 28: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

28

Using these 6 dose-response patterns the parameters associated with models (1)-(9) (e.g. γ,α,β,θ,θ2 ) were determined. For example:

These fits were used as the true underlying model in the simulation.Three points allowed a unique specification of all models. A variety of dose-response patterns were represented by the 54(6 conditions x 9 models) true dose response curves.

Response Pattern 2 Dose 0 0.5 1 Resp 2% 14% 34%

Weibull Fit True Weibull Model γ α β

0.02 1.60 0.40

Step 1 (cont)

Page 29: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

29

• The simulation proceeded by generating hypothetical toxicology experiments with response probability π(d).

• With π(d) specified by one of the 54 true dose-response curves estimated in step 1.

• These experiments consisted of 4 dose group design with doses of 0, 0.25, 0.50, and 1.0.

• n=50 for all dose groups.• 2000 experiments were generated per true dose-response

curve. • Bias as well as coverage [i.e., Pr(BMDL ≤ BMDtrue)] was

estimated. • Coverage is reported here.

Step 2: Given truth set up simulation conditions.

Page 30: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

30

• In each experiment the “average-model” BMD as well as the BMDL was estimated.

• BMRs of 1% and 10% were used to estimate the BMD.• Two model spaces for averaging were considered.

– One space consisted of three flexible models: the multistage, Weibull and the log-probit model.

– The second space had seven models that added the probit, logistic, quantal-linear, and quantal-quadratic to the three model space.

• The simulation took approximately 1 CPU year of computation.

Step 3: Gather appropriate statistics.

Page 31: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

31

Coverage

• The BMDL was computed [with α = 0.05] and compared to the true benchmark dose at the specified BMR.

• Coverage probability [i.e., Pr(BMDL ≤ BMDtrue)] was estimated across 2000 simulations.

Page 32: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

32

3 6 3 6

3 6

0.7

0.8

0.9

1.0

0.7

0.8

0.9

1.0

0.7

0.8

0.9

1.0

Gamma (None) Log-Logistic (None) Logistic (7)

Probit (7) Quantal Linear (7) Quantal Quadratic (7)

Multistage (3,7) Log-Probit (3,7) Weibull (3,7)

7 Model MA3 Model MA

Coverage BMR = 10%

Page 33: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

33

3 6 3 6

3 6

70.0%

80.0%

90.0%

100.0%

70.0%

80.0%

90.0%

100.0%

70.0%

80.0%

90.0%

100.0%

Gamma (None) Log-Logistic (None) Logistic (7)

Probit (7) Quantal Linear (7) Quantal Quadratic (7)

Multistage (3,7) Log-Probit (3,7) Weibull (3,7)

7 Model MA3 Model MA

Coverage BMR = 1%

Page 34: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

34

• Nominal coverage is reached for most simulation conditions.

• MA fails to reach nominal coverage in the quantal-linear and similar cases.

Coverage (Summary)

Page 35: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

35

Quantal-Linear Problems

• The quantal-linear case represents a cause of concern if MA is to be used.

• It is important to understand when (and why) it doesn’t work, and compare the method to current practice.

• We first compare it to the common practice of picking the “best model.”

• The “best-model” is chosen by a chi-squared goodness of fit statistic.

Page 36: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

36

BMR = 10% BMR = 1%

Best Modelfrom X2*

”average model” MA Best Modelfrom X2*

”average model” MA

7Model 3 Model 7Model 3 Model

Quantal Linear 1 0.89 0.83 0.90 0.72 0.79 0.89

Quantal Linear 2 0.80 0.78 0.90 0.75 0.83 0.89

Quantal Linear 3 0.77 0.83 0.91 0.76 0.83 0.88

Quantal Linear 4 0.90 0.88 0.95 0.69 0.90 0.97

Quantal Linear 5 0.69 0.81 0.93 0.68 0.86 0.88

Quantal Linear 6 0.70 0.80 0.91 0.69 0.83 0.89

Coverage ComparisonMA vs. Best Model

*True model included in the model set.

Page 37: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

37

• As a decision rule, in terms of coverage, “Average-model” 3-model averaging performs better than picking the best fitting model.

NOTE: The true model was included in the set of models the “best model” was chosen from.

MA vs. “Best Model”

Page 38: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

38

• Despite the above it is disconcerting that a set of flexible models like the 3-model space model average fails to describe quantal-linear behavior.

• We look at the sampling distribution of the experiment for the quantal-linear model, in attempt to explain this behavior.

Further Analysis of Quantal Linear Model

Page 39: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

390.0 0.2 0.4 0.6 0.8 1.0

05

10

15

20

Dose

Ob

serv

ed

Inci

de

nce

Sampling distribution for the quantal-linear model

Page 40: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

400.0 0.2 0.4 0.6 0.8 1.0

Dose

0.00

0.05

0.10

0.15

Exc

ess

Ris

k

Quantal Linear Truth

Weibull

Log-Probit

Multistage

Average fit for 3-model MA models

Page 41: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

41

• The flexibility of the models combined with the sampling distribution introduces bias into the estimation of the dose-response curve.

• The bias carries through in BMD estimation.

• This also may be the cause of the conservative behavior (i.e. coverage > 99%) seen in the quantal-quadratic case.

Further Analysis of Quantal Linear Model

Page 42: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

42

Bias Corrected Intervals• The evident bias suggests a search for a corrective mechanism.

• Percentile based bootstrap confidence intervals are not optimal.

• Biased Corrected and Accelerated (BCa) confidence intervals (CI) are preferred (Efron, 1994)

• The BCa CIs are computationally more difficult. (i.e., full simulation becomes longer if these were being computed in all conditions)

• Only the quantal-linear conditions were analyzed using the BCa confidence interval.

Page 43: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

43

7 Model Space 3 Model Space

BCa Percentile BCa Percentile

Quantal-linear 1 0.852 0.829 0.868 0.903

Quantal-linear 2 0.891 0.776 0.928 0.901

Quantal-linear 3 0.926 0.831 0.918 0.906

Quantal-linear 4 0.840 0.881 0.836 0.946

Quantal-linear 5 0.868 0.807 0.924 0.927

Quantal-linear 6 0.909 0.798 0.917 0.914

Page 44: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

44

• BCa CIs fail to be appropriate in shallow dose response cases.

• In these cases an infinite BMD can be estimated through the bootstrap re-samples.

• This causes a problem for the BCa technique, as it violates its underlying assumptions.

• These problems are identifiable through histogram plots of the bootstrap resample.

Page 45: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

45

0.1344790.923721

1.7129622.502204

3.2914464.080687

4.8699295.659171

6.4484127.237654

8.0268968.816137

9.605379

Bootstrap Resamples

0

500

1000

1500

2000

Sampling distribution where BCa BMDLfails to cover the true BMD when the percentile based method does.

Page 46: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

46

• Model averaging recovers the central estimate, as well as the lower bound in most cases. (Some bias is however evident)

• BMDs and BMDLs with BMRs of 1% are most problematic.

• Problems exhibited with true quantal-linear dose-response mechanisms.

• BCa confidence intervals can help remediate some of these problems.

Simulation Conclusions

Page 47: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

47

• “Average-model” model averaging is superior to picking the best model.

• It recovers the true BMD uncertainty (in terms of correct coverage probabilities) when the true model is not in the MA model space.

• The results show MA is not a panacea, it is however a step in the right direction.

Page 48: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

48

Page 49: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

49

Model Averaging Software

• The results are promising but implementation of this approach is difficult.

• The simulation code has been repackaged to allow users to implement dichotomous dose-response model averaging.

• This is done in a simple MS Windows command prompt program.

Page 50: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

50

• Software is described that will implement Model Averaging for Dichotomous dose-response data (affectionately named MADr for short).

• The software, as well as the code, is freely available under the GNU public license.

• It will be available, in the near future, on NIOSH’s web site. A beta version is distributed with this talk.

Page 51: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

51

MADr-BMD: Algorithm Overview

DichotomousData Input File

Model 1 Model K…. ….

Model AverageBMD

Parametric Bootstrap BMDL

Create Resampled Data

Model 1 …. ….

Model AverageBMD

Model K

Output

Page 52: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

52

Data Specification

• MADr is run on the command line by using the command “madr” at the prompt.

• It’s input is a text file that can be edited using notepad or similar text editor.

• By default it expects the file “input.txt”. • Alternate input file names can be specified.

For Example:

“madr tio2.txt”

“madr newtoxdata.txt”

Page 53: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

53

Example input file

250 1e-8 1e-80 0 1 1 1 1 1 1 1101 2 1 0.1 0.95 5000 0240 50 022.5 50 345.0 50 1090.0 50 29

250 1e-8 1e-8 Iterations, Parameter Conv. , Relative Function Conv.

0 0 1 1 1 1 1 1 1 MA Specification: 1 = Included 0 = Not-includedModel Order: (Quantal-Linear, Quantal-Quadratic, Multistage

Logistic, Probit, Weibull, Log-Probit, Log-Logistic, Gamma)

101 Random Seed (Specifying 0 implies current clock time will be used)

2 1 0.1 Averaging Criterion ( 1 = BIC, 2 = AIC, 3=KIC, 4 = BICB, 5 = AICB, 6 = KICB ) , Risk Type ( 1 = Added Risk, 2 = Extra Risk),BMR

0.95 5000 0 Nominal Coverage Probability, Number of bootstrap resamples, Output Bootstrap resamples (0 = no 1 = yes)

2 Degree of multistage polynomial.

4 Number of data lines

0 50 022.5 50 345.0 50 1090.0 50 29

Data Specification: Dose, Number of experimental units, Number of observed responses

Page 54: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

54

Program output

• The output for the analysis is printed to the screen.

• This can be redirected to a text file using the “>” command. For Example:

“madr > output.txt”

“madr tio2.txt > anal1_output.txt”

Page 55: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

55

Model Weight -2log(L) AIC BIC--------------------------------------------------------------------------Multistage 0.100 50.61 56.61 62.35Logistic 0.228 50.96 54.96 58.79Probit 0.218 51.05 55.05 58.88Weibull 0.100 50.61 56.61 62.35Log-Probit 0.122 50.21 56.21 61.95Log-Logistic 0.121 50.23 56.23 61.97Gamma 0.111 50.40 56.40 62.13--------------------------------------------------------------------------

------------------------------------------'Average-Model' Benchmark Dose Estimate------------------------------------------Nominally Specified Confidence Limit:0.950Weighting Criterion: AICBMD Calculation: Added RiskBMR: 0.100000BMD: 0.894456594251BMDL(BCa):0.303525585681BMDL(Percentile):0.296586853918Acceleration: -0.034445Bootstrap Resamples: 5000Random Seed: 101------------------------------------------

Page 56: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

56

MODEL: Multistage, 2-degree polynomial:

-----------------------------------------

Parameters Estimate StdErr

------------------------------------------

GAMMA: 0.171088 0.079273

BETA(1): 0.000000 N/A

BETA(2): 0.141611 0.052429

Optimization Succeeded

------------------------------------------

MODEL: Logistic:

------------------------------------------

Parameters Estimate StdErr

------------------------------------------

ALPHA: -1.824682 0.540209

BETA: 1.003971 0.301075

Optimization Succeeded

------------------------------------------

MODEL: Probit:

------------------------------------------

Parameters Estimate StdErr

------------------------------------------

ALPHA: -1.083244 0.300300

BETA: 0.590790 0.160163

Optimization Succeeded

------------------------------------------

Page 57: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

57

Conclusions

• Simulation Results + MADr implies dichotomous based model averaging can be used reliably in ones own research.

• Though we have tested MADr, it is still a “use at your own risk program.”

Page 58: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

58

• As mentioned before model averaging is not a panacea.

• As such it does not: – Relieve scientists from using their expert

judgment. – Give automatic license to produce a low dose

extrapolations. – Remove the need for adequate individual model fit

diagnostics. – Remove all model uncertainty from the analysis.

Page 59: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

59

• It does: – Reframe the debate of model choice. – Produces relatively stable central estimates often

independent of a given model being included in the average.

Page 60: 1 Model Averaging: Beyond Model Uncertainty in Risk Analysis * Matthew W. Wheeler NIOSH MWheeler@cdc.gov *The findings and conclusions in this report are.

60

Selected ReferencesRaftery, A. E. (1995). Bayesian model selection in social research. Sociological

Methodology, 25, 111-163.

Hoeting, J.A., Madigan, D., Raftery, A.E., & Volinsky, C.T. (1999) Bayesian model averaging: a tutorial. Statistical Science, 14, 382-417.

Buckland, S.T., Burnham, K. P., & Augustin, N. H., (1997). Model Selection: An Integral Part of Inference. Biometrics, 53, 603-618.

Kang, S.H., Kodell, R.L., Chen, J.J. (2000) Incorporating Model Uncertainties along with Data Uncertainties in Microbial Risk Assessment. Regulatory Toxicology and Pharmacology, 32, 68-72.

Bailer, A.J., Noble R.B. and Wheeler, M. (2005) Model uncertainty and risk estimation for quantal responses. Risk Analysis, 25,291-299.

Wheeler, M. W., & Bailer, A.J., (In Press). Properties of model-averaged BMDLs: A study of model averaging in dichotomous risk estimation. Risk Analysis