Validation of Predictive Models: Acceptable Prediction Zone Method

Thomas P. Oscar, Ph.D.

USDA, Agricultural Research Service, Microbial Food Safety Research Unit, University of Maryland Eastern Shore, Princess Anne, MD

Background Information

Terminology

• Performance evaluation

– Process of comparing observed and predicted values.

• Validation

– A potential outcome of performance evaluation.

– Requires establishment of criteria.

Criteria

• Test Data

– Interpolation

– Extrapolation

• Performance

– Bias

– Accuracy

– Systematic Bias

Predictive Modeling

[Diagram: Primary Model (observed N(t), predicted N(t)); Secondary Models (observed and predicted No, λ, μmax and Nmax); Tertiary Model (predicted N(t)).]

Performance Evaluation (Stages 1, 2 and 3)

• Goodness-of-fit: Primary/Secondary Models

• Verification: Tertiary Models

• Interpolation: All Models

• Extrapolation: All Models

Test Data Criteria: Interpolation

• Independent data.

• Within the response surface.

– Uniform coverage.

• Collected with same methods.

Incomplete and biased evaluation: model data (10 to 40 °C) versus test data (25 to 40 °C).

Test Data Criteria: Extrapolation

• Independent data.

• Outside the response surface.

– Only one variable differs.

• Collected with same methods.

Confounded comparison: Strain A in broth versus Strain B in food.

Acceptable Prediction Zone Method: Description

Relative Error (RE)

RE for λ = (predicted − observed)/predicted

RE for N(t), No, μmax and Nmax = (observed − predicted)/predicted

RE < 0 are "fail-safe"

RE > 0 are "fail-dangerous"

[Figure: RE versus predicted N(t) (log CFU/g), with zones labeled "Acceptable", "Overly Fail-safe" and "Overly Fail-dangerous".]

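Performance Factor: %RE = (RE_IN / RE_TOTAL) × 100, where RE_IN is the number of RE values inside the acceptable prediction zone and RE_TOTAL is the total number of RE values.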

Performance Criteria

• Acceptable Predictions

-0.30 < RE < 0.15 for μmax

-0.60 < RE < 0.30 for λ

-0.80 < RE < 0.40 for N(t), No, Nmax

• Acceptable Performance

%RE ≥ 70
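The criteria above can be sketched in a few lines of Python. This is a minimal illustration, not the author's implementation: the function names, the dictionary keys and the example data are all hypothetical, and the zone limits are treated as strict inequalities as they appear on the slide.

```python
# Acceptable prediction zones (lower, upper) for RE, as listed on the slide.
# The dictionary keys are hypothetical labels chosen for this sketch.
ACCEPTABLE_ZONES = {
    "mu_max": (-0.30, 0.15),
    "lambda": (-0.60, 0.30),
    "N": (-0.80, 0.40),  # N(t), No and Nmax share this zone
}


def relative_error(observed, predicted, parameter):
    """Relative error, signed so that RE < 0 is fail-safe."""
    if parameter == "lambda":
        # Lag time uses the reversed numerator.
        return (predicted - observed) / predicted
    return (observed - predicted) / predicted


def is_acceptable(re, parameter):
    """True when RE falls strictly inside the acceptable prediction zone."""
    low, high = ACCEPTABLE_ZONES[parameter]
    return low < re < high


def percent_re(pairs, parameter):
    """%RE: percentage of (observed, predicted) pairs inside the zone."""
    errors = [relative_error(obs, pred, parameter) for obs, pred in pairs]
    re_in = sum(is_acceptable(re, parameter) for re in errors)
    return 100.0 * re_in / len(errors)


# Made-up (observed, predicted) N(t) values in log CFU/g, for illustration only.
example_pairs = [(5.0, 5.0), (5.0, 10.0), (1.0, 10.0), (15.0, 10.0)]
score = percent_re(example_pairs, "N")
print(score, "acceptable" if score >= 70 else "not acceptable")
```

In this made-up example two of the four pairs fall inside the zone, so the performance criterion of %RE ≥ 70 is not met.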

Acceptable Prediction Zone Method: Demonstration

Model Development Design

• Salmonella Typhimurium

– No = 4.8 log CFU/g

• Sterile cooked chicken

– 10, 12, 14, 16, 20, 24, 28, 32, 36, 38, 40 °C

• Viable counts

– BHI agar

– 12 per growth curve

Performance Evaluation Design: Secondary Models (Interpolation)

– 11, 13, 15, 18, 22, 26, 30, 34, 37, 39 °C

• Viable counts

– BHI agar

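Primary Model: Logistic with Delay

$N = N_o$ if $t \le \lambda$

$N = \dfrac{N_{max}}{1 + \left[(N_{max}/N_o) - 1\right]\exp\left[-\mu_{max}(t - \lambda)\right]}$ if $t > \lambda$

[Figure: N versus time (h), 0 to 40 h; dependent (goodness-of-fit) data.]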
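The logistic-with-delay primary model can be written as a small Python function. This is only a sketch: the function and argument names are hypothetical, and the parameter values in the usage loop are invented for illustration rather than taken from the slides.

```python
import math


def logistic_with_delay(t, n0, lag, mu_max, n_max):
    """N at time t (h): constant at n0 during the lag, logistic afterwards."""
    if t <= lag:
        return n0
    growth = math.exp(-mu_max * (t - lag))
    return n_max / (1.0 + ((n_max / n0) - 1.0) * growth)


# Hypothetical parameter values, for illustration only.
for hours in (0, 5, 10, 20, 40):
    n = logistic_with_delay(hours, n0=4.8, lag=5.0, mu_max=0.3, n_max=10.0)
    print(hours, round(n, 2))
```

The two branches meet at t = lag, where the logistic expression reduces to n0, so the curve is continuous.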

Primary Model Performance: Goodness-of-fit

[Figure: RE versus predicted N(t).]

%RE = 93.8

Secondary Model for No

No = mean No

[Figure: No versus temperature (°C); dependent (goodness-of-fit) and independent (interpolation) data.]

No Model Performance

Type of Evaluation              %RE
Dependent (goodness-of-fit)     100
Independent (interpolation)     100

[Figure: RE versus predicted No (log CFU/g).]

Secondary Model for λ: Hyperbola with Shape Factor

$\lambda = \left[\dfrac{41.47}{T - 7.325}\right]^{1.44}$

[Figure: λ versus temperature (°C).]
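The hyperbola-with-shape-factor model for lag time can be evaluated with a short function. This is a sketch using the constants from the slide; the function name is hypothetical, and the guard for temperatures at or below 7.325 °C is an added assumption, since the expression is undefined there.

```python
def lag_time(temp_c):
    """Predicted lag time (h) at temperature temp_c (degrees C)."""
    if temp_c <= 7.325:
        # Assumption for this sketch: reject temperatures where the
        # hyperbola is undefined or negative.
        raise ValueError("temperature must be above 7.325 C")
    return (41.47 / (temp_c - 7.325)) ** 1.44


for temp in (10, 20, 30, 40):
    print(temp, round(lag_time(temp), 2))
```

The predicted lag time falls steeply as temperature rises across the 10 to 40 °C range used for model development.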

λ Model Performance

%RE = 100 for both dependent (goodness-of-fit) and independent (interpolation) data.

[Figure: RE versus predicted λ (h).]

Secondary Model for μmax: Modified Square Root

$\mu_{max} = 0.01885$ if $T \le 11.43$

$\mu_{max} = 0.01885 + \left[0.004325\,(T - 11.43)\right]^{1.306}$ if $T > 11.43$

[Figure: μmax versus temperature (°C); dependent (goodness-of-fit) and independent (interpolation) data.]
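The piecewise modified square root model for μmax can be sketched as follows. The function name is hypothetical, and the exponent is applied to the whole bracketed term, following the bracket placement as it appears in the slide text; that reading is an assumption of this sketch.

```python
def mu_max(temp_c):
    """Predicted maximum specific growth rate (1/h) at temp_c (degrees C)."""
    if temp_c <= 11.43:
        return 0.01885
    # Assumption: the exponent 1.306 applies to the bracketed product.
    return 0.01885 + (0.004325 * (temp_c - 11.43)) ** 1.306


for temp in (10, 20, 30, 40):
    print(temp, round(mu_max(temp), 4))
```

Below 11.43 °C the model returns a constant minimum rate, and above it the rate increases with temperature.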

μmax Model Performance

%RE = 100 for both dependent (goodness-of-fit) and independent (interpolation) data.

[Figure: RE versus predicted μmax (h⁻¹).]

Secondary Model for Nmax: Asymptote Model

$N_{max} = \exp\left(2.348\left[\dfrac{(T - 9.64)(T - 40.74)}{(T - 9.606)(T - 40.76)}\right]\right)$

[Figure: Nmax versus temperature (°C).]
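The asymptote model for Nmax can be evaluated directly from the constants on the slide. This is a minimal sketch with a hypothetical function name; it does not guard against the two temperatures (9.606 and 40.76 °C) at which the denominator is zero.

```python
import math


def n_max(temp_c):
    """Predicted maximum population density (log CFU/g) at temp_c (degrees C)."""
    numerator = (temp_c - 9.64) * (temp_c - 40.74)
    denominator = (temp_c - 9.606) * (temp_c - 40.76)
    return math.exp(2.348 * numerator / denominator)


for temp in (10, 20, 30, 40):
    print(temp, round(n_max(temp), 2))
```

Away from the two asymptotes the ratio stays close to 1, so the prediction is nearly flat at about exp(2.348) over most of the temperature range and drops near the limits.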

Nmax Model Performance

%RE = 100 for both dependent (goodness-of-fit) and independent (interpolation) data.

[Figure: RE versus predicted Nmax (log CFU/g).]


Tertiary Model Performance: Verification

[Figure: RE versus predicted N(t).]

%RE = 90.7

Comparison of Models

Model       RE_IN   RE_OUT   RE_TOTAL
Primary     121     8        129
Tertiary    117     12       129
Total       238     20       258

Fisher’s exact test; P = 0.48, not significant.
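The comparison above can be reproduced approximately with a standard-library sketch of a two-sided Fisher's exact test. The two-sided convention used here (summing the probabilities of all tables no more likely than the observed one) is an assumption, since the slide does not say which convention or software was used, so the result may differ slightly from the reported P value.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x counts in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum every table that is no more likely than the observed one;
    # the small tolerance absorbs floating-point ties.
    return sum(
        p for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


# Rows: primary and tertiary model; columns: RE_IN and RE_OUT.
p_value = fisher_exact_two_sided(121, 8, 117, 12)
print(round(p_value, 3), "significant" if p_value < 0.05 else "not significant")
```

A P value well above 0.05 means the proportion of acceptable predictions does not differ detectably between the primary and tertiary models.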

Performance Evaluation Design: Tertiary Model (Interpolation)

– 11, 13, 15, 18, 22, 26, 30, 34, 37,

• Viable counts

– BHI agar

Tertiary Model Performance: Interpolation

[Figure: N versus time (h), 0 to 25 h.]

[Figure: RE versus predicted N(t).]

%RE = 97.5

Should the validated tertiary model be used to predict chicken safety?

• Evaluation for extrapolation to:

– other initial densities (No)

– other strains

– other chicken products

Performance Evaluation Design: Tertiary Model (Extrapolation)

– 10, 12, 14, 16, 20, 24, 28, 32, 36, 40 °C

• Viable counts

– BHI agar

Tertiary Model: Extrapolation to Low No

[Figure: N versus time (h), 0 to 40 h.]

Tertiary Model Performance: Extrapolation to Low No

[Figure: RE versus predicted N (log CFU/g), with the plot annotation "RE > 10".]

%RE = 2.5

Conclusions

• Criteria are important for evaluating performance of models.

• Consensus on validation would improve the quality and use of predictive models in the food industry.
