Florida Public Hurricane Loss Model (FPHLM 3.0) · Florida Public Hurricane Loss Model (FPHLM 3.0)...

Florida Public Hurricane Loss Model (FPHLM 3.0)

STATISTICAL STANDRADS

B. M. Golam KibriaDepartment of Statistics

Florida International UniversityMiami, FL 33199

S-1 Modeled Results and Goodness-of-Fit

A. The use of historical data in developing the model shall be supported by rigorous methods published in currently accepted scientific literature.

The historical data for the period 1900-2007 was modeled using scientifically accepted methods that had been published.

B. Modeled and historical results shall reflect agreement using currently accepted scientific and statistical methods.

Modeled and historical results are in agreement as indicated by appropriate statistical and scientific tests. Some of these tests are discussed below.

S1: Modeled Results and Goodness of Fit (GOF)

Comparison of modeled vs historical occurrences

0.0000

0.10000.2000

0.30000.4000

0.50000.6000

0.7000

1 2 3 4 5 6 7

Events per year

Occ

urre

nce

rate

Modeled ProbabilityHistorical Probability

Comparison of modeled vs historical occurrences

H0: Historical and modeled data follow the same distribution

Ha: They are from different distributions.

Chi-square test statistic

Chi square goodness of fit, p-value =0.82 (DF=2)

Since p-value is 0.82, we can not reject the null hypothesis at 5% level of significance.

Comparison between observed and modeled Holland B

0.5 1 1.5 2 0123456789

10111213141516

Distribution of the B parameter

ObservedModel Scaled

B param eter

Occ

urre

nce

Model Holland B by Normal Distribution

Chi square goodness of fit Test: p-value = 0.57 (DF=8)

Kolmogorov-Smirnov goodness of fit Test: p-value= 0.85

Comparison between observed and modeled R Max

Gamma vs Observed

0

5

10

15

20

25

5 10 15 20 25 30 35 40 45 50 55

ObservedGamma

Model Rmax by Gamma Distribution

Maximum likelihood estimators:

53547.5ˆ =k 67749.4ˆ =θ

Chi square goodness of fit test: p-value= 0.26 (DF= 5)

KS-

goodness of fit test: p-value = 0.80

Assessment of Uncertainty in Loss Costs

Coefficient of variation provides a relative measure of

data dispersion compared to the mean.

2.5 3.0 3.5 4.0 4.5 5.0 5.5CV

0

5

10

15

20

25

Histogram of CVs

S-2 Sensitivity Analysis for Model Output

The modeler shall have assessed the sensitivity of temporal and spatial outputs with respect to the simultaneous variation of input variables using currently accepted scientific and statistical methods and have taken appropriate action.

We have assessed the sensitivity of temporal and spatial outputs with respect to the simultaneous variation of input variables using currently accepted scientific and statistical methods and submitted to the commission under 2007 standard.

S-2 Sensitivity Analysis (SA) for Model Output

Standardized Regression Coefficients:

A linear regression model that regress maximum sustained surface wind speed (MSSWS) on Rmax, VT, CP and Holland B (HB) could be fit for 966 grid points for each time point and for each hurricane category. The model is defined as

From a fitted model, we can to compare the regression coefficients in terms of the magnitude of their effects.

The magnitude of βs can not be compared directly due to their different unit of measurements.

Standardized regression coefficients (SRC) may be used to compare the importance of different regressors.

0 1 2 3Y= + Rmax + VT+ CP+ 4 Holland B+eβ β β β β

Standardized Regression Coefficients vs. Time at Grid Coordinates (30,0) for Cat 1

Grid 30,0

-1-0.8-0.6-0.4-0.2

00.20.40.60.8

1

0 1 2 3 4 5 6 7 8 9 10 11 12

Time (hr)

Stan

dard

ized

Reg

ress

ion

Coe

ffici

ent HollandB

RmaxVTCP

Standardized Regression Coefficients vs. Time at Grid Coordinates (30,0) for Cat 5

Grid 30,0

-1-0.8-0.6-0.4-0.2

00.20.40.60.8

11.2

0 1 2 3 4 5 6 7 8 9 10 11 12

Time (hr)

Stan

dard

ized

Reg

ress

ion

Coe

ffici

ent HollandB

RmaxVTCP

S-3 Uncertainty Analysis for Model Output

The modeler shall have performed an uncertainty analysis on the temporal and spatial outputs of the model using currently accepted scientific and statistical methods and have taken appropriate action. The analysis shall identify and quantify the extent that input variables impact the uncertainty in model output as the input variables are simultaneously varied.

We have performed an uncertainty analysis on the temporal and spatial outputs of the model using currently accepted scientific and statistical methods and submitted to the commission in 2007 submission.

S-3 Uncertainty Analysis (UA) for Model Output

Expected Percentage Reduction (EPR):

The goal of uncertainty analysis is to quantify the contributions of the input parameters to the uncertainty in maximum sustained surface wind speed. The simple model is

The variance of Y can be expressed as a conditional variance onX’s. Here X’s are Rmax, CP, VT and HB etc.

The expected percentage reduction in V(Y) is expressed as

1 2 3 4Y=X +X +X +X

Var(E(Y/X)EPR in Var(Y)= 100Var(Y)

×

Summary for UA

From the following figures, we notice that the major contribution to the uncertainty in the model is R-max followed by VT, Holland B and CP. At hour 0, Rmax produces the most uncertainty and at hour 12 VT contributed the highest uncertainty in the model. It is also noted that at hour 2 there is no uncertainty among the four parameters except VT.uncertainty of MSSWS depends on the time, grid points and the category of hurricanes.

Expected Percentage Reductions in the Var(MSSWS) for a Cat 1 Hurricane versus Time at Coordinate (30,0)

Grid 30,0

0

25

50

75

100

125

0 1 2 3 4 5 6 7 8 9 10 11 12

Time (hr)

Expe

cted

Per

cent

age

Red

uctio

n HollandBRmaxVTCP

Expected Percentage Reductions in the Var(MSSWS) for a Cat 5 Hurricane versus Time at Coordinate (30,0)

Grid 30,0

0

25

50

75

100

125

0 1 2 3 4 5 6 7 8 9 10 11 12Time (hr)

Expe

cted

Per

cent

age

Red

uctio

n

HollandBRmaxVTCP

S-4 County Level Aggregation

At the county level of aggregation, the contribution to the error in loss cost estimates attributable to the sampling process shall be negligible

The error in the county level loss costs induced by the sampling process can be quantified by computing standard errors for the county level loss costs. These loss costs have been computed for all counties in the state of Florida using 53,500 years of simulation. The results indicate that the standard errors are less than 2.5% of the average loss cost estimates for all counties

S4: County Level Aggregation

Number of Simulation Years

For each county Y, define average loss cost by and standarddeviation by sy. The standard error of estimate (SE( )) will be 2.5% of the estimated mean, if the number of simulation years for county Y is:

Based on the initial 10,700 simulation runs, the highest minimumacceptable number is 35,080 for Lafayette County.

We use 53,500 years (500x107 years) of simulation for our final results. For the 53,500 simulation year run we find the standarderrors to be less than 2.5% of the loss cost for each county

2

0.025YsN

x⎛ ⎞= ⎜ ⎟×⎝ ⎠

YX

YX

Scatter plot between number of years vs percentage of the loss cost estimate for Lafayette County

Scatter plot between number of simulation years vs percentage of the loss cost estimate for Lafayette

County

0

2

4

6

8

10

12

14

0 10000 20000 30000 40000 50000 60000

Number of Years

Perc

enta

ge E

rror

Percent

Scatter plot between number of years vs percentage of the loss cost estimate for Nassau County

Scatter plot between number of simulation years vs percentage of the loss cost estimate for Nassau

County

02468

1012141618

0 10000 20000 30000 40000 50000 60000

Number oif Years

Per

cent

age

Erro

r

Percent

S-5 Replication of Known Hurricane Losses

The model shall estimate incurred losses in an unbiased manner on a sufficient body of past hurricane events from more than one company, including the most current data available to the modeler. This Standard applies separately to personal residential and, to the extent data are available, to mobile homes. Personal residential experience may be used to replicate structure-only and contents-only losses. The replications shall be produced on an objective body of loss data by county or an appropriate level of geographic detail.

The following Table compares the modeled and actual total losses by storm and company for residential coverage. The following Figure, correlation and statistical results indicated a reasonable agreement between the observed and modeled losses

S5: Replication of Known Hurricane losses

In February 2008 submission (2.7), the validation set consisted of 25 events (company/hurricane).

We processed additional company data since the first pro-team visit.

In May 16 revised submission (v3.0) , the validation set consisted of 31 events.

We continue to process additional company data. In the response to the deficiency letter that we submitted on May 29, 2008, eight (8) new events are added.

Since the February submission we have processed and added 16 newevents and consolidated 4 events into two

We checked the validation of some prior company/hurricane portfolios and made some corrections to thesis portfolios

We now have a total of 39 data points

Total Actual Losses and Total Modeled losses

Name Event Total Actual Loss Total Modeled Loss

A Charley 110471361 135347764

A Frances 20201407 78044136

B Andrew 2984373067 2558464266

B Charley 1037108745 735800992

B Charley_Mob 23395988 25990713

B Frances 614006549 428145647

B Frances_Mob 18467176 8466678

B Erin 50519119 60458572

C Charley 63889029 32714722

C Frances 122776727 88435747

D Charley 274702333 239567424

D Frances 224656954 142289724

E Charley 62086256 53273107

E Frances 43799401 18568918

F Charley 111013524 269149904

F Frances 94272660 380701388

G Charley 952353 900161

G Frances 10007410 4176704

Scatter plot between Total Actual Losses vs. Total Modeled losses

Actual Losses

Mod

el L

osse

s

0 5*10^8 10^9 1.5*10^9 2*10^9 2.5*10^9 3*10^9

05*

10^8

10^9

1.5*

10^9

25*

10^9

Scatter plot between Total Actual Losses and Modeled Losses

Paired t Test

The paired t-test t = 0.9882, df = 38, p-value = 0.3293

We fail to reject the null hypothesis (the difference in paired mean values equals zero), and conclude that there is insufficient evidence to suggest a difference between actual and modeled losses.

Proportion of points above and below the straight line

About 51% of the actual losses are more than the corresponding model losses and 49% of the model losses are more than the corresponding actual losses.

Concordance correlation coefficient (CCC)

The CCC is defined as

CCC evaluates the degree (measuring agreement to the identity line) to which the pairs fall on the line (straight line)Where is a Bias correction factor (measure of accuracy) that measures how far the best-fit line deviates from the line (or how closely the fitted line agrees with the identity line).ρ evaluates how closely observations fall on the fitted line

*c bCρ ρ=

bC

Concordance correlation and Pearson correlation coefficients

Actual Loss

Mod

el L

oss

0 5*10^8 10^9 1.5*10^9 2*10^9 2.5*10^9 3*10^9

05*

10^8

10^9

1.5*

10^9

2.5*

10^9

Concordance correlation and Pearson correlation coefficients

*c bCρ ρ=

Concordance correlation coefficient (CCC)

Sample concordance correlation coefficient= 0.973 Sample correlation coefficient (precision)= 0.988Bias correction factor (accuracy)=0.985

Conclusion: Very good agreement between actual and model losses.

S-6 Comparison of Projected Hurricane Loss Costs

The difference, due to uncertainty, between historical and modeled annual average statewide loss costs shall be reasonable, given the body of data, by established statistical expectations and norms.

The difference, due to uncertainty, between historical and modeled annual average statewide loss costs is reasonable as shown in the following description.

Standard S6: Comparison of projected hurricane loss costs

(Standard S-4) The number of simulation years is 53500. The standard errors are within less than 2.5% of the means for all counties.

(From S4): The 95% CI on the difference between the mean of the historical and modeled losses contains 0 indicating that the modeled losses do not differ significantly from historical losses.

Form S-1: Probability of Florida Landfalling Hurricanes per Year

Number Historical ModeledOf Hurricanes Probability Probability

Per Year(Hurdat June

2006)0 0.5794 0.60261 0.2710 0.26682 0.1215 0.09393 0.028 0.02834 0 0.00835 0 0.00016 0 07 0 0

8 or more 0 0

Form S2: An Example of a Probable Maximum Loss Based on a Limited Hypothetical Data (Part A)

Return Time (Years)

Probability of Exceedance Estimated Loss

53500 0.00% $103,728,414

10000 0.01% $70,655,088

5000 0.02% $68,977,344

2000 0.05% $58,506,876

1000 0.10% $53,610,128

500 0.20% $47,154,509

250 0.40% $41,384,473

100 1.00% $33,235,114

50 2.00% $26,297,815

20 5.00% $16,867,502

10 10.00% $9,482,954

5 20.00% $3,245,185

Form S2, Part B

Mean $2,772,311

Median $0

Standard Deviation $6,709,689

Interquartile Range $1,690,706

Sample size53500 years of

Simulated storms

Form S-3: Five Validation and Comparisons

Comparison #1: Hurricane Charley and Company A by Coverage

Company Actual Modeled Difference

CoverageLoss/Expos

ure

Loss/Expos

ure

Building 0.01788 0.02297 -0.00508

Contents 0.00349 0.00310 0.00039

Appurtenant

s 0.02764 0.01191 0.01573

ALE 0.00227 0.00214 0.00013

Total 0.01298 0.01590 -0.00292

Comparison #2: Different Companies by Different Hurricanes


Company EventLoss/Expos

ure Loss/Exposure

A Charley 0.01298 0.01590 -0.00292

B Andrew 0.10426 0.08938 0.01488

C Frances 0.01769 0.01275 0.00495

H Jeanne 0.01916 0.02093 -0.00177

M Jeanne 0.00916 0.01082 -0.00167

Comparison #3: Company B by Hurricane Andrew, Charley, Frances


Company EventLoss/

Exposure Loss/Exposure

B Andrew 0.10426 0.08938 0.01488

B Charley 0.01573 0.01116 0.00457

B Frances 0.01288 0.00898 0.00390

Comparison #4: Construction Type for Hurricane Charley


ConstructionLoss/Expo

sure

Loss/Exposure

Frame 0.01610 0.01678 -0.00068

Masonry 0.01445 0.01214 0.00231

Manufactured 0.04466 0.04962 -0.00495

Comparison #5: County-wide for Company A and Hurricane Charley


County Loss/Exposure Loss/Exposure

Lake 0.00042 0.00029 0.00013

Osceola 0.01359 0.01428 -0.00069

Sarasota 0.00030 0.00083 -0.00053

Collier 0.00073 0.00024 0.00049

St. Johns 0.00013 0.00033 -0.00020

Form S-4: Average Annual Zero Deductible Statewide Loss Costs –Historical versus Modeled

Part A (2002 FHCF Exposure Data)

The 95% CI for (μ1- μ2) is [-1.24 and 0.91].

Testing of equality two means using parametric test t = -0.3043, df = 53605, p-value = 0.7609

Equality of CDFs using nonparametric test KS = 0.0629, p-value = 0.7887

Conclusion: Both parametric and non-parametric methods give high p-values. The average historical and modeled losses are not statistically different.

Part B (2007 FHCF Exposure Data) (Continue Form S4)

The 95% CI for (μ1- μ2) is [-2.18 and 1.53].

Testing of equality two means using parametric test t = -0.3472, df = 53605, p-value = 0.7284

Equality of CDFs using nonparametric test KS = 0.0636, p-value = 0.7766

Conclusion: Both parametric and non-parametric cases, we have high p-values. The average historical and modeled losses are not statistically different.

Florida Public Hurricane Loss Model (FPHLM 3.0) · Florida Public Hurricane Loss Model (FPHLM 3.0)...

Documents

Transcript of Florida Public Hurricane Loss Model (FPHLM 3.0) · Florida Public Hurricane Loss Model (FPHLM 3.0)...