Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory...

23
Statistical Modeling of Loss Distributions Using actuar Vincent Goulet Probability Laws Grouped Data Minimum Distance Estimation Censored Data Statistical Modeling of Loss Distributions Using actuar Vincent Goulet École d’actuariat, Université Laval Québec, Canada

Transcript of Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory...

Page 1: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Statistical Modeling of LossDistributions Using actuar

Vincent Goulet

École d’actuariat, Université LavalQuébec, Canada

Page 2: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

actuar

Provides additional Actuarial Sciencefunctionality to RCurrent version covers

Loss distribution modelingRisk theory (including ruin theory)Simulation of compound hierarchical modelsCredibility theory

Page 3: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Summary

1 Probability Laws

2 Grouped Data

3 Minimum Distance Estimation

4 Censored Data

Page 4: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Summary

1 Probability Laws

2 Grouped Data

3 Minimum Distance Estimation

4 Censored Data

Page 5: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

At a Glance

Support for 18 probability laws not in base R

Mostly positive, heavy tail distributions

New utility functions in addition to dfoo, pfoo,qfoo, rfoo

Page 6: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Supported Distributions

Transformed Beta Family9 special cases (including Burr and Pareto)

Transformed Gamma Family5 special cases (including inverse distributions)

Loggamma

Single parameter Pareto

Generalized Beta

Phase-type distributions

Page 7: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

New Utility Functions

mfoo to compute theoretical raw moments

mk = E[Xk]

levfoo to compute theoretical limited moments

E[(X∧ )k] = E[min(X, )k]

mgffoo to compute the moment generatingfunction

MX(t) = E[etX]

when it exists

Also support for: beta, exponential, chi-square,gamma, lognormal, normal (no lev), uniform,Weibull, inverse Gaussian

Page 8: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

New Utility Functions

mfoo to compute theoretical raw moments

mk = E[Xk]

levfoo to compute theoretical limited moments

E[(X∧ )k] = E[min(X, )k]

mgffoo to compute the moment generatingfunction

MX(t) = E[etX]

when it exists

Also support for: beta, exponential, chi-square,gamma, lognormal, normal (no lev), uniform,Weibull, inverse Gaussian

Page 9: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Summary

1 Probability Laws

2 Grouped Data

3 Minimum Distance Estimation

4 Censored Data

Page 10: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Definition and Rationale

Data presented in an interval-frequency manner:

Group Line 1 Line 2

(0,25] 30 26(25,50] 31 33(50,100] 57 31

Need for a “standard” storage method

Useful for minimum distance estimation

Page 11: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Creation and Manipulation of Objects

> x <- grouped.data(Group = c(0, 25,+ 50, 100), Line.1 = c(30, 31, 57),+ Line.2 = c(26, 33, 31))> x

Group Line.1 Line.21 (0, 25] 30 262 (25, 50] 31 333 (50, 100] 57 31

Page 12: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Calculation of Empirical Moments

> mean(x)

Line.1 Line.249.25847 43.19444

> emm(x, 2)

Line.1 Line.23253.884 2604.167

> E <- elev(x[, -3])> E(c(25, 50))

[1] 21.82203 37.18220

Page 13: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Plot of the Histogram and Ogive

> hist(x[, -3])

Histogram of x[, −3]

x[, −3]

Den

sity

0 20 40 60 80 100

0.00

00.

004

0.00

8

> plot(ogive(x[, -3]))

0 20 40 60 80 100

0.0

0.2

0.4

0.6

0.8

1.0

ogive(x[, −3])

xF

(x)

Page 14: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Summary

1 Probability Laws

2 Grouped Data

3 Minimum Distance Estimation

4 Censored Data

Page 15: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

mde() Supports Three Distance Measures

1 Cramér-von Mises

d(θ) =n∑

j=1

j[F(j;θ)− Fn(j;θ)]2

2 Modified chi-square

d(θ) =r∑

j=1

j[n(F(cj;θ)− F(cj−1;θ))− nj]2,

3 Layer average severity

d(θ) =r∑

j=1

j[LAS(cj−1, cj;θ)− ˜LASn(cj−1, cj;θ)]2,

where LAS(, y) = E[min(X, y)]− E[min(X, )]

Page 16: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

mde() Supports Three Distance Measures

1 Cramér-von Mises

d(θ) =n∑

j=1

j[F(j;θ)− Fn(j;θ)]2

2 Modified chi-square

d(θ) =r∑

j=1

j[n(F(cj;θ)− F(cj−1;θ))− nj]2,

3 Layer average severity

d(θ) =r∑

j=1

j[LAS(cj−1, cj;θ)− ˜LASn(cj−1, cj;θ)]2,

where LAS(, y) = E[min(X, y)]− E[min(X, )]

Page 17: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

mde() Supports Three Distance Measures

1 Cramér-von Mises

d(θ) =n∑

j=1

j[F(j;θ)− Fn(j;θ)]2

2 Modified chi-square

d(θ) =r∑

j=1

j[n(F(cj;θ)− F(cj−1;θ))− nj]2,

3 Layer average severity

d(θ) =r∑

j=1

j[LAS(cj−1, cj;θ)− ˜LASn(cj−1, cj;θ)]2,

where LAS(, y) = E[min(X, y)]− E[min(X, )]

Page 18: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Summary

1 Probability Laws

2 Grouped Data

3 Minimum Distance Estimation

4 Censored Data

Page 19: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Context

Common in statistical and actuarial applicationsto work with censored data

Actuarial terminology:

left censoring ⇔ (ordinary) deductibleright censoring ⇔ policy limit

0 5 10 15

0.00

0.04

0.08

0.12

Left Censoring

0 5 10 15

0.00

0.02

0.04

0.06

0.08

0.10

0.12

Right Censoring

Page 20: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

A Different Approach

Package survival has extensive support forcensored distributions

Our approach is different

coverage() returns pdf or cdf of censoreddistribution (with many options)

function can be used in fitting as usual(fitdistr(), mde(), ...)

Page 21: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Example With Left and Right Censoring

> f <- coverage(pdf = dgamma, cdf = pgamma,+ deductible = 1, limit = 10)

> fitdistr(y, f, start = list(shape = 2,+ rate = 0.5))

shape rate4.5822202 0.8634705

(0.7672822) (0.1518537)

Page 22: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

Example With Left and Right Censoring

> f <- coverage(pdf = dgamma, cdf = pgamma,+ deductible = 1, limit = 10)

> fitdistr(y, f, start = list(shape = 2,+ rate = 0.5))

shape rate4.5822202 0.8634705

(0.7672822) (0.1518537)

Page 23: Statistical Modeling of Loss Distributions Using actuar · Loss distribution modeling Risk theory (including ruin theory) Simulation of compound hierarchical models Credibility theory.

StatisticalModeling of

LossDistributions

Usingactuar

VincentGoulet

ProbabilityLaws

GroupedData

MinimumDistanceEstimation

CensoredData

More Information

Project’s web site

http://www.actuar-project.org

Package vignettes

actuar Introduction to actuarcoverage Complete formulas used by

coveragecredibility Risk theory featureslossdist Loss distributions modeling

featuresrisk Risk theory features

Demo files