Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of...

31
Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton, Chester [email protected] Paul Northrop University College London [email protected] Environmental Extremes Royal Statistical Society April 2011

Transcript of Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of...

Page 1: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Threshold estimation in marginal modelling ofspatially-dependent non-stationary extremes

Philip JonathanShell Technology Centre Thornton, Chester

[email protected]

Paul NorthropUniversity College London

[email protected]

Environmental ExtremesRoyal Statistical Society

April 2011

Page 2: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Outline

• Motivation and application.

• Threshold modelling using quantile regression.

• Implications of QR threshold for PP model parameterisation.

• Adjusting for spatial dependence.

• Results for application.

• Initial theoretical & simulation studies.

• Conclusions.

Page 3: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Motivation: Rational design of marine structures

• Covariate effects:• Location, direction, season, ...• Multiple covariates in practice.

• Cluster dependence:• e.g. storms independent, observed (many times) at many

locations.• e.g. dependent occurrences in time.

• Scale effects:• Modelling H2

S gives different estimates cf. modelling HS .

• Threshold estimation; parameter estimation.

• Measurement issues:• Field measurement uncertainty greatest for extreme values.• Hindcast data are simulations based on pragmatic physics,

calibrated to historical observation.

Page 4: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Motivation: Rational design of marine structures

• Multivariate extremes:• Waves, winds, currents, ...• Componentwise maxima ⇔ max-stability ⇔ regular variation:

• Assumes all components extreme.• ⇒ Perfect independence or asymptotic dependence only.

• Extremal dependence:• Assumes regular variation of joint survivor function.• ⇒ Asymptotic dependence, asymptotic independence (with

+ve, -ve association).

• Conditional extremes:• Assumes, given one variable being extreme, convergence of

distribution of remaining variables.• Allows some variables not to be extreme.

• Inference:• ... a huge gap in the theory and practice of multivariate

extremes ... (Beirlant et al. 2004)

Aim: Useful models with rigourous assessment of modelperformance, especially in extreme quantiles.

Page 5: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Motivation: Good threshold estimation critical

• Considerable empirical evidence from applications thatcareful estimation of threshold including covariate effectsimportant for satisfactory modelling.

• Often reasonable to assume some (or all) extreme valueparameters are independent of (some or all) covariatesfollowing good thresholding, greatly simplifying model form.

• Quantile thresholds as functions of covariate(s) produce nearconstant rates of threshold exceedence (appealing fromdesign perspective).

Page 6: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Application: Marginal estimation of extreme HSPS

• Data from hindcast of Y storm peak significant wave height(in metres) in the Gulf of Mexico.• Wave height, h: trough to the crest of the wave.• Significant wave height, HS : the average of the largest 1/3

wave heights h in given period (usually 3 hours).• Storm peak HSP

S : largest value of HS from a storm (cf.declustering).

• 6 × 12 grid of 72 sites (≈ 14 km apart).

• Sep 1900 to Sep 2005 : 315 storms in total.

• Average of 3 observations (storms) per year, at each site.

Aim: Quantify the extremal behaviour of Y at each site, makingappropriate adjustment for spatial dependence.

Page 7: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Typical hurricane event in Gulf of Mexico

Page 8: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Spatial dependence

●● ●

●●

●●

●●

●●

● ●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

● ●

●●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

● ●

●●

0 2 4 6 8 10 12

0

2

4

6

8

10

12

14

2 distant sites

Hssp / m, at site 1

Hssp

/ m

, at s

ite 7

2

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●

●●

●●

●●●

●●●

●●

●●

●●

●●

●●●

●●

●●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

0 5 10 15

0

5

10

15

2 nearby sites

Hssp / m, at site 30

Hssp

/ m

, at s

ite 3

1

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

Page 9: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Spatial non-stationarity

latitude

12

34

5

6

longitude

2

4

6

810

12

storm peak significant w

ave height / m

13

14

15

16

• From single event ?

Page 10: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Modelling approach

• Spatial non-stationarity:• Model threshold as Legendre polynomial in longitude and

latitude using quantile regression.• Model spatial variation of PP parameters as Legendre

polynomials in longitude and latitude.• Lots of other suitable bases: splines, random fields ...

• Spatial dependence:• Estimate parameters assuming conditional independence of

responses given covariate values.• Adjust standard errors etc. for spatial dependence.

• Estimate extreme quantiles.

Page 11: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Extreme value regression model

Conditional on covariates xij exceedances over a high thresholdu(xij) follow a 2-dimensional non-homogeneous Poisson process.

If responses Yij , i = 1, . . . , 72 (space), j = 1, . . . , 315 (storms) areconditionally independent:

L(θ) =315∏j=1

72∏i=1

exp

{− 1

λ

[1 + ξ(xij)

(u(xij)− µ(xij)

σ(xij)

)]−1/ξ(xij )+

}

×315∏j=1

∏i :yij>u(xij )

1

σ(xij)

[1 + ξ(xij)

(yij − µ(xij)

σ(xij)

)]−1/ξ(xij )−1+

.

λ : mean number of observations per year.µ(xij), σ(xij), ξ(xij) : PP parameters at xij .θ : vector of all model parameters.

Page 12: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Covariate-dependent thresholds

Arguments for:

• Asymptotic justification for EV regression model : thethreshold u(xij) needs to be high for each xij .

• Design : spread exceedances across a wide range of covariatevalues.

Set u(xij) so that P(Y > u(xij)), is approx. constant for all xij .

• Set u(xij) by trial-and-error or by discretising xij , e.g. differentthreshold for different locations, months etc.

• Quantile regression (QR) : model quantiles of a response Yas a function of covariates.

Page 13: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Constant threshold

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●●●●

●●

●●

●●●

●●

●●●

●●●●●

●●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

estimate of 90% quantile

x

Y

Page 14: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Quantile regression

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●●●●

●●

●●

●●●

●●

●●●

●●●●●

●●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

estimate of 90% quantile

x

Y

Page 15: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Simple quantile regression in outline

• Data {xi , yi}ni=1

• τ th conditional quantile function Qy (τ |x) = xφ(τ) estimatedby solving:

minφ

n∑i=1

ρτ (yi − xiφ)

where ρτ (r) = τ r − r I (r < 0), or (with ri = ri (φ) = yi − xiφ):

minφ{τ

n∑ri≥0|ri |+ (1− τ)

n∑ri<0

|ri |}

• As a linear program:

minφ,u,v{τ1Tn u + (1− τ)1Tn v | xφ+ u − v = y}

where {ui} and {vi} are slack variables corresponding to(absolute values of) positive and negative residuals.

Page 16: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Model parameterisation

Let p(xij) = P(Yij > u(xij)). Then, if ξ(xij) = ξ is constant,

p(xij) ≈1

λ

[1 + ξ

(u(xij)− µ(xij)

σ(xij)

)]−1/ξ.

If p(xij) = p is constant then:

u(xij) = µ(xij) + c σ(xij), for some constant c.

The form of u(xij) is determined by the extreme value model:

• if µ(xij) and/or σ(xij) are linear in xij : linear QR.

• if log(µ(xij) and/or log(σ(xij) is linear in xij : non-linear QR.

Page 17: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Adjustment for spatial dependence

• Independence log-likelihood:

`IND(θ) =k∑

j=1

72∑i=1

log fij(yij ; θ) =k∑

j=1

`j(θ)

(storms) (space)

• If correct model specification:

θ → N(θ0, I−1)

• If model mis-specified, in regular problems, as k →∞:

θ → N(θ0, I−1 V I−1)

• I = Expected information: −E(∂2

∂θ2`IND(θ0)

).

• V = var(∂∂θ `IND(θ)

).

Page 18: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Adjustment of `IND(θ)

• Idea: Adjust `IND(θ) to have correct curvature near θ usingsandwich estimate.

`ADJ(θ) = `IND(θ)

+(θ − θ)′

(−I−1 V I−1

)−1(θ − θ)

(θ − θ)′(−I )(θ − θ)

(`IND(θ)− `IND(θ)

),

• Estimate I by observed information at θ.

• Estimate V byk∑

j=1

U2j

(θ)

, Uj(θ) =∂`j (θ)∂θ .

• Vertical adjustment preserves asymptotic distribution oflikelihood ratio statistic.

• See Davison (2003), Chandler and Bate (2007).

Page 19: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Summary of modelling of wave height data

• Threshold selection:• Choice of p: look for stability in parameter estimates.• Based on µ (and u) quadratic in longtiude and latitude, σ andξ constant . . .

• Spatial model:

µ =

qx∑i=0

qy∑j=0

µi+jqyφxi (lx)φyj(ly )

where:

• φ·0(·) = 1.

• φx1(lx) = 15.5(lx − 6.5), φy1(ly ) = 1

2.5(ly − 3.5).

• φ·2(·) = 12(3φ21(·)− 1), for lx , ly ∈ [−1, 1].

Page 20: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Threshold selection : µ intercept

● ● ● ● ● ● ● ● ● ● ●●

● ● ●●

●●

probability of exceedance

0.5 0.4 0.3 0.2 0.1

2.5

3.0

3.5

4.0

4.5

5.0

µ0

Page 21: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Threshold selection : µ coefficient of latitude

● ●

●●

● ●●

●●

●●

●●

probability of exceedance

0.5 0.4 0.3 0.2 0.1

−0.

35−

0.30

−0.

25−

0.20

−0.

15−

0.10

−0.

05

µ2

Page 22: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Threshold selection : ξ

●●

●● ● ● ● ●

●●

●●

probability of exceedance

0.5 0.4 0.3 0.2 0.1

−0.

10.

00.

10.

20.

30.

4

ξ

Page 23: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Summary of modelling of wave height data

• Choice of p: look for stability in parameter estimates.Use p = 0.4.

• ξ = 0.07, with 95% confidence interval (−0.05, 0.22).

• Estimated 200 year return level at (long=7, lat=1) is 15.8mwith 95% confidence interval (12.9, 22.3)m.

• Close agreement between parameter estimates for threshold uand point process mean µ.

Page 24: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Marginal 200 year return levels

latitude

12

34

5

6

longitude

2

4

6

810

12

200 year return level / m 15.2

15.4

15.6

Page 25: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Toy study 1

Data-generating process: for covariate values x1, . . . , xn:

Yi | X = xiindep∼ GEV (µ0 + µ1 xi , σ, ξ).

Set threshold:u(x) = u0 + u1 x .

For each u1, set u0 such that the expected proportion ofexceedances is kept constant at p.

• Calculate Fisher expected information for (µ0, µ1, σ, ξ).

• Invert to find asymptotic V-C of MLEs µ0, µ1, σ, ξ and hencevar(µ1).

• Find the value of u1 that minimises var(µ1).

Page 26: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Findings of Toy study 1

Let u1 be the value of u1 that minimises var(µ1).

• If covariate values x1, . . . , xn are symmetrically distributedthen: u1 = µ1 (quantile regression).

• If x1, . . . , xn are positive (negative) skew then u1 < µ1(u1 > µ1).

. . . but the loss in efficiency from using u1 = µ1 appears to besmall.

Page 27: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Simulation study 2

• 30 years of daily data on a spatial grid.

• Spatial dependence : mimics that of wave height data.

• Temporal dependence : moving maxima : extremal index 1/2(no declustering)

• Spatial variation: location µ linear in longitude and latitude.

• ξ: −0.2, 0.1, 0.4, 0.7.

• Thresholds: 90th, 95th, 99th percentiles.

• SE adjustment: data from distinct years are independent.

• Simulations with no covariate effects and/or no spatialdependence for comparison.

Page 28: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Findings of simulation study 2

• Estimates of regression effects from QR and PP models arevery close : both estimate extreme quantiles from the samedata.

• Uncertainties in covariate effects of threshold are negligiblecompared to the uncertainty in the choice of threshold level.

• To a large extent fitting the PP model accounts foruncertainty in the covariate effects at the level of thethreshold.

• Slight underestimation of standard errors : uncertainty inthreshold ignored.

Page 29: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

Conclusions

Quantile regression:

• An intuitive and effective strategy to set thresholds fornon-stationary EV models.

• Works well in initial applications.

• Supported by initial theoretical and simulation studies.

Ideas:

• Kysely, J., et al. (2010) use quantile regression to set atime-dependent threshold for peaks-over-threshold GPmodelling of data simulated from a climate model.

• Simultaneous threshold and PP model would avoid iteration(mixed-integer optimisation; see Beirlant et al. 2004).

Page 30: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

References

Chandler, R. E. and Bate, S. B. (2007) Inference for clustered datausing the independence loglikelihood. Biometrika 94 (1), 167–183.

Kysely, J., Picek, J. and Beranova, R. (2010) Estimating extremesin climate change simulations using the peaks-over-thresholdmethod with a non-stationary threshold Global and PlanetaryChange, 72, 55-68.

Northop, P. J. and Jonathan, P. Threshold modelling ofspatially-dependent non-stationary extremes with application tohurricane-induced wave heights. Accepted for Environmetrics.

Page 31: Threshold estimation in marginal modelling of …Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton,

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●●●●

●●

●●

●●●

●●

●●●

●●●●●

●●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

estimate of 90% quantile

x

Y

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●●●●

●●

●●

●●●

●●

●●●

●●●●●

●●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

estimate of 90% quantile

x

Y

Thank you for your attention.