Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics...

40
Statistics of Multiple Constraints: Analysis of Global Change and the Carbon Cycle Ian G. Enting MASCOS The University of Melbourne

Transcript of Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics...

Page 1: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Statistics of Multiple

Constraints:

Analysis of Global Change

and the Carbon Cycle

Ian G. Enting

MASCOS

The University of Melbourne

1

Page 2: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Acknowledgments

The ARC funds the Center of Excellence for

Mathematics and Statistics (MASCOS).

My fellowship at MASCOS is supported by CSIRO

through a sponsorship agreement.

Collaborators:

Cathy Trudinger and YingPing Wang of CSIRO Marine

and Atmospheric Research.

Roger Francey, Denis O’Brien, Peter Rayner, formerly

of CSIRO Atmospheric Research.

Workshop on Data Assimilation for the Carbon Cycle

2

Page 3: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Summary

Theme: reviewing time series analysis for:

• improvement of modelling

• understanding issues of spatial analysis

Topics:

• interpreting the carbon cycle

• inversions as statistics

• digital filtering — resolution

• possible problems — beyond Gaussian

Workshop on Data Assimilation for the Carbon Cycle

3

Page 4: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Data for Carbon Cycle Studies

• Air sampling networks interpreted by inverse

modelling;

• Satellite data, for quantities such as leaf-area index

and phenology

• Terrestrial biosphere models;

• Convective boundary layer measurements;

• Stand-level flux networks;

• Ecosystem experiments;

• Small cuvettes.

Also, potentially satellite concentration data.

From Canadell et al. 2000.Workshop on Data Assimilation for the Carbon Cycle

4

Page 5: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Key characteristics of statistics

• magnitude;

• degree of correlation between components;

• temporal correlation structure;

• spatial correlation structure;

• distribution;

• mismatches in averaging;

• contribution from model representativeness error.

From Raupach et al. 2005

Workshop on Data Assimilation for the Carbon Cycle

5

Page 6: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Interpretation

Interpretation is an inverse problem, working

backwards from results to causes.

Two main inverse problems: calibration and

data assimilation (deconvolution).

C(t) = C(0) =∫ t

0R(t− t′)S(t′) dt′

C(t) = C(0) =∫ t

0R(t′′)S(t− t′′) dt′′

The problems of deducing model response,

R(t) and forcing term S(t) are formally

equivalent, but in practice differ in the

characteristics of the statistics.

Workshop on Data Assimilation for the Carbon Cycle

6

Page 7: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Inverse Problems as Statistics

• Any uncertainty analysis needs to be based

on statistics.

• Any statistical analysis is assuming (either

implicitly or explicitly) some statistical

model.

• any variability that cannot be modelled

deterministically . . . must be . . . modelled

statistically (Enting, 2002).

Workshop on Data Assimilation for the Carbon Cycle

7

Page 8: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Aims of statistical analysis

• Compact characterisation of variability

• Design of techniques for data processing:

statistical efficiency, robustness etc

• Formalism for propagating uncertainties

through chain of calculations

• Design of new experiments:

How much is uncertainty reduced?

• Testing statistical assumptions underlying

data analysis techniques

Workshop on Data Assimilation for the Carbon Cycle

8

Page 9: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Origins of Uncertainty

For empirical quantities: ‘model’ and ‘data’

• Statistical variation

• Variability

• Inherent randomness

• Subjective judgement

• Linguistic imprecision

• Disagreement

• Approximation

From Morgan and Henrion (my split), Uncertainty (CUP, 1990).

Workshop on Data Assimilation for the Carbon Cycle

9

Page 10: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Combining information

Bayesian: Pr(x|z) ∝ Pr(z|x)Pr0(x)

Multiple constraints:

Pr(x|z1, z2) ∝ Pr(z2|x)Pr(z1|x)Pr0(x)

∝ Pr(z2|x)Pr1(x)

For linear relations with multivariate normal

distributions, constraints zj = Gjx + ej, with

inverse covariances, Xj, combine to give

inverse covariance, W , as:

W = Wprior +∑

GjTXjGj for estimate

x = [Wprior +∑

GjTXjGj]

−1[W x0 +∑

j GjTXjzj]

Workshop on Data Assimilation for the Carbon Cycle

10

Page 11: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Digital filtering

Model is zk = sk + nk where signal has power

spectrum, fs(θ) and noise has spectrum fn(θ).

Estimate signal as sk =∑

j Φjzk−j

Mean square error of estimate:

E[(sk − sk)2] = bias + variance:

MSE =∫ π

−π

[|1− φ(θ)|2fs(θ) + |φ(θ)|2fn(θ)

]dθ

Optimal filter: φ(θ) = fs(θ)/[fn(θ) + fs(θ)]

with

MSE =fs(θ)fn(θ)

fs(θ) + fn(θ)=

1

fs(θ)+

1

fn(θ)

−1

Workshop on Data Assimilation for the Carbon Cycle

11

Page 12: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Characterising resolution

Use characteristic numbers:

Nobs How many observations?

Ndata How many effectively independent observations?

Kcomp How many components used in the calculations?

Msignal How many components needed to specify

signal?

Ksynth How many components used to fit the signal?

Ms:n How many signal components exceed the noise

level?

Ktarget How many signal components is one trying to

estimate?

Expanded from Enting (2002: Section 8.3), including Msignal and

distinguishing Nobs from Ndata.Workshop on Data Assimilation for the Carbon Cycle

12

Page 13: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

More data: Ms:n < Ndata

fsig(θ)

fnoise(θ)

φopt(θ)

Integrand

of MSE

0 2012345

0 20

1

0 20

1

0 2 4 6012345

0 2 4 60

1

0 2 4 60

1

0 2 4 6 8 10 12012345

0 2 4 6 8 10 120

1

0 2 4 6 8 10 120

1

MSE ∼ 1/Ndata due to reduced aliasing in noise.Workshop on Data Assimilation for the Carbon Cycle

13

Page 14: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Correlated data: Ndata < Nobs

fsig(θ)

fnoise(θ)

φopt(θ)

Integrand

of MSE

0 2012345

0 20

1

0 20

1

0 2 4 6012345

0 2 4 60

1

0 2 4 60

1

0 2 4 6 8 10 12012345

0 2 4 6 8 10 120

1

0 2 4 6 8 10 120

1

No change in MSE unless increasing Nobs increases Ndata.Workshop on Data Assimilation for the Carbon Cycle

14

Page 15: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Aliasing: Ndata < Msignal

fsig(θ)

fnoise(θ)

φopt(θ)

Integrand

of MSE

0 2012345

0 20

1

0 20

1

0 2 4 6012345

0 2 4 60

1

0 2 4 60

1

0 2 4 6 8 10 12012345

0 2 4 6 8 10 120

1

0 2 4 6 8 10 120

1

MSE too low, due to ignoring aliasing (truncation error).Workshop on Data Assimilation for the Carbon Cycle

15

Page 16: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Aliasing: Ndata < Msignal

fsig(θ)

fnoise(θ)

φopt(θ)

Integrand

of MSE

0 2 4 6 8 10 12012345

0 2 4 6 8 10 120

1

0 2 4 6 8 10 120

1

0 2 4 6012345

0 2 4 60

1

0 2 4 60

1

0 2012345

0 20

1

0 20

1

Treating aliased signal as an error contribution.Workshop on Data Assimilation for the Carbon Cycle

16

Page 17: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Aliasing from Truncation error

(i) objective, (ii) risk, (iii) hope, (iv) Wunsch, (v)

strong priors, (vi) correction.

(i)

desired solution

a

b (ii)

solution from biased data

a

b

unifo

rm

basis

Constraintfrom data

(iii)

solution from unbiased data

a

b

(iv)

project from full space

a

b (v)

work in target space

a

b shap

edba

sis

(vi)

apply truncation error to data

a

b

Workshop on Data Assimilation for the Carbon Cycle

17

Page 18: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Smoothing splines

Fit a set of data, zn, with a smooth curve,

f(t) chosen to minimise

Θ =∑j[zj − f(tj)]

2 + λ∫ tNt1

[f ′′(t)]2 dt

Spline acts as approximate digital filter withφ(θ) = 1/[1 + (θ/θ0.5)

4]

where 50% attenuation occurs at:θ0.5 = 2π/T0.5 = [λ∆t]−1/4

Fit is linear in data, so data uncertainties can

be linearly propagated through calculations.

Workshop on Data Assimilation for the Carbon Cycle

18

Page 19: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Spline example: Law Dome CH4

Spline fit, f(t), with ±2 s.d.

data uncertainty.

Growth rate, ˙f(t), and

source estimate,˙f(t) + ˙f(t)/τ , also ±2 s.d.

Lower pre-1500 data density

implies smoother spline.

Uncertainties are uncertainty

in spline (smooth part of

source), not uncertainty in

complete source function.

Have chosen Ktarget !!

1000 1200 1400 1600 1800600

700

800

900

CH

4 (p

pb)

1000 1200 1400 1600 1800-5

0

5

10

CH

4 in

crea

se (

ppb/

yr)

1000 1200 1400 1600 180060

70

80

90C

H4

inpu

t (pp

b/yr

)

Workshop on Data Assimilation for the Carbon Cycle

19

Page 20: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Kalman filter paradigm

Mixed deterministic-stochastic model assumes evolution

of a state, x by: x(n + 1) = F (n)x(n) + u(n) + w(n)

with indirect noisy observations: z(n) = H(n)x(n) + e(n)

Where F , H and u are taken as known, and w and e are

zero-mean multivariate normal with known covariances Q

and R.

Kalman filter formalism gives the optimal estimates, x(n)

of state, given z(1) to z(n).

Combines mulitvariate normal distributions of

observations, z(n), and projection x(n|z1 . . . zn−1).

Workshop on Data Assimilation for the Carbon Cycle

20

Page 21: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Modelling for Kalman filter

State-space model for methane from ice cores:

x1 is methane concentration, x2 is source.

F =

1−∆t/τ a∆t

0 1

Q =

0 00 Q

H = [1,0] (or ([0,0] if no data).

Data uncertainty R, unit conversion factor, a.

Thus, source is modelled as a ‘random walk’.

Simplified from Trudinger et al, 2002.

Workshop on Data Assimilation for the Carbon Cycle

21

Page 22: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Kalman filter response

Frequency domain characterisation of stationary

case of Kalman filter

E.g. noise as white

noise, fn = R/2π

Random walk model of

forcing

fs ∝ 1/(1− cos θ) 0.00 0.05 0.10 0.15 0.20 0.25frequency (y-1)

0.0

0.2

0.4

0.6

0.8

1.0

1.2

freq

uenc

y re

spon

se

20.0 10.0 6.7 5.0 4.0period (y)

R/Q ratio (9/81, 25/81, 49/36 for ∆t = 2, and

49/1 for ∆t = 20, right to left) changes the

relative weights of ‘prior’ and observations in

frequency-dependent manner.Workshop on Data Assimilation for the Carbon Cycle

22

Page 23: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Kalman filter on CO2

Combined model, including

CO2 and 13CO2, using

concentration values

corrected for firn diffusion.

Case on left uses published

δ13C uncertainties.

Case on right multiplies

these by four.

Again, scale of uncertainty

affects the optimal

smoothing.

(From Trudinger et al., 2002).

1850 1900 1950 2000280

300

320

340

360

CO

2 (

ppm

)

a)

1850 1900 1950 2000Year

-8.0

-7.5

-7.0

-6.5

-6.0

δ13C

(pe

rmil)

b)

1850 1900 1950 2000Year

-3

-2

-1

0

1

Sour

ce (

Gt C

/yr)

c)

Deduced sources

oceanbiosphere

1850 1900 1950 2000280

300

320

340

360

CO

2 (

ppm

)

d)

1850 1900 1950 2000Year

-8.0

-7.5

-7.0

-6.5

-6.0

δ13C

(pe

rmil)

e)

1850 1900 1950 2000Year

-3

-2

-1

0

1

Sour

ce (

Gt C

/yr)

f)

Deduced sources

oceanbiosphere

Workshop on Data Assimilation for the Carbon Cycle

23

Page 24: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Time-dependent CO2 inversion

As with simple Kalman filter case, statistical

assumptions about structure of errors in time

can greatly influence what is estimated:

• Synthesis in terms of independent monthly

pulses (Rayner et al 1999), effectively

assumes no long-term systematic error.

• Representing prior information as

’mean-plus-anomaly’ avoids artificially-low

uncertainty on posterior mean.

Workshop on Data Assimilation for the Carbon Cycle

24

Page 25: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

The hard stuff: spatial statistics

• Two-dimensional, (or 2 + 1 when time

involved)

– more modes to consider

– more complex statistics are possible

• Non-stationary (c.f. Kalman filter)

• Response (atmospheric transport) involved

for concentration data (c.f. Kalman filter)

Workshop on Data Assimilation for the Carbon Cycle

25

Page 26: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Spatial statistics:

In more than one

dimension, more complex

behaviour can occur.

Simple local conditional

dependence can lead to

long-range (fractal)

behaviour.

Closed-form expressions

for model statistics are

seldom known.

Identifying appropriate

statistical models is

correspondingly hard.Example from statistical physics, showing multiple length-scales.

Workshop on Data Assimilation for the Carbon Cycle

26

Page 27: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Terrestrial distribution

In process inversions, biome-specific distributions will

be modulated by climatic variations.

Map data from Matthews 1983.Workshop on Data Assimilation for the Carbon Cycle

27

Page 28: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Grassland footprint

Distribution of grasslands reflects a 13C signal due to

C3-C4 differences.

Workshop on Data Assimilation for the Carbon Cycle

28

Page 29: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Number of modes

In an ill-conditioned inverse problem, only a

limited number of modes will be resolved by

the data.

• these may not be the modes that you wantto know about

• you don’t get to choose which modes areresolved

• c.f. biased estimates by Fan et al. byputting fine source discretisation in region(Nth America) without corresondingly finedata set.

Workshop on Data Assimilation for the Carbon Cycle

29

Page 30: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Semi-quantitative

An exploratory signal-to-noise analysis doesn’t

need to be as precise as actual inversion.

x =∑j

ajzj

typically with lots on near-cancellations in an

ill-conditioned problem.

E[(x− x)2] =∑j|aj|2Rj

and so the variance calculation is much less

sensitive to errors in the aj (i.e. model error).

Workshop on Data Assimilation for the Carbon Cycle

30

Page 31: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Toy model

• Model atmospheric transport as pure

anisotropic diffusion

• Eigenmodes are spherical harmonics.

Amplification factor for relating surface

sources to surface concentration response is:

γnmk tanh(γnmk) where

γ2nmk

(p0 − p1)2=

κθ[n(n + 1) + αM2]

κpR2earth

+2πik

κpT

Vertical, N-S and E-W diffusion: κp, κθ, ακθ

Workshop on Data Assimilation for the Carbon Cycle

31

Page 32: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Reality check

• Amplitude of responses for full 3-D model

are approximated well by the 1/n response

to latitudinal variation

• Inversion factor ∝ n comparable to

numerical differentiation,

• Suggests that latitudinally-integrated flux

could have much less correlation, than the

actual flux estimates – confirmed by results

from actual inversions

Workshop on Data Assimilation for the Carbon Cycle

32

Page 33: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Generalities

• Satellite data don’t have the

1/√n(n + 1) + αm2 attenuation factor

(but vertical averages are still attenuated

relative to surface distribution)

Low precision can give valuable constraints.

• For comparable spatial resolution, E-W

sampling density possibly needs to be

greater than N-S density (however, consider

re-visiting this, with ‘toy model’ including

‘solid-body rotation’, for advective term).

Workshop on Data Assimilation for the Carbon Cycle

33

Page 34: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

From point to globe

Observations:Fluxes, forcings and proxies

↓Local parametric model

↓← Global distribution

↓ of forcing and/or proxy

Global model

Contributions to uncertainty in global model are:

global distribution * parametric uncertainty +

parametric sensitivity * uncertainty in distribution

Workshop on Data Assimilation for the Carbon Cycle

34

Page 35: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

What if it ain’t so?

If Pr(z|x) ∝ exp(−λ|z − h(x)|) and not

Pr(z|x) ∝ exp(−λ|z − h(x)|2) then

• 90% of the ‘nice’ mathematics goes away

• Results of inversions are terrible

It’s not the things you don’t know that get you into

trouble — it’s the things you know for sure that just

ain’t so. (Mark Twain)

Workshop on Data Assimilation for the Carbon Cycle

35

Page 36: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Central value

For Pr(z|θ) = exp(−|z − θ|2/σ2) then

maximum likelihood estimate is

θN = 1N

∑n zn batch mode

θn+1 = θn + 1n+1

[zn+1 − θn

]recursive

Each estimate θn contains all information about

all previous data values needed for optimal

estimator for the next step (given new data).

For Pr(z|θ) = exp(−|z − θ|/σ) the maximum

likelihood estimate θn is median of all previous

zm ( m ≤ n) — no compact stepwise algorithm.

Workshop on Data Assimilation for the Carbon Cycle

36

Page 37: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Implications for inversions

If Pr(z|x) ∝ exp(−λ|z − h(x)|) then

1: 90% of the ‘nice’ mathematics goes away

• you don’t have compact ‘sufficient

statistics’

• multi-variate cases are really messy

• automatic differentiation isn’t a lot of use

2: Results of inversions are terrible

• major features depend on how much you

believe Samoa

Workshop on Data Assimilation for the Carbon Cycle

37

Page 38: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Emerging future directions

Inversion methodology

Shrinkage estimators :

Estimating variances :

Better statistics

Errors-in variables : e.g. using proxy source

data.

Hidden Markov models :

Financial time series models : e.g.

Stochastic volatility.

Workshop on Data Assimilation for the Carbon Cycle

38

Page 39: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Concluding thoughts:

Take-home questions:

• What are your calculations really

estimating?

• Can what you want really be estimated

given limits to resolution imposed by

model, data, ill-conditioning and

signal-to-noise ratio?

• Are the results, including residuals,

consistent with the statistical assumptions?

Workshop on Data Assimilation for the Carbon Cycle

39

Page 40: Statistics of Multiple Constraints: Analysis of Global ...inez/MSRI-NCAR_CarbonDA/... · Statistics of Multiple Constraints: Analysis of Global Change ... Ian G. Enting MASCOS The

Further Information

• I. Enting: Characterising the Temporal Variability of

the Global carbon Cycle. CSIRO Atmospheric

Research, Technical paper 40.

• I. Enting: Inverse Problems in Atmospheric

Constituent Transport. 2002, CUP.

• C. Rodenbeck: Estimating CO2 sources and sinks

. . . . MPI-BGC Technical Report 6.

See also:

• Trudinger et al (2002a,b): Kalman filter analysis of

ice-core data: 1 and 2. JGR

• Enting, Trudinger and Etheridge: Propagating data

uncertainty through smoothing spline fits. Tellus:

(in press, 2006).

Workshop on Data Assimilation for the Carbon Cycle

40