New Methodology for Statistical Imputation in the FAOSTAT ...

15
http://www.foodsec.org New Methodology for Statistical Imputation in the FAOSTAT Production Domain Onno Hoffmeister, Statistician, ESS Rome, 11 April 2013 Food Security Information for Decision-Making

Transcript of New Methodology for Statistical Imputation in the FAOSTAT ...

Page 1: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

New Methodology for Statistical Imputation in the

FAOSTAT Production Domain

Onno Hoffmeister, Statistician, ESS Rome, 11 April 2013

Food Security Information for Decision-Making

Page 2: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Contents

• Context

• Description of the Methodology

• Evaluation of the Results

Page 3: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Scope of Statistical Imputations

*) Data on production and input factors of primary agricultural products in FAOSTAT from 2000 to 2010.

Focus of this presentation

Page 4: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Expert Estimates

Based on the best available knowledge of the Country Statistician, taking into account:

• background knowledge of the conditions in the country (studies and reports, country visits, country experience, …)

• fit with other elements of the SUA

• technical conversion factors in other years

• time trend of the data series

Page 5: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Potential Gains from Statistical Imputation

• economization of work load

• objectivity

• accuracy

Page 6: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Concretization of the Aim

Common objectives of imputations (Kalton and Kasprzyk 1982):

• preserving the distribution

• deriving reliable sample estimators

• “assigning values at the micro level and thereby allowing

analyses to be conducted as if the data set were complete”

Main objective for FAOSTAT

Page 7: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Applied Methodology

Growth rate is benchmarked on a related aggregate.

1

1

1,

,

1,

2,

,

1,

0

0

0

0

0

0 −

=∑∑

∑∑

∑∑

≠−

≠+−

≠+−

≠−

≠+−

l

cctc

cctc

ccltc

ccltc

ccltc

ccltc

y

y

y

y

y

yr

( )lltctc ryy += − 1ˆ ,, 00

Page 8: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Applied Methodology

1995 2000 2005

observed

Page 9: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Applied Methodology

1995 2000 2005

observed

aggregate

Page 10: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Applied Methodology

1995 2000 2005

observed

aggregate

imputed ltyr −_,

ltcyr −,0

Page 11: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Selection of the Aggregate to Benchmark on

Priority Aggregate

1. Same country / same commodity

2. Sub-region aggregate / same commodity

3. Sub-region aggregate / commodity aggregate

4. Region aggregate / same commodity

5. Region aggregate / commodity aggregate

Page 12: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Results - Coverage

Scope: FAOSTAT data in the production domain, 1990-2009.

Page 13: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Results - Accuracy

00

50

100

150

200

250

300

350

400

450

1 year 3 years 5 years

RMSE

(%

)

Time behind series end

Area harvested

Trend smoothing Benchmarking

00

50

100

150

200

250

300

350

400

450

1 year 3 years 5 years

RMSE

(%

) Time behind series end

Crops production

Trend smoothing Benchmarking

Page 14: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Work proceeds …

• Review of imputed values during data compilation

• Discussion of the approach at conferences and workshops

• Further refinement

- fit with other SUA elements

- application in time-series gaps

- broadening of the scope

• Development and testing of other approaches

Page 15: New Methodology for Statistical Imputation in the FAOSTAT ...

The EC-FAO Food Security Information for Decision Making Programme http://www.foodsec.org

Thank you very much for your attention!