Empirical Likelihood confidence intervals under unequal probability sampling

Yves G. Berger

Omar De La Riva Torres

Design-based inference without re-sampling, linearisation and variance estimation

Outline

● Issues with standard Confidence Intervals

● A new Empirical Likelihood approach:

➔ Point estimation

➔ Estimation of Confidence Intervals

● Simulations

● European Income & Living Condition Survey 2009 (NET-SILC2, EUROSTAT)

● Concluding remarks

Issues with standard Confidence Intervals

● Skewed variables

→ Skewed sampling distributions

→ Poor coverages of Standard CI

→ Linearised variance estimates can be poor

● Example: Income/wealth variables

Domains, Extreme values

Measures of poverty, Quantiles
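The coverage problem can be illustrated with a small Monte Carlo sketch. It is not taken from the slides: the log-normal population, the sizes N = 800 and n = 30, and the use of simple random sampling without replacement (rather than unequal probabilities) are all assumptions chosen to keep the example short.

```python
import math
import random
import statistics

def standard_ci_coverage(N=800, n=30, reps=2000, seed=1):
    """Monte Carlo coverage of the normal-theory 95% CI for a mean under SRSWOR."""
    rng = random.Random(seed)
    # Hypothetical skewed "income" population (the log-normal shape is an assumption).
    population = [rng.lognormvariate(0.0, 1.5) for _ in range(N)]
    true_mean = statistics.fmean(population)
    hits = 0
    for _ in range(reps):
        sample = rng.sample(population, n)  # simple random sample without replacement
        ybar = statistics.fmean(sample)
        # Standard error with the finite population correction (1 - n/N).
        se = math.sqrt((1 - n / N) * statistics.variance(sample) / n)
        if ybar - 1.96 * se <= true_mean <= ybar + 1.96 * se:
            hits += 1
    return hits / reps

coverage = standard_ci_coverage()
print(f"Estimated coverage of the nominal 95% interval: {coverage:.3f}")
```

With a population this skewed, the printed coverage would be expected to fall short of the nominal 95%, which is the issue the slide describes.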

Example: Confidence interval of a 10% quantile

● Skewed population (exponential)

● Skewed sampling distribution

95% CI based upon:               n = 80, N = 800   n = 80, N = 150
Linearisation                    98%               99.8%
Rescaled Bootstrap               97%               99%
Direct Bootstrap                 93%               90%
Woodruff                         93%               94%
Proposed Empirical Likelihood    96%               94%
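For readers unfamiliar with the Woodruff interval listed in the table, the idea is to build a confidence interval for the distribution function at the quantile and then invert it. The sketch below assumes simple random sampling without replacement, which is simpler than the unequal probability designs of the talk; the function name `woodruff_ci` and the simulated data are hypothetical.

```python
import math
import random

def woodruff_ci(sample, p, N, z=1.96):
    """Woodruff-type CI for the p-quantile under simple random sampling without replacement."""
    y = sorted(sample)
    n = len(y)

    def quantile(prob):
        # Inverse of the empirical distribution function, clipped to the sample range.
        k = math.ceil(n * min(max(prob, 0.0), 1.0))
        return y[min(max(k, 1), n) - 1]

    # Standard error of the estimated distribution function at the quantile.
    se = math.sqrt((1 - n / N) * p * (1 - p) / n)
    return quantile(p - z * se), quantile(p), quantile(p + z * se)

rng = random.Random(7)
population = [rng.expovariate(1.0) for _ in range(800)]  # skewed (exponential) population
sample = rng.sample(population, 80)
lower, estimate, upper = woodruff_ci(sample, p=0.10, N=800)
print(lower, estimate, upper)
```

The interval limits are always order statistics of the sample, so the interval is range preserving but can only move in jumps.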

Example: Estimation of a mean with auxiliary variables

● Skewed data, N = 150

               n = 40   n = 80
Standard       91%      93%
Pseudo-EL1     94%      94%
Pseudo-EL2     87%      89%
Proposed EL    95%      94%

Example: Persistent risk of Poverty (European Income & Living Condition Survey 2009)

● Male 25 yo – 44 yo

[Figure legend: Standard, Emp. Likelihood]

New Empirical Likelihood Approach

Does not involve

● variance estimates

● Linearisation

● Re-sampling

● normality of the point estimator

● negligible sampling fractions

Remark: Pseudo-EL ≠ proposed EL

Pseudo-EL relies on variance estimates

The parameter of interest

● Population parameter: solution of estimating equations

● Examples: Mean, Total, Ratio, Quantiles, M-estimator, Poverty indicators, regression, Winsorisation ...

The estimating function does not need to be differentiable!
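A minimal sketch of what "solution of estimating equations" means for two of the listed examples. The function names, the tiny made-up population and the convention for solving the non-smooth quantile equation (smallest value at which the sum is non-negative) are assumptions for illustration only.

```python
def g_mean(y, theta):
    """Estimating function of a mean: smooth in theta."""
    return y - theta

def g_quantile(y, theta, alpha):
    """Estimating function of the alpha-quantile: a step function of theta."""
    return (1.0 if y <= theta else 0.0) - alpha

population = [1.0, 2.0, 2.5, 4.0, 7.0, 9.0, 15.0, 30.0]  # made-up skewed values

# Mean: the equation sum_i (y_i - theta) = 0 has a closed-form root.
theta_mean = sum(population) / len(population)

# First quartile: the sum jumps with theta, so take the smallest y where it is >= 0.
alpha = 0.25
theta_q = min(t for t in population
              if sum(g_quantile(y, t, alpha) for y in population) >= 0)

print(theta_mean, theta_q)
```

The quantile case shows why differentiability is not required: the estimating function is an indicator, yet the parameter is still well defined.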

Proposed Empirical Likelihood Approach

● Empirical likelihood function:

● = Unit mass of unit
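The formula itself did not survive on this slide. A common way of writing the empirical log-likelihood in this setting is sketched below; the symbols $s$ (the sample) and $m_i$ (the unit mass attached to unit $i$) are notational assumptions, not a transcription of the slide.

```latex
\ell(m) = \sum_{i \in s} \log(m_i), \qquad m_i > 0 \ \text{for all } i \in s .
```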

Proposed Empirical Likelihood Approach

● Maximise

Under the constraint

Design + auxiliary ● Example:

auxiliary variables

strat. variables

Empirical log-likelihood ratio function (deviance)

"Reduced" "Full"

● Maximum under

Maximum Empirical Likelihood Estimator

● Maximum EL Estimator of minimises

Maximum EL Estimator is the solution of

The maximise under

● Maximise

under the constraint and

● Solution:

● Consider that

always holds
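The maximisation can be sketched with a standard Lagrange-multiplier argument. The notation is an assumption, since the slide formulas are missing: $c_i$ collects the design and auxiliary variables of unit $i$, $C$ is the known right-hand side of the constraint, and $\eta$ is the multiplier.

```latex
\begin{aligned}
&\max_{m_i > 0} \ \sum_{i \in s} \log(m_i)
  \quad \text{subject to} \quad \sum_{i \in s} m_i c_i = C, \\
&\frac{\partial}{\partial m_i}\Big\{ \sum_{j \in s} \log(m_j)
  - \eta^\top \Big( \sum_{j \in s} m_j c_j - C \Big) \Big\} = 0
  \ \Longrightarrow \ \widehat{m}_i = \frac{1}{\eta^\top c_i}, \\
&\text{where } \eta \text{ solves } \sum_{i \in s} \frac{c_i}{\eta^\top c_i} = C .
\end{aligned}
```

In the simplest case $c_i = \pi_i$ and $C = n$, the last equation reads $n / \eta = n$, so $\eta = 1$ and $\widehat{m}_i = 1/\pi_i$, the usual design weights.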

Examples of Maximum Empirical likelihood estimators

● Example 1: "model" with just an intercept

Hájek Estimator

GREG if contains auxiliary variables

Examples of Maximum Empirical likelihood estimators

● Example 2: Ratio "model"

Horvitz-Thompson estimator

● Example 3: Auxiliary variables within

Optimal GREG

● Example 4:

Kim (2009) EL
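Examples 1 and 2 can be checked numerically. The sketch assumes estimating functions of the form g_i(θ) = y_i − θ·a_i, with a_i = 1/N for the intercept-only "model" and a_i = π_i/n for the ratio "model", and design weights m_i = 1/π_i; these forms, the helper name `mele_total` and the data are assumptions, as the slide formulas are missing.

```python
def mele_total(y, pi, a):
    """Root of sum_i m_i * (y_i - theta * a_i) = 0 with design weights m_i = 1 / pi_i."""
    num = sum(yi / p for yi, p in zip(y, pi))
    den = sum(ai / p for ai, p in zip(a, pi))
    return num / den

y = [12.0, 5.0, 30.0, 8.0]      # made-up sample values
pi = [0.20, 0.10, 0.40, 0.10]   # made-up inclusion probabilities
n, N = len(y), 25               # N is a hypothetical population size

hajek = mele_total(y, pi, [1.0 / N] * n)      # intercept only: a_i = 1/N
ht = mele_total(y, pi, [p / n for p in pi])   # ratio to pi_i:  a_i = pi_i/n

print(hajek, ht)
```

Under these assumptions the ratio "model" returns the Horvitz-Thompson total (the sum of y_i/π_i), while the intercept-only "model" returns the Hájek total, N times the weighted mean.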

pps sampling (with replacement)

● Under regularity conditions

under pps sampling

Empirical Likelihood Confidence Intervals (pps sampling)

● Confidence intervals (Wilks' type)
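A Wilks-type interval collects the parameter values whose deviance stays below a chi-squared quantile. The sketch below is one way to compute it for a mean, under several assumptions that are not on the slide: with-replacement pps sampling, no auxiliary variables, the single design constraint that the m_i π_i sum to n, and the estimating function g_i(θ) = y_i − θ. In that case the constrained masses reduce to m_i = 1/(π_i + η g_i) with a scalar η, so plain bisection is enough. The function names and data are hypothetical.

```python
import math
import random

CHI2_1_95 = 3.841  # 0.95 quantile of the chi-squared distribution with 1 d.f.

def el_ratio(theta, y, pi):
    """Deviance r(theta) = 2 * sum_i log(1 + eta * g_i / pi_i), with g_i = y_i - theta."""
    g = [yi - theta for yi in y]
    # eta must keep every pi_i + eta * g_i positive: search strictly inside (lo, hi).
    lo = max(-p / gi for gi, p in zip(g, pi) if gi > 0)
    hi = min(-p / gi for gi, p in zip(g, pi) if gi < 0)
    for _ in range(200):
        eta = 0.5 * (lo + hi)
        # sum_i m_i g_i with m_i = 1 / (pi_i + eta * g_i) is decreasing in eta.
        if sum(gi / (p + eta * gi) for gi, p in zip(g, pi)) > 0:
            lo = eta
        else:
            hi = eta
    eta = 0.5 * (lo + hi)
    return 2.0 * sum(math.log(1.0 + eta * gi / p) for gi, p in zip(g, pi))

def bisect_limit(inside, outside, y, pi):
    """Locate the theta between `inside` and `outside` where the deviance hits the cutoff."""
    for _ in range(60):
        mid = 0.5 * (inside + outside)
        if el_ratio(mid, y, pi) < CHI2_1_95:
            inside = mid
        else:
            outside = mid
    return inside

rng = random.Random(3)
y = [rng.expovariate(1.0) for _ in range(40)]  # made-up skewed sample
pi = [0.05 + 0.02 * yi for yi in y]            # made-up probabilities, larger for large y
theta_hat = sum(yi / p for yi, p in zip(y, pi)) / sum(1.0 / p for p in pi)  # Hajek mean
lower = bisect_limit(theta_hat, min(y), y, pi)
upper = bisect_limit(theta_hat, max(y), y, pi)
print(lower, theta_hat, upper)
```

No variance estimate, linearisation or re-sampling appears anywhere in the computation, and the interval cannot extend beyond the range of the observed values.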

Empirical Likelihood Confidence Intervals

EL relies on normality of the estimating equation when !

Remark: with auxiliary variables, GREG instead of HT

● The point estimator does not have to be normal or unbiased

stronger & harder to justify

Without auxiliary variables

"Reduced" "Full"

With auxiliary variables

With auxiliary variables + Stratification

πps sampling (without replacement)

under Hájek (1964) asymptotic framework "High entropy"

πps sampling (without replacement)

● reduce the effect on the CI of units with large inclusion probabilities (finite population corrections)

● not needed

● not adjusted by parameters that need to be estimated

● Can be extended with auxiliary variables

Simulations

● Population data (skewed) Rao & Wu (2006)

● and ~ exponential●

● Value to control correlation( , )

Coverage Prob. Mean. N=800. No Auxil. Var.

Coverage Prob. Mean. N=150. No Auxil. Var.

Coverage Prob. Mean. N=150. With Auxil. Var.

Variance Length CI. Mean. N=150. With Auxil.

Coverage Prob. 1st Quartile. N=800. No Auxil.

Variance Length CI. 1st Quartile. N=800. No Auxil.

Coverage Prob. 1st Quartile. N=150. No Auxil.

Variance Length CI. 1st Quartile. N=150. No Auxil.

Persistent risk of Poverty (European Income & Living Condition Survey 2009)

● Male 25 yo – 44yo (multi-stage designs)

New EL versus Bootstrap

● Does not need re-sampling. "Simpler than bootstrap"

● Wider class of parameters compared to bootstrap.

● More stable CI than direct bootstrap

● Better coverage than bootstrap

● EL includes design information (inclusion probabilities, stratification, clusters).

● EL intervals take into account the bias of the point estimator

New EL versus Pseudo-Empirical likelihood

● New EL ≠ Pseudo-EL

● The pseudo-EL function is not a standard EL function

● CI relies on variances (design effect)

● May need N for totals and counts

● Limited range of parameters with pseudo-EL. E.g. no pseudo-EL CI for quantiles (only Woodruff)

● More stable CI than pseudo-EL

● Design information through a design effect (estimated)

● Range preserving and good coverages

New EL versus Calibration

● Equivalent point estimator.

● EL can be used without auxiliary information

● EL can be used for testing, CI, p-values

● EL can be used with "calibration weights" (same point estimates)

● Calibration relies on CLT & variances

● Calibration relies on a distance function disconnected from mainstream statistics

Extensions

● Multi-stage sampling (OK for small sampling fractions)

● Rao-Hartley-Cochran design

● Modelling: design naturally included, random effects not needed

● Conditional Estimating Equations

Example

Can't be solved with estimating equations

Extensions

● Re-weighting (Total nonresponse)

● Random Hot-deck imputation

● Calibration on known quantiles or on distribution functions

                                                STD   Bootstrap RS   Bootstrap Direct   Emp. Lik.
Design based                                    √     √              √                  √
Does not rely on normality of Point estimator   ×     √              ×                  √
Does not need variance estimates                ×     √              ×                  √
Does not need re-sampling                       √     ×              ×                  √
Does not need linearisation                     ×     √              √                  √
Range preserved                                 ×     √              ×                  √
Take into account sampling distribution         ×     √              ×                  √
Take into account of the design                 √     √              ?√                 √
Suitable with large sampling fractions          √     ×              √                  √
Complex Parameters                              √     √              ?                  √

Concluding Remarks

● Does not involve variance estimation & Linearisation

● Design based (non-parametric)

● Flexible and general approach (complex parameters, modelling)

● Does not rely on normality of the point estimator

● Better coverage for confidence intervals (better inference)

● EL intervals take into account the bias

References

BERGER & DE LA RIVA TORRES (2012).

http://eprints.soton.ac.uk/337688/

BERGER & DE LA RIVA TORRES (2012).

Proceedings of the Survey Research Method Section of the American Statistical Association, Joint Statistical Meeting, San Diego

OSIER, BERGER and GOEDEMÉ (2013)

Standard error estimation for the EU-SILC indicators of poverty and social exclusion. Eurostat “Methodologies and Working papers” series

Regularity conditions
