Detecting local dependence in latent class models

94
Example LCA Local dependence BVR and MI EPC Conclusions References Detecting local dependence in latent class models Daniel Oberski Department of methodology and statistics (Based on joint work with Jeroen Vermunt and Geert Van Kollenburg) Detecting local dependence in latent class models Daniel Oberski

Transcript of Detecting local dependence in latent class models

Page 1: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Detecting local dependence inlatent class models

Daniel Oberski

Department of methodology and statistics

(Based on joint work with Jeroen Vermunt and Geert Van Kollenburg)

Detecting local dependence in latent class models Daniel Oberski

Page 2: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Latent class analysis

• Latent class analysis (LCA) used for: model-basedclassification, clustering, latent structure analysis,estimating false positives and false negatives, (specificityand sensitivity) of error-prone variables;

• assumes local independence;

• If local dependence, severe bias can occur.

Detecting local dependence in latent class models Daniel Oberski

Page 3: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Latent class analysis

• Latent class analysis (LCA) used for: model-basedclassification, clustering, latent structure analysis,estimating false positives and false negatives, (specificityand sensitivity) of error-prone variables;

• assumes local independence;• If local dependence, severe bias can occur.

Detecting local dependence in latent class models Daniel Oberski

Page 4: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Latent class analysis

• Latent class analysis (LCA) used for: model-basedclassification, clustering, latent structure analysis,estimating false positives and false negatives, (specificityand sensitivity) of error-prone variables;

• assumes local independence;• If local dependence, severe bias can occur.

Detecting local dependence in latent class models Daniel Oberski

Page 5: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Detecting local dependence in LCA

• Local dependence in LCA leads to model misfit;• Usual reaction: increase number of classes;

This talk:• Do not increase number of classes;• Model the local dependencies directly;

• Use specifically tailored measures to detect which localdependencies should be modeled:

• Bivariate residuals (BVR),• Score test or ``modification index'' (MI),• Expected parameter change (EPC).

Detecting local dependence in latent class models Daniel Oberski

Page 6: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Detecting local dependence in LCA

• Local dependence in LCA leads to model misfit;• Usual reaction: increase number of classes;

This talk:• Do not increase number of classes;• Model the local dependencies directly;• Use specifically tailored measures to detect which localdependencies should be modeled:

• Bivariate residuals (BVR),• Score test or ``modification index'' (MI),• Expected parameter change (EPC).

Detecting local dependence in latent class models Daniel Oberski

Page 7: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Detecting local dependence in LCA

• Local dependence in LCA leads to model misfit;• Usual reaction: increase number of classes;

This talk:• Do not increase number of classes;• Model the local dependencies directly;• Use specifically tailored measures to detect which localdependencies should be modeled:

• Bivariate residuals (BVR),

• Score test or ``modification index'' (MI),• Expected parameter change (EPC).

Detecting local dependence in latent class models Daniel Oberski

Page 8: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Detecting local dependence in LCA

• Local dependence in LCA leads to model misfit;• Usual reaction: increase number of classes;

This talk:• Do not increase number of classes;• Model the local dependencies directly;• Use specifically tailored measures to detect which localdependencies should be modeled:

• Bivariate residuals (BVR),• Score test or ``modification index'' (MI),

• Expected parameter change (EPC).

Detecting local dependence in latent class models Daniel Oberski

Page 9: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Detecting local dependence in LCA

• Local dependence in LCA leads to model misfit;• Usual reaction: increase number of classes;

This talk:• Do not increase number of classes;• Model the local dependencies directly;• Use specifically tailored measures to detect which localdependencies should be modeled:

• Bivariate residuals (BVR),• Score test or ``modification index'' (MI),• Expected parameter change (EPC).

Detecting local dependence in latent class models Daniel Oberski

Page 10: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Detecting local dependence in LCA

• Local dependence in LCA leads to model misfit;• Usual reaction: increase number of classes;

This talk:• Do not increase number of classes;• Model the local dependencies directly;• Use specifically tailored measures to detect which localdependencies should be modeled:

• Bivariate residuals (BVR),• Score test or ``modification index'' (MI),• Expected parameter change (EPC).

Detecting local dependence in latent class models Daniel Oberski

Page 11: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

..1 Example LCA

..2 Local dependence

..3 BVR and MI

..4 EPC

..5 Conclusions

Detecting local dependence in latent class models Daniel Oberski

Page 12: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Example LCA: dentists rating caries in xrays

Detecting local dependence in latent class models Daniel Oberski

Page 13: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Example LCA: dentists rating caries in xrays

Here is an xray of a person who possibly has caries:

Detecting local dependence in latent class models Daniel Oberski

Page 14: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Example LCA: dentists rating caries in xrays

Here is an xray of a person who possibly has caries:

Detecting local dependence in latent class models Daniel Oberski

Page 15: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Example LCA: dentists rating caries in xrays

Detecting local dependence in latent class models Daniel Oberski

Page 16: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Example LCA: dentists rating caries in xrays

y1 : Yes (1)

y2 : No (0)

y3 : No (0)

y4 : Yes (1)

Detecting local dependence in latent class models Daniel Oberski

Page 17: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Example LCA: dentists rating caries in xrays

y1 : Yes (1)

y2 : No (0)

y3 : No (0)

y4 : Yes (1)

Detecting local dependence in latent class models Daniel Oberski

Page 18: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Example LCA: dentists rating caries in xrays

y1 : Yes (1)

y2 : No (0)

y3 : No (0)

y4 : Yes (1)

Detecting local dependence in latent class models Daniel Oberski

Page 19: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Example LCA: dentists rating caries in xrays

y1 : Yes (1)

y2 : No (0)

y3 : No (0)

y4 : Yes (1)

Detecting local dependence in latent class models Daniel Oberski

Page 20: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Actual dentists' ratings collected by Espeland &Handelman (1989)

> dentistsVar1 Var2 Var3 Var4 Var5 Observed

1 0 0 0 0 0 18802 0 0 0 0 1 7893 0 0 0 1 0 434 0 0 0 1 1 755 0 0 1 0 0 236 0 0 1 0 1 637 0 0 1 1 0 88 0 0 1 1 1 229 0 1 0 0 0 18810 0 1 0 0 1 19111 0 1 0 1 0 17... etc.

Detecting local dependence in latent class models Daniel Oberski

Page 21: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

2-class local independence model for dentists

Detecting local dependence in latent class models Daniel Oberski

Page 22: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Local independence latent class model

Patternwise likelihood:

Pr(Y = y) =∑t

Pr(Y = y|ξ = t)Pr(ξ = t), (1)

where the conditional probability of the response patterns is

Pr(Y = y|ξ = t) = exp(Xθ)

1T exp(Xθ)(2)

With X a design matrix containing:• Observed variables main effects (intercepts);• Latent class × observed variables interaction.

Detecting local dependence in latent class models Daniel Oberski

Page 23: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Estimation of local independence latent class model

Log-likelihoodℓ(θ) = nT logPr(Y = y),

with vector n the observed frequency of each responsepattern. Total sample size is N. Maximum likelihood estimates

θ̂ = argmaxθ∈Rq

ℓ(θ)

by expectation-maximization, quasi-Newton, or a combination.This gives expected frequencies µ̂ := N · Pr(Y = y|θ = θ̂)

Detecting local dependence in latent class models Daniel Oberski

Page 24: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

2-class local independence model for dentists' ratings

Possible goals:

Goal: How LCA is useful:* See how well dentists rate xrays ``sensitivity'' & ``specificity''

(conditional probabilities)from intercepts and slopes

* Classify each photograph Pattern posteriors giveprobabilistic classification

* Estimate ``true'' caries prevalence Latent class proportions

Local dependence biases all of these results.

Detecting local dependence in latent class models Daniel Oberski

Page 25: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

2-class local independence model for dentists' ratings

Possible goals:

Goal: How LCA is useful:* See how well dentists rate xrays ``sensitivity'' & ``specificity''

(conditional probabilities)from intercepts and slopes

* Classify each photograph Pattern posteriors giveprobabilistic classification

* Estimate ``true'' caries prevalence Latent class proportions

Local dependence biases all of these results.

Detecting local dependence in latent class models Daniel Oberski

Page 26: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Questions for you

• What does the latent class variable represent?• And if the number of classes is increased?

Model fit:

L2 X2 df BIC AIC AIC3 CAIC129.85 132.00 20 -35.37 89.85 69.85 -35.37

• The model does not fit. What could be a reason?• What should be done?

Detecting local dependence in latent class models Daniel Oberski

Page 27: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Questions for you

• What does the latent class variable represent?• And if the number of classes is increased?

Model fit:

L2 X2 df BIC AIC AIC3 CAIC129.85 132.00 20 -35.37 89.85 69.85 -35.37

• The model does not fit. What could be a reason?• What should be done?

Detecting local dependence in latent class models Daniel Oberski

Page 28: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Questions for you

• What does the latent class variable represent?• And if the number of classes is increased?

Model fit:

L2 X2 df BIC AIC AIC3 CAIC129.85 132.00 20 -35.37 89.85 69.85 -35.37

• The model does not fit. What could be a reason?• What should be done?

Detecting local dependence in latent class models Daniel Oberski

Page 29: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Local dependence latent class models

Detecting local dependence in latent class models Daniel Oberski

Page 30: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

2-class local dependence model for dentists

Detecting local dependence in latent class models Daniel Oberski

Page 31: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Local dependence latent class model

Conditional probability of the response patterns still

Pr(Y = y|ξ = t) = exp(Xθ)

1T exp(Xθ)

But now X is a design matrix containing:• Observed variables main effects (intercepts);• Latent class × observed variables interaction;• (Some) observed variables × observed variables 2-wayinteractions;

Detecting local dependence in latent class models Daniel Oberski

Page 32: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

• Can we find a locally dependent two-class model that fits?• Which local dependence parameters are needed?

•• Goal: Examine fit of each pair of variables (dentist ratings)to the local independence assumption without fitting allpossible alternative models.

Detecting local dependence in latent class models Daniel Oberski

Page 33: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

• Can we find a locally dependent two-class model that fits?• Which local dependence parameters are needed?

•• Goal: Examine fit of each pair of variables (dentist ratings)to the local independence assumption without fitting allpossible alternative models.

Detecting local dependence in latent class models Daniel Oberski

Page 34: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

• Can we find a locally dependent two-class model that fits?• Which local dependence parameters are needed?

•• Goal: Examine fit of each pair of variables (dentist ratings)to the local independence assumption without fitting allpossible alternative models.

Detecting local dependence in latent class models Daniel Oberski

Page 35: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

• Can we find a locally dependent two-class model that fits?• Which local dependence parameters are needed?

•• Goal: Examine fit of each pair of variables (dentist ratings)to the local independence assumption without fitting allpossible alternative models.

Detecting local dependence in latent class models Daniel Oberski

Page 36: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Detecting local dependencies with the BVR and MI

Detecting local dependence in latent class models Daniel Oberski

Page 37: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Fit of two-way cross-table between dentists 1 and 3

Observed

No YesNo 3250 280Yes 123 216

Expected

No YesNo 3217 313Yes 156 183

Bivariate residuals

No YesNo 32.6 -32.6Yes -32.6 32.6

BVR1,3 = r112∑k,l

µ̂−1kl = (32.6)2

∑k,l

µ̂−1kl ≈ 1063(0.0154) ≈ 16.3

MI1,3 = r112 Var(r11)−1 = (32.6)2/(31.3) ≈ 1063(0.0320) ≈ 34.0

Detecting local dependence in latent class models Daniel Oberski

Page 38: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Fit of two-way cross-table between dentists 1 and 3

Observed

No YesNo 3250 280Yes 123 216

Expected

No YesNo 3217 313Yes 156 183

Bivariate residuals

No YesNo 32.6 -32.6Yes -32.6 32.6

BVR1,3 = r112∑k,l

µ̂−1kl = (32.6)2

∑k,l

µ̂−1kl ≈ 1063(0.0154) ≈ 16.3

MI1,3 = r112 Var(r11)−1 = (32.6)2/(31.3) ≈ 1063(0.0320) ≈ 34.0

Detecting local dependence in latent class models Daniel Oberski

Page 39: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Fit of two-way cross-table between dentists 1 and 3

Observed

No YesNo 3250 280Yes 123 216

Expected

No YesNo 3217 313Yes 156 183

Bivariate residuals

No YesNo 32.6 -32.6Yes -32.6 32.6

BVR1,3 = r112∑k,l

µ̂−1kl = (32.6)2

∑k,l

µ̂−1kl ≈ 1063(0.0154) ≈ 16.3

MI1,3 = r112 Var(r11)−1 = (32.6)2/(31.3) ≈ 1063(0.0320) ≈ 34.0

Detecting local dependence in latent class models Daniel Oberski

Page 40: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Local dependencies between five dentists' x-rayratings for caries.

Dentist dependence MI BVR1 ↔ 2 3.1 1.41 ↔ 3 34.0 ** 16.31 ↔ 4 13.1 ** 7.71 ↔ 5 2.7 0.82 ↔ 3 6.8 * 1.72 ↔ 4 1.8 0.62 ↔ 5 16.4 ** 4.73 ↔ 4 2.7 1.03 ↔ 5 5.1 * 0.84 ↔ 5 3.5 1.0

Note: ** Sig. (α = 0.05) with Bonferroni correction for multipletesting, * without correction

Detecting local dependence in latent class models Daniel Oberski

Page 41: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI

``Bivariate residual'' (BVR):• Pearson "chi-square" residual in cross-table of a pair ofobserved variables: ∑(observed - expected)2/expected

• Surprise! Not a chi square statistic!

Detecting local dependence in latent class models Daniel Oberski

Page 42: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI

``Bivariate residual'' (BVR):• Pearson "chi-square" residual in cross-table of a pair ofobserved variables: ∑(observed - expected)2/expected

• Surprise! Not a chi square statistic!

Detecting local dependence in latent class models Daniel Oberski

Page 43: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

●●

●●

● ●●

● ●●

●●●●●●●●●●●●●●●●●●●

● ●●

●●

0 20 40 60 80 100

020406080

100

MI equals chi−square improvement...

χ2

MI

●●

●●●

● ●●

●●●●●●●●●●●●●●●●●● ● ● ●● ●●

● ● ●●●

0 20 40 60 80 100

020406080

100

... BVR does not.

χ2

BV

R

Detecting local dependence in latent class models Daniel Oberski

Page 44: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI

``Bivariate residual'' (BVR):• Pearson residual in cross-table of a pair of observedvariables: (observed - expected)2/expected;

• Not a chi square statistic.

``Modification index'' (MI):• ``Score test'' (``Lagrange multiplier'' test) for introducing alocal dependency between a pair of variables;

• Turns out to use the same residual as the BVR, but withthe correct variance! (thanks Jeroen)

• Under the null hypothesis, chi-square distributed (1 df)

Detecting local dependence in latent class models Daniel Oberski

Page 45: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI

``Bivariate residual'' (BVR):• Pearson residual in cross-table of a pair of observedvariables: (observed - expected)2/expected;

• Not a chi square statistic.

``Modification index'' (MI):• ``Score test'' (``Lagrange multiplier'' test) for introducing alocal dependency between a pair of variables;

• Turns out to use the same residual as the BVR, but withthe correct variance! (thanks Jeroen)

• Under the null hypothesis, chi-square distributed (1 df)

Detecting local dependence in latent class models Daniel Oberski

Page 46: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI

``Bivariate residual'' (BVR):• Pearson residual in cross-table of a pair of observedvariables: (observed - expected)2/expected;

• Not a chi square statistic.

``Modification index'' (MI):• ``Score test'' (``Lagrange multiplier'' test) for introducing alocal dependency between a pair of variables;

• Turns out to use the same residual as the BVR, but withthe correct variance! (thanks Jeroen)

• Under the null hypothesis, chi-square distributed (1 df)

Detecting local dependence in latent class models Daniel Oberski

Page 47: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR not even close to chi-square distributed, MI is.

Monte Carlo simulation under the null hypothesisCondition α for nominal 5% Empirical distribution

BVR MI BVRλ n Naive Boot MI Mean Var Mean Var

0.5 200 0.000 0.050 0.051 0.97 1.7 0.33 0.20.5 500 0.000 0.020 0.050 1.06 2.3 0.36 0.20.5 1000 0.000 0.060 0.065 0.96 1.9 0.33 0.20.5 5000 0.000 0.085 0.055 0.97 2.0 0.34 0.20.8 200 0.000 0.065 0.040 1.04 1.7 0.25 0.10.8 500 0.000 0.070 0.060 1.05 2.0 0.25 0.10.8 1000 0.000 0.060 0.090 1.22 2.6 0.30 0.20.8 5000 0.000 0.035 0.060 1.16 3.1 0.28 0.2Should be: 0.050 0.050 0.050 1.00 2.0 1.00 2.0

Detecting local dependence in latent class models Daniel Oberski

Page 48: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI: main findings

• Many (most?) published applied studies using BVRpretend it is a chi-square statistic*;

• BUT, if pretend BVR is chi-square:

• BVR does not provide nominal α levels;• BVR power to detect local dependencies miserably low*.

• Referring MI to chi-square:

• MI provides approximately nominal α levels;• Power of MI is adequate except for very smalldependencies (±0.05)*.

•• Can bootstrap BVR values, leading to performance similarto, though slightly below, MI*

* See paper.

Detecting local dependence in latent class models Daniel Oberski

Page 49: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI: main findings

• Many (most?) published applied studies using BVRpretend it is a chi-square statistic*;

• BUT, if pretend BVR is chi-square:• BVR does not provide nominal α levels;

• BVR power to detect local dependencies miserably low*.• Referring MI to chi-square:

• MI provides approximately nominal α levels;• Power of MI is adequate except for very smalldependencies (±0.05)*.

•• Can bootstrap BVR values, leading to performance similarto, though slightly below, MI*

* See paper.

Detecting local dependence in latent class models Daniel Oberski

Page 50: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI: main findings

• Many (most?) published applied studies using BVRpretend it is a chi-square statistic*;

• BUT, if pretend BVR is chi-square:• BVR does not provide nominal α levels;• BVR power to detect local dependencies miserably low*.

• Referring MI to chi-square:

• MI provides approximately nominal α levels;• Power of MI is adequate except for very smalldependencies (±0.05)*.

•• Can bootstrap BVR values, leading to performance similarto, though slightly below, MI*

* See paper.

Detecting local dependence in latent class models Daniel Oberski

Page 51: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI: main findings

• Many (most?) published applied studies using BVRpretend it is a chi-square statistic*;

• BUT, if pretend BVR is chi-square:• BVR does not provide nominal α levels;• BVR power to detect local dependencies miserably low*.

• Referring MI to chi-square:

• MI provides approximately nominal α levels;• Power of MI is adequate except for very smalldependencies (±0.05)*.

•• Can bootstrap BVR values, leading to performance similarto, though slightly below, MI*

* See paper.

Detecting local dependence in latent class models Daniel Oberski

Page 52: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI: main findings

• Many (most?) published applied studies using BVRpretend it is a chi-square statistic*;

• BUT, if pretend BVR is chi-square:• BVR does not provide nominal α levels;• BVR power to detect local dependencies miserably low*.

• Referring MI to chi-square:• MI provides approximately nominal α levels;

• Power of MI is adequate except for very smalldependencies (±0.05)*.

•• Can bootstrap BVR values, leading to performance similarto, though slightly below, MI*

* See paper.

Detecting local dependence in latent class models Daniel Oberski

Page 53: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI: main findings

• Many (most?) published applied studies using BVRpretend it is a chi-square statistic*;

• BUT, if pretend BVR is chi-square:• BVR does not provide nominal α levels;• BVR power to detect local dependencies miserably low*.

• Referring MI to chi-square:• MI provides approximately nominal α levels;• Power of MI is adequate except for very smalldependencies (±0.05)*.

•• Can bootstrap BVR values, leading to performance similarto, though slightly below, MI*

* See paper.

Detecting local dependence in latent class models Daniel Oberski

Page 54: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI: main findings

• Many (most?) published applied studies using BVRpretend it is a chi-square statistic*;

• BUT, if pretend BVR is chi-square:• BVR does not provide nominal α levels;• BVR power to detect local dependencies miserably low*.

• Referring MI to chi-square:• MI provides approximately nominal α levels;• Power of MI is adequate except for very smalldependencies (±0.05)*.

•• Can bootstrap BVR values, leading to performance similarto, though slightly below, MI*

* See paper.

Detecting local dependence in latent class models Daniel Oberski

Page 55: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI: main findings

• Many (most?) published applied studies using BVRpretend it is a chi-square statistic*;

• BUT, if pretend BVR is chi-square:• BVR does not provide nominal α levels;• BVR power to detect local dependencies miserably low*.

• Referring MI to chi-square:• MI provides approximately nominal α levels;• Power of MI is adequate except for very smalldependencies (±0.05)*.

•• Can bootstrap BVR values, leading to performance similarto, though slightly below, MI*

* See paper.

Detecting local dependence in latent class models Daniel Oberski

Page 56: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

BVR and MI: main findings

• Many (most?) published applied studies using BVRpretend it is a chi-square statistic*;

• BUT, if pretend BVR is chi-square:• BVR does not provide nominal α levels;• BVR power to detect local dependencies miserably low*.

• Referring MI to chi-square:• MI provides approximately nominal α levels;• Power of MI is adequate except for very smalldependencies (±0.05)*.

•• Can bootstrap BVR values, leading to performance similarto, though slightly below, MI*

* See paper.

Detecting local dependence in latent class models Daniel Oberski

Page 57: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size of local dependencies with the EPC

Detecting local dependence in latent class models Daniel Oberski

Page 58: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Modification index'' (MI):• Statistical ``score'' test of the hypothesis that the localdependence between two variables, say ψ, equals zero.

• ψ corresponds to the observed variable × observedvariable 2-way interaction in conditional prob. model;

• Only statistical significance → MI depends on:

• Sample size;• Size of the loadings of the two variables;• Size of the loadings of the other variables;• Latent variable intercept(s);• Observed variable intercepts;• Regression coefficients of covariates if present.

Detecting local dependence in latent class models Daniel Oberski

Page 59: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Modification index'' (MI):• Statistical ``score'' test of the hypothesis that the localdependence between two variables, say ψ, equals zero.

• ψ corresponds to the observed variable × observedvariable 2-way interaction in conditional prob. model;

• Only statistical significance → MI depends on:

• Sample size;• Size of the loadings of the two variables;• Size of the loadings of the other variables;• Latent variable intercept(s);• Observed variable intercepts;• Regression coefficients of covariates if present.

Detecting local dependence in latent class models Daniel Oberski

Page 60: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Modification index'' (MI):• Statistical ``score'' test of the hypothesis that the localdependence between two variables, say ψ, equals zero.

• ψ corresponds to the observed variable × observedvariable 2-way interaction in conditional prob. model;

• Only statistical significance → MI depends on:• Sample size;

• Size of the loadings of the two variables;• Size of the loadings of the other variables;• Latent variable intercept(s);• Observed variable intercepts;• Regression coefficients of covariates if present.

Detecting local dependence in latent class models Daniel Oberski

Page 61: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Modification index'' (MI):• Statistical ``score'' test of the hypothesis that the localdependence between two variables, say ψ, equals zero.

• ψ corresponds to the observed variable × observedvariable 2-way interaction in conditional prob. model;

• Only statistical significance → MI depends on:• Sample size;• Size of the loadings of the two variables;

• Size of the loadings of the other variables;• Latent variable intercept(s);• Observed variable intercepts;• Regression coefficients of covariates if present.

Detecting local dependence in latent class models Daniel Oberski

Page 62: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Modification index'' (MI):• Statistical ``score'' test of the hypothesis that the localdependence between two variables, say ψ, equals zero.

• ψ corresponds to the observed variable × observedvariable 2-way interaction in conditional prob. model;

• Only statistical significance → MI depends on:• Sample size;• Size of the loadings of the two variables;• Size of the loadings of the other variables;

• Latent variable intercept(s);• Observed variable intercepts;• Regression coefficients of covariates if present.

Detecting local dependence in latent class models Daniel Oberski

Page 63: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Modification index'' (MI):• Statistical ``score'' test of the hypothesis that the localdependence between two variables, say ψ, equals zero.

• ψ corresponds to the observed variable × observedvariable 2-way interaction in conditional prob. model;

• Only statistical significance → MI depends on:• Sample size;• Size of the loadings of the two variables;• Size of the loadings of the other variables;• Latent variable intercept(s);

• Observed variable intercepts;• Regression coefficients of covariates if present.

Detecting local dependence in latent class models Daniel Oberski

Page 64: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Modification index'' (MI):• Statistical ``score'' test of the hypothesis that the localdependence between two variables, say ψ, equals zero.

• ψ corresponds to the observed variable × observedvariable 2-way interaction in conditional prob. model;

• Only statistical significance → MI depends on:• Sample size;• Size of the loadings of the two variables;• Size of the loadings of the other variables;• Latent variable intercept(s);• Observed variable intercepts;

• Regression coefficients of covariates if present.

Detecting local dependence in latent class models Daniel Oberski

Page 65: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Modification index'' (MI):• Statistical ``score'' test of the hypothesis that the localdependence between two variables, say ψ, equals zero.

• ψ corresponds to the observed variable × observedvariable 2-way interaction in conditional prob. model;

• Only statistical significance → MI depends on:• Sample size;• Size of the loadings of the two variables;• Size of the loadings of the other variables;• Latent variable intercept(s);• Observed variable intercepts;• Regression coefficients of covariates if present.

Detecting local dependence in latent class models Daniel Oberski

Page 66: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Modification index'' (MI):• Statistical ``score'' test of the hypothesis that the localdependence between two variables, say ψ, equals zero.

• ψ corresponds to the observed variable × observedvariable 2-way interaction in conditional prob. model;

• Only statistical significance → MI depends on:• Sample size;• Size of the loadings of the two variables;• Size of the loadings of the other variables;• Latent variable intercept(s);• Observed variable intercepts;• Regression coefficients of covariates if present.

Detecting local dependence in latent class models Daniel Oberski

Page 67: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Expected parameter change'' (EPC):• Estimate of ψ, the strength of the local dependencebetween two variables;

• No need to fit alternative model;

• Assesses substantive rather than statistical significance.

Detecting local dependence in latent class models Daniel Oberski

Page 68: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Expected parameter change'' (EPC):• Estimate of ψ, the strength of the local dependencebetween two variables;

• No need to fit alternative model;• Assesses substantive rather than statistical significance.

Detecting local dependence in latent class models Daniel Oberski

Page 69: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Assessing substantive size besides statisticalsiginificance

``Expected parameter change'' (EPC):• Estimate of ψ, the strength of the local dependencebetween two variables;

• No need to fit alternative model;• Assesses substantive rather than statistical significance.

Detecting local dependence in latent class models Daniel Oberski

Page 70: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Local dependencies between five dentists' x-rayratings for caries.

Dentist dependence EPC MI BVR1 ↔ 2 -0.081 3.1 1.41 ↔ 3 -0.261 34.0 ** 16.31 ↔ 4 -0.146 13.1 ** 7.71 ↔ 5 -0.117 2.7 0.82 ↔ 3 -0.140 6.8 * 1.72 ↔ 4 -0.058 1.8 0.62 ↔ 5 -0.157 16.4 ** 4.73 ↔ 4 0.074 2.7 1.03 ↔ 5 -0.191 5.1 * 0.84 ↔ 5 -0.104 3.5 1.0

Note: ** Sig. (α = 0.05) with Bonferroni correction for multipletesting, * without correction

Detecting local dependence in latent class models Daniel Oberski

Page 71: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Based on EPC and MI,• keep two-classes, but...• ...free five out of ten possible bivariate localdependencies.

•• 15 degrees of freedom (was 20)• L2 = 28.4 (p = 0.07) (was 129.9)• BIC is −95.5 (was −35.37)• Best two-class model in lit. (Qu et al. 1996): BIC −83.4

Detecting local dependence in latent class models Daniel Oberski

Page 72: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Based on EPC and MI,• keep two-classes, but...• ...free five out of ten possible bivariate localdependencies.

•• 15 degrees of freedom (was 20)• L2 = 28.4 (p = 0.07) (was 129.9)• BIC is −95.5 (was −35.37)• Best two-class model in lit. (Qu et al. 1996): BIC −83.4

Detecting local dependence in latent class models Daniel Oberski

Page 73: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Based on EPC and MI,• keep two-classes, but...• ...free five out of ten possible bivariate localdependencies.

•• 15 degrees of freedom (was 20)

• L2 = 28.4 (p = 0.07) (was 129.9)• BIC is −95.5 (was −35.37)• Best two-class model in lit. (Qu et al. 1996): BIC −83.4

Detecting local dependence in latent class models Daniel Oberski

Page 74: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Based on EPC and MI,• keep two-classes, but...• ...free five out of ten possible bivariate localdependencies.

•• 15 degrees of freedom (was 20)• L2 = 28.4 (p = 0.07) (was 129.9)

• BIC is −95.5 (was −35.37)• Best two-class model in lit. (Qu et al. 1996): BIC −83.4

Detecting local dependence in latent class models Daniel Oberski

Page 75: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Based on EPC and MI,• keep two-classes, but...• ...free five out of ten possible bivariate localdependencies.

•• 15 degrees of freedom (was 20)• L2 = 28.4 (p = 0.07) (was 129.9)• BIC is −95.5 (was −35.37)

• Best two-class model in lit. (Qu et al. 1996): BIC −83.4

Detecting local dependence in latent class models Daniel Oberski

Page 76: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Based on EPC and MI,• keep two-classes, but...• ...free five out of ten possible bivariate localdependencies.

•• 15 degrees of freedom (was 20)• L2 = 28.4 (p = 0.07) (was 129.9)• BIC is −95.5 (was −35.37)• Best two-class model in lit. (Qu et al. 1996): BIC −83.4

Detecting local dependence in latent class models Daniel Oberski

Page 77: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Based on EPC and MI,• keep two-classes, but...• ...free five out of ten possible bivariate localdependencies.

•• 15 degrees of freedom (was 20)• L2 = 28.4 (p = 0.07) (was 129.9)• BIC is −95.5 (was −35.37)• Best two-class model in lit. (Qu et al. 1996): BIC −83.4

Detecting local dependence in latent class models Daniel Oberski

Page 78: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

Detecting local dependence in latent class models Daniel Oberski

Page 79: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:

• MI works well as chi-square statistic;• BVR does not, but can bootstrap p-values.

• Introduced EPC: local dependence substantive size.•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 80: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:

• MI works well as chi-square statistic;• BVR does not, but can bootstrap p-values.

• Introduced EPC: local dependence substantive size.•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 81: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:

• MI works well as chi-square statistic;• BVR does not, but can bootstrap p-values.

• Introduced EPC: local dependence substantive size.•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 82: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:

• MI works well as chi-square statistic;• BVR does not, but can bootstrap p-values.

• Introduced EPC: local dependence substantive size.•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 83: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:• MI works well as chi-square statistic;

• BVR does not, but can bootstrap p-values.• Introduced EPC: local dependence substantive size.•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 84: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:• MI works well as chi-square statistic;• BVR does not, but can bootstrap p-values.

• Introduced EPC: local dependence substantive size.•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 85: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:• MI works well as chi-square statistic;• BVR does not, but can bootstrap p-values.

• Introduced EPC: local dependence substantive size.

•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 86: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:• MI works well as chi-square statistic;• BVR does not, but can bootstrap p-values.

• Introduced EPC: local dependence substantive size.

•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 87: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:• MI works well as chi-square statistic;• BVR does not, but can bootstrap p-values.

• Introduced EPC: local dependence substantive size.•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 88: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Conclusions

• Local dependence is a serious problem in LCA;• Sometimes modeling dependence directly is preferable toincreasing no. classes;

• Need to monitor local dependence.

•• Introduced MI and BVR, tests for local dependence:• MI works well as chi-square statistic;• BVR does not, but can bootstrap p-values.

• Introduced EPC: local dependence substantive size.•• Dentist example: obtained more parsimonious andeasy-to-interpret two-class model than current literature.

Detecting local dependence in latent class models Daniel Oberski

Page 89: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

• Class-specific local dependence;• Multivariate MI, EPC, (polytomous items);• Effect of freeing local dependencies on params of interest;• Problem of parameter dependence;• Problem of multiple comparisons;• (Post-hoc) power of MI (Satorra);• MI for other params than locdeps (e.g. item bias).

Detecting local dependence in latent class models Daniel Oberski

Page 90: Detecting local dependence in latent class models

Example LCA Local dependence BVR and MI EPC Conclusions References

Thank you for your attention!

Daniel [email protected]

Working papers on this topic:Oberski, D., Van Kollenburg, G., and Vermunt, J. (submitted). AMonte Carlo evaluation of three methods to detect localdependence in binary data latent class models.

Oberski, D. and Vermunt, J. (submitted). The Expected ParameterChange (EPC) for local dependence assessment in binary datalatent class models.

Oberski, D. (submitted). Change in SEM parameters of interest asa criterion for partial measurement invariance: TheEPC-interest.

Detecting local dependence in latent class models Daniel Oberski

Page 91: Detecting local dependence in latent class models

Appendix

Appendix

Detecting local dependence in latent class models Daniel Oberski

Page 92: Detecting local dependence in latent class models

Appendix

0.5 0.8

0.00

0.25

0.50

0.75

1.00

0.00

0.25

0.50

0.75

1.00

0.00

0.25

0.50

0.75

1.00

0.00

0.25

0.50

0.75

1.00

0.00

0.25

0.50

0.75

1.00

0.00

0.25

0.50

0.75

1.00

−0.4

−0.2

−0.05

0.050.2

0.4

200

500

1000

5000200

500

1000

5000

Log(sample size)

Pow

er

p−value type

BVR

BVR_bootstrap

MI

Detecting local dependence in latent class models Daniel Oberski

Page 93: Detecting local dependence in latent class models

Appendix

Convergence to noncentral chi-square in worstcondition

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●

●●

●●

Sample size: 200Q−Q plot, noncentral χ1

2(4.7)

Theoretical quantiles

MI

0

10

20

30

40

0 5 10 15 20 25

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●

●●●

● ●

Sample size: 500Q−Q plot, noncentral χ1

2(11.8)

Theoretical quantiles

MI

010203040506070

0 10 20 30 40

●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●

●●●●●●●

●●●●●

Sample size: 1000Q−Q plot, noncentral χ1

2(23.7)

Theoretical quantilesM

I

0

20

40

60

80

10 20 30 40 50 60

●●●●●●

●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●

●●●●●

●●

Sample size: 5000Q−Q plot, noncentral χ1

2(118)

Theoretical quantiles

MI

50

100

150

200

80 120 160

Detecting local dependence in latent class models Daniel Oberski

Page 94: Detecting local dependence in latent class models

Appendix

Local dependencies between indicators of Hispanicethnicity in the U.S. Census

Local dependence EPCL TL EPCgs TGS ψ̃ WaldAncestry-re ↔ Language-in 0.92 5.0 1.45 7.9 0.22 0.2Ancestry-re ↔ Origin-in -0.76 2.5 -1.23 4.1 -0.23 0.1Ancestry-re ↔ Origin-re 2.94 45.6 1.32 20.5 1.82 18.7Language-in ↔ Origin-in 4.14 97.1 1.59 37.2 3.52 53.4Language-in ↔ Origin-re -1.08 7.9 -1.76 12.8 1.33 7.1

Origin-in ↔ Origin-re 1.10 6.1 2.20 12.2 0.52 0.3

Detecting local dependence in latent class models Daniel Oberski