Analyzing Ordinal Data With Linear Models

44
1 Consequences of Ignoring Ordinality In statistical analysis numbers represent properties of the observed units. Measurement level: Which features of numbers correspond to empirical meanings? Nominal scale: If two units are represented by same numbers, they have the same property; if two units are represented by different numbers, they have different properties. Ordinal scale: If one unit is represented by a larger number than a second unit, than the first unit has more of a property than the second unit. Metric scale: If one unit is represented by a larger number than a second unit, than the difference of the numbers represent how much more of a property the first unit has compared with the second unit. It depends on reality and theory about reality which measurement level is given. Analyzing Ordinal Data With Linear Models

Transcript of Analyzing Ordinal Data With Linear Models

Page 1: Analyzing Ordinal Data With Linear Models

1

Consequences of Ignoring Ordinality

In statistical analysis numbers represent properties of the observed units.

Measurement level: Which features of numbers correspond to empirical meanings?

Nominal scale: If two units are represented by same numbers, they have the sameproperty; if two units are represented by different numbers, they havedifferent properties.

Ordinal scale: If one unit is represented by a larger number than a second unit, than the first unit has more of a property than the second unit.

Metric scale: If one unit is represented by a larger number than a second unit, thanthe difference of the numbers represent how much more of a propertythe first unit has compared with the second unit.

It depends on reality and theory about reality which measurement level is given.

Analyzing Ordinal Data With Linear Models

Page 2: Analyzing Ordinal Data With Linear Models

2

Problems of Regression Based on Ordinal Data Case 1: Depending variable is metric,predictor variable is ordinal..

0

1

2

3

4

5

6

7

8

9

10

1 2 3

Regression using rank numbers of X

X

Y

0

1

2

3

4

5

6

7

8

9

10

1 2 3 4

Regression using true distances of X

True distance between category 1 and category 2

True distance between category 2 and category 3

Y

X*

Then: Regression on X concealsthe exact functional formof a monotonic relationship.

Consequences:(a) Ignoring ordinality can be

captured by non-linearregression functions.

(b) If predictors are ordinal,there is no meaningful difference between linearand non-linear monotonicrelations.

Page 3: Analyzing Ordinal Data With Linear Models

3

Regression based on ranks of Y

-5-4-3-2-101234567

1 2 3

Y

Case 2: Depending variable is ordinal, predictor variable is metric or ordinal.

Because means utilize on distances, means and variances are not defined for ordinal variables.

Consequences:It is not possible to infer from linear regression of Y on X whether a relationship is positive, negative, or non-monotonic, or whether there is any relationship at all.Ignoring ordinality of dependent variables can cause meaningless results.

True distancebetween firstand second category

Rank distancebetween firstand second category

-5-4-3-2-101234567

1 2 3

Y*

Regression based on true values

Page 4: Analyzing Ordinal Data With Linear Models

4

2211

YX00

-1-1

To cope with ordinality in linear models it is assumed that the ordinal variables X and Y are crude measures of unobserved metric continuous variables X* and Y*.Then any category xi of X will be observed if and only if a realization of X* is in the interval between the thresholdsτi-1 and τi. In the same way any category yj of Y will be observed if and only if a realization of Y*

is in the interval between the thresholds τj-1 and τj.

Then:The probability of any cell in the contingencytable is given as the volume under the bivariatedensity in the regions defined by the thresholds.

Coping with Ordinality in Linear Models: Polychoric Correlation:

*i 1 iProb(X i) Prob( X )−= = τ < ≤ τ

*j 1 jProb(Y j) Prob( Y )−= = τ < ≤ τand

Mathematically:

The polychoric correlation is an estimate of theproduct moment correlation between X* and Y*

Page 5: Analyzing Ordinal Data With Linear Models

5

The Estimation of Polychoric Correlations and Covariances

*k 1 kProb(Y k) Prob( Y )−= = τ < ≤ τFrom it follows, that Y is related on Y* by

a step function:

Y=1

Y=3

Y=k

Y=2

τ1 τ2 τk-1Y*

Y

If the thresholds are unknown, not only the polychoric correlation but also the thresholds have to be estimated.

Page 6: Analyzing Ordinal Data With Linear Models

6

-4.0 -3.5 -3.0 -2.5 -2.0 -1.5 -1.0 -0.5 0.0 0.5 1.0 1.5 2.0 2,.5 3.0 3.5 4.0

Y=1 Y=2 Y=3

f(Y*)

τ1 τ2

Because the realizations of an ordinal category depends on the probability distribution of the underlying continuous variable it is necessary to assume a distribution for the unobserved variables Y*. Usually it is assumed that the underlying continuous variable Y* is normal distributed.

If Y* is normally distributed, the following relation holds:

* 2(Y )1k 2 2*k

k

1Prob(Y ) e dY*

2

( )

−µ

στ − ⋅

−∞≤ τ = ⋅∫

σ πτ − µ

= Φσ

The formula shows that the probabilities are given by the areas under the standard normal curve:

The Estimation of Polychoric Correlations and Covariances

Page 7: Analyzing Ordinal Data With Linear Models

7

f(Y*)

τ1 τ2

-2.0 -1.5 -1.0 -0.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0

-4.0 -3.0 -2.0 -1.0 4.03.02.01.00.0

From the formula above or the figure it can be easily justified that the translation of the curve to the right or to the left may be compensated by corresponding shifts of all thresholds.

Therefore, the thresholds and the mean of Y* cannot be estimated independently. Either one threshold or the mean must be fixed to an a priori number.

An increase in the variance of Y* will result in a flatter curve or equivalently in a rescaling of the horizontal axis. Again this can be compensated by the values of the thresholds. That means that also the variance of Y* cannot be estimated independent of the values of the thresholds. This is a reflection of the fact that an ordinal variable has no mean and no variance.

Statistically, both the mean and the variance of the underlying metric variables are not identified. If there is no other information, often the mean will be fixed to zero and the variance to one (standard parametrization).An alternative is to fix the first two thresholds to 0 and 1(Alternative parametrization).

Indetermination of Mean and Variance

Page 8: Analyzing Ordinal Data With Linear Models

8

whereΦ-1(y) denotes the inverse of the standard normal distribution function, that is the z-value for which the probability that a standard normal distribution is less or equal is given by the proportion in the argument of the function.

In standard parametrization the thresholds can be estimated by the inverse of the cumulative standard normal distribution.

If the sample size is n and nk is the number of cases in category k the thresholds are estimated by:

Example data: (crosstab.dat)

X1 2 3 ∑

1 25 20 10 55Y 2 20 50 20 90

3 10 20 25 55∑ 55 90 55 200

1 11

1 12

55(0.275) 0.598

200145

(0.725) 0.598200

− −

− −

τ = Φ = Φ = − τ = Φ = Φ = +

both for X* and Y*.

( )

( )

10

1 11

1 1 22

1 1 2 k 1K 1

1K

0

nnn n

n ...

n n +...+nn

1

− −−

τ = Φ = −∞

τ = Φ

+ τ = Φ

+ τ = Φ

τ = Φ = +∞

Estimation of thresholds:

Page 9: Analyzing Ordinal Data With Linear Models

9

To estimate the correlation ρY*X* between two unobserved continuous variables Y* and X* it is assumed that Y* and X* have a standard bivariate normal distribution.

Then the probabilities of any cell of the contingency table is given by:

( )( )j * * * *i

i 1 j 1 * *

* * * * * *

* *

*2 *2 * * 212 Y X Y X * *

2Y X

2 i j 2 i 1 j 2 i j 1Y X Y X Y X2 i 1 j 1 Y X

exp Y X 2 Y X /(1 )Prob(Y i,X j) Y X

2 1

( , , ) ( , , ) ( , , )( , , )

− −

ττ

τ τ

− −

− −

− + + ρ ⋅ ⋅ − ρ= = = ⋅∂ ⋅∂∫ ∫

π − ρ

= Φ τ τ ρ − Φ τ τ ρ − Φ τ τ ρ+Φ τ τ ρ

where Φ2(y,x,ρ) denotes the probability that a bivariate standard normal distribution withcorrelation ρ is smaller or equal X=x and Y=y.

Given the bivariate frequencies the polychoric correlation of Y and X can be estimated bymaximum-likelihood (ML). The ML-estimation is that correlation, that maximizes theprobability that the observed data could be realized by a bivariate normal distribution.

Estimation of polychoric correlations

Page 10: Analyzing Ordinal Data With Linear Models

10

For the example data and the threshold estimated before, the log-likelihood is

( )( )( )

22 2

22 22 2 2 2

ln L 25 ln ( .598, .598, )20 ln ( .598,.598, ) ( .598, .598, )10 ln ( .598) ( .598,.598, )20 ln (.598, .598, ) ( .598, .598, )50 ln (.598,.598, ) ( .598,.598, ) (.598, .598, ) ( .59

= ⋅ Φ − − ρ+ ⋅ Φ − ρ − Φ − − ρ+ ⋅ Φ − − Φ − ρ+ ⋅ Φ − ρ − Φ − − ρ+ ⋅ Φ ρ − Φ − ρ − Φ − ρ + Φ −( )

( )( )( )( )

2 22

2 22

8, .598, )20 ln (.598) ( .598) (.598,.598, ) ( .598,.598, )10 ln ( .598) (.598, .598, )20 ln (.598) ( .598) (.598,.598, ) (.598, .598, )25 ln 1 (.598) (.598) (.598,.598, )

− ρ+ ⋅ Φ − Φ − − Φ ρ + Φ − ρ+ ⋅ Φ − − Φ − ρ+ ⋅ Φ − Φ − − Φ ρ + Φ − ρ+ ⋅ − Φ − Φ + Φ ρ

where Φ2(∞,x,ρ)=Φ(x), Φ2(y,∞,ρ)=Φ(y), and Φ2(-∞,x,ρ)=Φ2(y,-∞,ρ)=0.

The ML-estimation is the value of ρ, where lnL reached a maximum or where -lnL reached a minimum. In the example the maximum is reached, when ρ=0.341.

If the units are sampled independently, the log-likelihood-function, that is the logarithm of the probability of the whole sample given the correlation, is:

* * * * * *

* ** *

I J 2 i j 2 i j 1 2 i 1 jY X Y X Y XijY X i 1 j 1 2 i 1 j 1 Y X

( , , ) ( , , ) ( , , )ln L( ) n ln

( , , )− −

= = − −

Φ τ τ ρ − Φ τ τ ρ − Φ τ τ ρ ρ = ⋅∑ ∑ +Φ τ τ ρ

For each case in the sample the probability of its realization is given by the equation above.

Page 11: Analyzing Ordinal Data With Linear Models

11

Using alternative parametrization the first two thresholds are fixed to 0 and 1. Then not only the polychoric correlations, but also the mean and the standard deviation can be estimated. Because reparametrizations can be transformed in each other, it is also possible to compute the alternat-ive parametrization from the standard parametrization.

The mean and the standard deviations can be computed from the first two estimated thresholds by:

−τµ = σ =

τ − τ τ − τ1

2 1 2 1

1 ;

The transformed thresholds are than:τ − τ

τ =τ − τ

* i 1i

2 1

From this transformation it can be easily seen that the values of the first two transformed thesholds are 0 and 1. For the example data the alternative parametrization for both indicators becomes:

Σ = ⋅ ⋅

=

2

2

0.8360.341 0.836 0.836 0.836

0.7000.238 0.700

Estimation of covariances, standard-deviations and means

b) alternative parametrizationτ1 = 0, τ2 = 1,µ =-(-0.598)/(0.598+0.598)

= 0.5;σ=1/(0.598+0.598) = 0.836

a) standard parametrizationτ1 = −0.598, τ2 = +0.598,µ = 0, σ=1, ρ=0.341

Page 12: Analyzing Ordinal Data With Linear Models

12

For statistical inferences it is necessary to estimate not only the polychoric correlations, but also their standard errors and covariances between pairs of polychoric correlations. Using alternative parametrization the asymptotic covariances of the covariance matrix is needed.

There are three methods to estimate these (asymptotic) variances and covariances of the polychoric correlations:(a) As the correlations their variances and covariances can be estimated from the data too.

These estimation is based on multivariate crosstabulations of more than two variables.From a statistical point of view, thresholds, polychoric correlations and their variancesand covariances should be estimated simultaneously by ML-estimation based on amultidimensional contingency table. Because the effort would be to high, estimationsare done stepwise. But even then the computations are very tedious and differentcomputer programs use different algorithms resulting sometimes in slightly differentestimations.

(b) The variances and covariances can be estimated by non-parametric bootstrapping.From the empirical data a lot of bootstrap-samples are drawn by simple randomsampling with replacement. Then for each subsample the polychoric correlations arecomputed. The estimated variances and covariances of the polychoric correlations aregiven by the empirical variances and covariances of the correlations across thebootstrap-samples.

Estimation of Standard errors and covariances between estimations of polychoric correlations

Page 13: Analyzing Ordinal Data With Linear Models

13

(c) The variances and covariances can be estimated by parametric bootstrapping or simulation. In the first step of parametric bootstrapping all parameters of a model are estimated. Here the parameters are the estimated thresholds and the polychoric correlations, or the thresholds and the parameters of a statistical model for the polychoric correlations.Next these parameters will be used to generate samples of simulated data. The variances and covariances of the estimated parameters across the simulated datasamples can be used as estimation of the variances and covariances of the parameters.Whereas in usual bootstrap-samples the (same) empirical data are used repeatedly, inparametric bootstrapping new data are generated by simulation.

Remarks:

Multivariate causal analysis of ordinal data using polychoric correlations is based on a prioriassumptions:

(1) It is assumed that all ordinal variables are crude measures of underlying continuousvariables.If ordinal variables are thought of as measures where distances are not only unknown butnot defined, the logic of polychoric correlations does not fit.Further, if it makes no sense to assume that the property - one is interested in - is notcontinuously, the logic of polychoric correlations does not fit.

Page 14: Analyzing Ordinal Data With Linear Models

14

(2) It is assumed that the relation between the crude measures and the underlying continuous variable can be formalized by a threshold model.Note, that the model postulate that the threshold are parameters that are valid for all units of a population. Nevertheless, simulation give hints that the model is stable evenif the threshold are random variables varying across units.

(3) Without an assumption on the the distribution of the underlying continuous variables itis impossible to estimate thresholds and polychoric correlations. Usually it is assumedthat all underlying continuous variables are normal distributed and that each pair ofvariables are binormal distributed. If this assumption does not hold, one can try to collapscategories or to reduce the number of variables. Further, simulation studies give hints that the true correlations can be estimated even if the assumption of binormality is false. That is, the estimation algorithm seems to be robust with respect to the distribution of the continuous variables.

(4) Correlations are sufficient association measures for linear models. Therefore, it ispossible to estimate linear models. But it is not possible to estimate non-linearrelationships between the underlying continuous variables. Theory who stated non-linear relations cannot be tested empirically using this approach.

Page 15: Analyzing Ordinal Data With Linear Models

15

Estimating the polychoric correlations or polychoric covariances of the efficacy items

The Allbus raw data of the 7 items to model political efficacy are stored in the file allb96s3.dat“. The following PRELIS-command are used to estimate the polychoric correlations and their asymptotic covariances for the 7 variables:

Read asci raw data and compute polychoric correlationsand their (co-) variances for Allbus subsetDA NI=7LAPolint1 Polint2 Impact Election Politicn Governm Leader/ RA=allb96s3.dat FO(7F1.0)OR ALLOU MA=PM PM=allb96s1.pm ac=allb96s1.acp BT

An example:

Page 16: Analyzing Ordinal Data With Linear Models

16

Total Sample Size = 1882

Univariate Marginal ParametersVariable Mean St. Dev. Thresholds-------- ---- -------- ----------Polint1 0.000 1.000 -1.677 -0.704 0.524 1.319Polint2 0.000 1.000 -1.711 -0.733 0.541 1.386Impact 0.000 1.000 -0.980 0.192 0.860 2.039

Election 0.000 1.000 -1.784 -1.012 -0.539 0.811Politicn 0.000 1.000 -0.944 0.113 0.825 2.018Governm 0.000 1.000 -1.185 -0.138 0.652 2.173Leader 0.000 1.000 -0.306 0.097 0.453 0.895 1.291

1.651....Bivariate Distributions for Ordinal Variables (Frequencies)

Polint2 Impact ----------------------- -----------------------

Polint1 1 2 3 4 5 1 2 3 4 5 ----------------------- -----------------------

1 63 15 9 1 0 34 28 19 5 2 2 15 292 57 1 0 61 174 68 53 9 3 4 42 783 34 1 125 362 221 141 15 4 0 3 36 338 12 50 154 87 89 9 5 0 2 7 24 143 38 58 36 40 4

Selected output:

Page 17: Analyzing Ordinal Data With Linear Models

17

(PE=Pearson Product Moment, PC=Polychoric, PS=Polyserial)Test of Model Test of Close Fit

Variable vs. Variable Correlation Chi-Squ. D.F. P-Value RMSEA P-Value-------- --- -------- ----------- -------- ---- ------- ----- -------Polint2 vs. Polint1 0.905 (PC) 482.780 15 0.000 0.129 0.000W_A_R_N_I_N_G: Underlying bivariate normality may not hold, see BTS-file

Impact vs. Polint1 0.109 (PC) 46.156 15 0.000 0.033 1.000Impact vs. Polint2 0.136 (PC) 46.583 15 0.000 0.033 1.000

Election vs. Polint1 0.158 (PC) 39.318 15 0.001 0.029 1.000Election vs. Polint2 0.172 (PC) 40.990 15 0.000 0.030 1.000Election vs. Impact 0.312 (PC) 81.802 15 0.000 0.049 1.000Politicn vs. Polint1 0.050 (PC) 36.880 15 0.001 0.028 1.000Politicn vs. Polint2 0.062 (PC) 40.366 15 0.000 0.030 1.000Politicn vs. Impact 0.317 (PC) 129.032 15 0.000 0.064 1.000Politicn vs. Election 0.230 (PC) 125.406 15 0.000 0.063 1.000Governm vs. Polint1 0.044 (PC) 37.295 15 0.001 0.028 1.000Governm vs. Polint2 0.062 (PC) 50.939 15 0.000 0.036 1.000Governm vs. Impact 0.275 (PC) 91.635 15 0.000 0.052 1.000Governm vs. Election 0.269 (PC) 100.322 15 0.000 0.055 1.000Governm vs. Politicn 0.563 (PC) 232.222 15 0.000 0.088 0.977Leader vs. Polint1 -0.047 (PC) 58.742 23 0.000 0.029 1.000Leader vs. Polint2 -0.051 (PC) 61.720 23 0.000 0.030 1.000Leader vs. Impact 0.120 (PC) 45.630 23 0.003 0.023 1.000Leader vs. Election 0.159 (PC) 53.734 23 0.000 0.027 1.000Leader vs. Politicn 0.177 (PC) 19.567 23 0.668 0.000 1.000Leader vs. Governm 0.245 (PC) 40.806 23 0.012 0.020 1.000

Correlations and Test Statistics

Page 18: Analyzing Ordinal Data With Linear Models

18

Bivariate Table for Polint1 vs Polint2

Observed Frequencies 1 2 3 4 5 | Rowsum

----------------------------------------------------------------1 | 63 15 9 1 0 | 882 | 15 292 57 1 0 | 3653 | 4 42 783 34 1 | 8644 | 0 3 36 338 12 | 3895 | 0 2 7 24 143 | 176

----------------------------------------------------------------Colsum | 82 354 892 398 156 | 1882

Expected Frequencies 1 2 3 4 5 | Rowsum

----------------------------------------------------------------1 | 54.3 32.9 0.8 0.0 0.0 | 88.02 | 27.1 228.8 108.8 0.2 0.0 | 365.03 | 0.6 92.1 663.1 106.4 1.9 | 864.04 | 0.0 0.2 116.1 232.0 40.7 | 389.05 | 0.0 0.0 3.1 59.4 113.5 | 176.0

----------------------------------------------------------------Colsum | 82.0 354.0 892.0 398.0 156.0 | 1882.0

BTS-File:

Page 19: Analyzing Ordinal Data With Linear Models

19

1 2 3 4 5 | Rowsum ----------------------------------------------------------------

1 | 18.7 -23.5 42.7 27.6 0.0 | 65.52 | -17.8 142.4 -73.7 3.1 0.0 | 54.03 | 15.8 -66.0 260.3 -77.6 -1.2 | 131.44 | 0.0 16.6 -84.3 254.3 -29.3 | 157.35 | 0.0 40.6 11.3 -43.5 66.1 | 74.6

----------------------------------------------------------------Colsum | 16.8 110.1 156.2 164.1 35.6 | 482.8

GF Contributions 1 2 3 4 5 | Rowsum

----------------------------------------------------------------1 | 1.4 9.7 79.2******** 0.0 |********2 | 5.4 17.4 24.7 3.0 0.0 | 50.63 | 21.5 27.3 21.7 49.2 0.4 | 120.14 | 0.0 42.0 55.3 48.4 20.2 | 165.95 | 0.0 51817.2 4.8 21.1 7.7 | 51850.8

----------------------------------------------------------------Colsum | 28.3 51913.7 185.7******** 28.3 |********

LR Contributions

Page 20: Analyzing Ordinal Data With Linear Models

20

1 2 3 4 5 | Rowsum ----------------------------------------------------------------

1 | 2.1 -5.3 9.4 1004.1 0.0 | 1010.22 | -3.8 11.6 -9.0 1.8 0.0 | 0.53 | 4.8 -9.2 29.3 -12.4 -0.7 | 11.94 | 0.0 6.6 -13.2 17.1 -6.9 | 3.75 | 0.0 227.6 2.4 -7.5 5.9 | 228.5

----------------------------------------------------------------Colsum | 3.1 231.3 19.0 1003.1 -1.7 | 1254.9

Standardized Residuals

Page 21: Analyzing Ordinal Data With Linear Models

21

Correlation Matrix Polint1 Polint2 Impact Election Politicn Governm-------- -------- -------- -------- -------- --------

Polint1 1.000Polint2 0.905 1.000Impact 0.109 0.136 1.000

Election 0.158 0.172 0.312 1.000Politicn 0.050 0.062 0.317 0.230 1.000Governm 0.044 0.062 0.275 0.269 0.563 1.000Leader -0.047 -0.051 0.120 0.159 0.177 0.245

Leader--------

Leader 1.000

MeansPolint1 Polint2 Impact Election Politicn Governm-------- -------- -------- -------- -------- --------

0.000 0.000 0.000 0.000 0.000 0.000

Leader--------

0.000

Back to the PRELIS Output-file:

Page 22: Analyzing Ordinal Data With Linear Models

22

Polint1 Polint2 Impact Election Politicn Governm-------- -------- -------- -------- -------- --------

1.000 1.000 1.000 1.000 1.000 1.000

Leader--------

1.000

Standard Deviations

Alternative Parametrization can be realized by specifying MA=CM or AP on the output-command:Read asci raw data and compute polychoric covariancesand their (co-) variances for Allbus subsetDA NI=7LAPolint1 Polint2 Impact Election Politicn Governm Leader/ RA=allb96s3.dat FO(7F1.0)OR ALLOU MA=CM CM=allb96s3.cm ac=allb96s3.acc

Page 23: Analyzing Ordinal Data With Linear Models

23

-------- ---- -------- ----------Polint1 1.724 1.028 0.000 1.000 2.262 3.079Polint2 1.750 1.023 0.000 1.000 2.304 3.168Impact 0.836 0.854 0.000 1.000 1.570 2.577

Election 2.312 1.296 0.000 1.000 1.613 3.362Politicn 0.893 0.946 0.000 1.000 1.674 2.802Governm 1.131 0.955 0.000 1.000 1.754 3.206Leader 0.758 2.481 0.000 1.000 1.883 2.978 3.962 4.853

Covariance Matrix

Polint1 Polint2 Impact Election Politicn Governm-------- -------- -------- -------- -------- --------

Polint1 1.056Polint2 0.952 1.047Impact 0.095 0.119 0.729

Election 0.210 0.228 0.345 1.678Politicn 0.049 0.060 0.256 0.282 0.895Governm 0.043 0.061 0.224 0.333 0.509 0.911Leader -0.120 -0.129 0.255 0.511 0.416 0.581

Leader--------

Leader 6.155

Variable Mean St. Dev. Thresholds

The covariance matrix can be computed from the polychoric correlation matrix by multiplying the vector of standard deviations from left and from right.

Page 24: Analyzing Ordinal Data With Linear Models

24

As a consequences of ordinal data, with standard parametrization correlations instead of variances and covariances will be analyzed. If a covariance structure model is specified it is not certain that all diagonal elements have the estimations one. Because the diagonal ones are fixed numbers they have no asymptotic variances and covariances.

LISREL uses a different WLS fit-function if the matrix to be analyzed is a correlation matrix:

k i 1 k l 1 k

WLS ij,lm ij ij lm lm iii 1 j 1 l 1 m 1 i 1

ˆ ˆ ˆF w (r ) (r ) (1 )− −

= = = = =

= ⋅ − ρ ⋅ − ρ + − ρ∑∑∑∑ ∑

The weights wij,lm are the asymptotic variances and covariances of the (polychoric, polyserial or product moment) correlations, estimated by PRELIS.

Note, that this formula is an ad-hoc solution. It is not guaranteed that the fitted diagonals areones.

Analyzing Correlations or Covariances ?

Page 25: Analyzing Ordinal Data With Linear Models

25

In a covariance matrix, the diagonal elements are sample variances. These elements areestimations of the population variances.

In a correlation matrix, the diagonal elements are fixed numbers. Therefore, the diagonal elements contain no empirical information that can be used to estimate a linear model.

On the other hand, we have seen, that even when analyzing a matrix of polychoric correlations the diagonal elements are used in the estimation of the model parameters. In the example of the 1-factor model the diagonal ones are used to estimate the error covariances:

ξ1

X1 X2 X3

λ λ λ

δ1 δ2 δ3

2 1

3 1 3 2

211

2 2X X 22

2 2 2X X X X 33

211

11

1

ˆ ˆ1

λ + θ ρ = λ λ + θ

ρ ρ λ λ λ + θ

⇒ θ = − λ

The diagonal elements are used in the estimation because formally a LISREL model is a covariance structure.

Correlation Structures and Covariance Structures

Page 26: Analyzing Ordinal Data With Linear Models

26

Does it matter if a covariance structure is estimated by a correlation matrix?

There are three problems that can occur if a correlation matrix is analyzed as if it is a covariance matrix:• it is not possible to estimate specific restrictions• the chi-square goodness-of-fit statistic may be incorrect• standard errors may be incorrect.

An example for the first problem is, that analyzing correlations it is not possible to specifytau-equivalent measures. If loadings are restricted to be equal, measurement error varianceswill be equal too.

Estimating a covariance structure using correlations the chi-square goodness-of-fit statistic is often correct, if two condition are given:• the model have to be scale invariant• the diagonal elements are fitted to one that is all diagonal residual elements are zero.

Even if the chi-square is correct, the standard errors may be wrong.

In a covariance structure variances and covariances are functions of parameters;in a correlation structure correlations are functions of parameters.

Page 27: Analyzing Ordinal Data With Linear Models

27

Differences can be seen, if the efficacy model is estimated based on polychoric correlations by ML-method or by WLS-method:

Model for Efficacy based on polychoric correlations: MLDA NI=7 NO=1882 MA=CMCM=allb96s1.pm! ac=allb96s1.acpLA Polint1 Polint2 Impact Election Politicn Governm Leader /MO NY=7 NE=4 LY=FU,FI BE=FU,FI PS=SY TE=DI,FR LEPOLINT EFFICACY TRUST LEADER FI TE(7,7) FR PS(4,1) BE(2,1) BE(3,1) BE(2,3) BE(2,4) BE(3,4) VA 1.00 LY(1,1) LY(2,1) LY(3,2) LY(4,2) LY(5,3) LY(6,3) LY(7,4) PDOU AD=Off SO RS MI SS

Page 28: Analyzing Ordinal Data With Linear Models

28

POLINT0.91

EFFICACY0.16

TRUST0.51

LEADER1.00

Polint1 0.10

Polint2 0.09

Impact 0.68

Election 0.69

Politicn 0.45

Governm 0.43

Leader 0.00

Chi-Square=41.57, df=12, P-value=0.00004, RMSEA=0.036

1.00

1.00

1.00

1.00

1.00

1.00

1.00

0.13

0.45

0.05

0.07

0.22

-0.05

Page 29: Analyzing Ordinal Data With Linear Models

29

Model for Efficacy based on polychoric correlations: WLSDA NI=7 NO=1882 MA=PMpmallb96s1.pmac=allb96s1.acpLA Polint1 Polint2 Impact Election Politicn Governm Leader /MO NY=7 NE=4 LY=FU,FI BE=FU,FI PS=SY TE=DI,FR LEPOLINT EFFICACY TRUST LEADER FI TE(7,7) FR PS(4,1) BE(2,1) BE(3,1) BE(2,3) BE(2,4) BE(3,4) VA 1.00 LY(1,1) LY(2,1) LY(3,2) LY(4,2) LY(5,3) LY(6,3) LY(7,4) PDOU AD=Off SO RS MI SS WL

Estimating method: WLS:

Page 30: Analyzing Ordinal Data With Linear Models

30

POLINT0.89

EFFICACY0.17

TRUST0.52

LEADER1.00

Polint1 0.11

Polint2 0.11

Impact 0.68

Election 0.68

Politicn 0.43

Governm 0.43

Leader 0.00

Chi-Square=27.13, df=12, P-value=0.00740, RMSEA=0.026

1.00

1.00

1.00

1.00

1.00

1.00

1.00

0.14

0.45

0.06

0.08

0.22

-0.05

Page 31: Analyzing Ordinal Data With Linear Models

31

Model for Efficacy based on polychoric covariancesDA NI=7 NO=1882 MA=CMCM=allb96s3.cmac=allb96s3.accLA Polint1 Polint2 Impact Election Politicn Governm Leader /MO NY=7 NE=4 LY=FU,FI BE=FU,FI PS=SY TE=DI,FR LEPOLINT EFFICACY TRUST LEADER FI TE(7,7) FR PS(4,1) BE(2,1) BE(3,1) BE(2,3) BE(2,4) BE(3,4) VA 1.00 LY(1,1) LY(2,1) LY(3,2) LY(4,2) LY(5,3) LY(6,3) LY(7,4) PDOU AD=Off SO RS MI SS WL

If polychoric covariances are analyzed again a different result occur:

Page 32: Analyzing Ordinal Data With Linear Models

32

POLINT0.94

EFFICACY0.19

TRUST0.47

LEADER6.15

Polint1 0.11

Polint2 0.11

Impact 0.39

Election 1.34

Politicn 0.38

Governm 0.40

Leader 0.00

Chi-Square=40.94, df=12, P-value=0.00005, RMSEA=0.036

1.00

1.00

1.00

1.00

1.00

1.00

1.00

0.12

0.47

0.02

0.07

0.08

-0.15

Page 33: Analyzing Ordinal Data With Linear Models

33

Equal thresholds

If the thresholds are known or if they can be restricted it is possible to estimate also variances and covariances In PRELIS it is possible to set the thresholds equal for different variables. Then not only the correlations but also the means and variances of this variables can be estimated. But note that the means of variables with equal thresholds will sum to zero and the variances will sum to one.

Example: Equal thresholds for equal formats in the efficacy model:Example with equal thresholds DA NI=7LAPolint1 Polint2 Impact Election Politicn Governm Leader/ RA=allb96s3.dat FO(7F1.0)OR ALLET Polint1 Polint2ET Impact Election Politicn GovernmOU MA=PM

Restrictions on Thresholds

Page 34: Analyzing Ordinal Data With Linear Models

34

Variable Mean St. Dev. Thresholds-------- ---- -------- ----------Polint1 0.005 1.016 -1.700 -0.711 0.537 1.346Polint2 -0.005 0.983 -1.700 -0.711 0.537 1.346Impact -0.360 0.985 -1.324 -0.171 0.487 1.648

Election 0.871 1.127 -1.324 -0.171 0.487 1.648Politicn -0.343 0.998 -1.324 -0.171 0.487 1.648Governm -0.169 0.874 -1.324 -0.171 0.487 1.648Leader 0.000 1.000 -0.306 0.097 0.453 0.895 1.291 1.651

Correlation Matrix equal thresholds

Polint1 Polint2 Impact Election Politicn Governm-------- -------- -------- -------- -------- --------

Polint1 1.000Polint2 0.904 1.000Impact 0.109 0.137 1.000

Election 0.159 0.174 0.329 1.000Politicn 0.050 0.062 0.317 0.247 1.000Governm 0.041 0.059 0.266 0.275 0.550 1.000Leader -0.047 -0.050 0.120 0.170 0.176 0.237

free thresholdsPolint1 Polint2 Impact Election Politicn Governm-------- -------- -------- -------- -------- --------

Polint1 1.000Polint2 0.905 1.000Impact 0.109 0.136 1.000

Election 0.158 0.172 0.312 1.000Politicn 0.050 0.062 0.317 0.230 1.000Governm 0.044 0.062 0.275 0.269 0.563 1.000Leader -0.047 -0.051 0.120 0.159 0.177 0.245

Page 35: Analyzing Ordinal Data With Linear Models

35

Example with equal thresholds DA NI=7LAPolint1 Polint2 Impact Election Politicn Governm Leader/ RA=allb96s3.dat FO(7F1.0)OR ALLET Polint1 Polint2ET Impact Election Politicn GovernmOU MA=CM CM=a96s1et.cm AC=a96s1et.acc

Computation of polychoric covariances with equal thresholds

Variable Mean St. Dev. Thresholds-------- ---- -------- ----------Polint1 1.724 1.028 0.000 1.000 2.262 3.079Polint2 1.714 0.994 0.000 1.000 2.262 3.079Impact 0.836 0.854 0.000 1.000 1.570 2.577

Election 1.904 0.977 0.000 1.000 1.570 2.577Politicn 0.851 0.866 0.000 1.000 1.570 2.577Governm 1.002 0.758 0.000 1.000 1.570 2.577Leader 0.758 2.481 0.000 1.000 1.883 2.978 3.962 4.853

Page 36: Analyzing Ordinal Data With Linear Models

36

Covariance Matrix equal thresholds

Polint1 Polint2 Impact Election Politicn Governm-------- -------- -------- -------- -------- --------

Polint1 1.056Polint2 0.924 0.988Impact 0.095 0.116 0.729

Election 0.160 0.169 0.275 0.955Politicn 0.044 0.053 0.234 0.209 0.749Governm 0.032 0.045 0.172 0.204 0.361 0.575Leader -0.120 -0.124 0.255 0.413 0.379 0.445

Leader--------

Leader 6.155

Covariance Matrix free thresholds

Polint1 Polint2 Impact Election Politicn Governm-------- -------- -------- -------- -------- --------

Polint1 1.056Polint2 0.952 1.047Impact 0.095 0.119 0.729

Election 0.210 0.228 0.345 1.678Politicn 0.049 0.060 0.256 0.282 0.895Governm 0.043 0.061 0.224 0.333 0.509 0.911Leader -0.120 -0.129 0.255 0.511 0.416 0.581

Leader--------

Leader 6.155

Page 37: Analyzing Ordinal Data With Linear Models

37

POLINT0.92

EFFICACY0.15

TRUST0.33

LEADER6.15

Polint1 0.14

Polint2 0.07

Impact 0.45

Election 0.67

Politicn 0.39

Governm 0.22

Leader 0.00

Chi-Square=27.79, df=12, P-value=0.00594, RMSEA=0.026

1.00

1.00

1.00

1.00

1.00

1.00

1.00

0.13

0.52

0.02

0.06

0.07

-0.13

Page 38: Analyzing Ordinal Data With Linear Models

38

Another possibility to identify means and variances is to fix threshold to given values. This is useful for the same variables in a group comparison. In a first step common thresholds are estimated and saved. These thresholds are used in a second step to compute polychoric covariances and means. Lastly a structured mean model can be estimated.

Fixed thresholds

Example: Computing thresholds for group comparionsDA NI=7LAPolint1 Polint2 Impact Election Politicn Governm Group/ RA FI=allb96s0.rawSD GroupOR ALLOU MA=PM TH=allb96s0.th

Univariate Marginal Parameters

Variable Mean St. Dev. Thresholds-------- ---- -------- ----------Polint1 0.000 1.000 -1.537 -0.606 0.569 1.374Polint2 0.000 1.000 -1.571 -0.647 0.579 1.426Impact 0.000 1.000 -0.827 0.280 0.929 1.978

Election 0.000 1.000 -1.682 -0.922 -0.438 0.818Politicn 0.000 1.000 -0.883 0.178 0.878 1.984Governm 0.000 1.000 -1.031 -0.044 0.739 2.192

Page 39: Analyzing Ordinal Data With Linear Models

39

Computing pol. correlations based on given thresholds: WestDA NI=7LAPolint1 Polint2 Impact Election Politicn Governm Group/ RA FI=allb96s0.raw RESD Group=0OR ALLFT=allb96s0.th Polint1FT Polint2 ; FT Impact ; FT Election ; FT Politicn ; FT GovernmOU MA=PM Computing pm bsed on given thresholds: East DA NI=7LAPolint1 Polint2 Impact Election Politicn Governm Group/ RA FI=allb96s0.raw RESD Group=1OR ALLFT=allb96s0.th Polint1FT Polint2 ; FT Impact ; FT Election ; FT Politicn ; FT GovernmOU MA=PM

Page 40: Analyzing Ordinal Data With Linear Models

40

Computing cm and acc based on constant thresholds: West DA NI=7LAPolint1 Polint2 Impact Election Politicn Governm Group/ RA FI=allb96s0.raw RESD Group=0OR ALLFT=allb96s0.th Polint1FT Polint2 ; FT Impact ; FT Election ; FT Politicn ; FT GovernmOU MA=CM CM=a96s1pc.cm me=a96s1pc.me ac=a96s1pc.accComputing cm and acc based on constant thresholds: East DA NI=7LAPolint1 Polint2 Impact Election Politicn Governm Group/ RA FI=allb96s0.raw RESD Group=1OR ALLFT=allb96s0.th Polint1FT Polint2 ; FT Impact , FT Election ; FT Politicn ; FT GovernmOU MA=CM CM=a96s2pc.cm me=a96s2pc.me ac=a96s2pc.acc

Page 41: Analyzing Ordinal Data With Linear Models

41

Group: West

Univariate Marginal Parameters

Variable Mean St. Dev. Thresholds-------- ---- -------- ----------Polint1 1.701 1.059 0.000 1.000 2.262 3.127Polint2 1.733 1.065 0.000 1.000 2.327 3.244Impact 0.841 0.853 0.000 1.000 1.586 2.534

Election 2.306 1.276 0.000 1.000 1.637 3.289Politicn 0.895 0.926 0.000 1.000 1.660 2.702Governm 1.162 0.986 0.000 1.000 1.793 3.265

Covariance Matrix

Polint1 Polint2 Impact Election Politicn Governm-------- -------- -------- -------- -------- --------

Polint1 1.123Polint2 1.026 1.133Impact 0.103 0.122 0.728

Election 0.212 0.237 0.329 1.629Politicn 0.036 0.047 0.248 0.271 0.858Governm 0.038 0.061 0.230 0.339 0.516 0.973

Page 42: Analyzing Ordinal Data With Linear Models

42

Group: East

Variable Mean St. Dev. Thresholds-------- ---- -------- ----------Polint1 1.544 1.097 0.000 1.000 2.262 3.127Polint2 1.631 1.116 0.000 1.000 2.327 3.244Impact 0.523 1.009 0.000 1.000 1.586 2.534

Election 2.011 1.373 0.000 1.000 1.637 3.289Politicn 0.692 0.971 0.000 1.000 1.660 2.702Governm 0.794 1.037 0.000 1.000 1.793 3.265

Covariance Matrix

Polint1 Polint2 Impact Election Politicn GovernmPolint1 Polint2 Impact Election Politicn Governm

-------- -------- -------- -------- -------- --------Polint1 1.203Polint2 1.026 1.247Impact 0.115 0.143 1.018

Election 0.170 0.228 0.426 1.884Politicn 0.092 0.091 0.363 0.489 0.944Governm 0.099 0.080 0.388 0.433 0.688 1.075

Page 43: Analyzing Ordinal Data With Linear Models

43

Group A: West Group B: East

Results of the estimation

Chi-Square=80.07, df=27, P-value=0.00000, RMSEA=0.036

Polint10.10

Polint20.10

Impact0.43

Election1.26

Politicn0.34

Governm0.45

POLINT 1.02

EFFICACY 0.34

TRUST 0.57

1.00

1.00

1.00

1.00

1.00

1.00

0.14

0.06

0.30

Polint10.18

Polint20.21

Impact0.64

Election1.42

Politicn0.27

Governm0.40

POLINT 1.02

EFFICACY 0.34

TRUST 0.57

1.00

1.00

1.00

1.00

1.00

1.00

0.14

0.06

0.30

Page 44: Analyzing Ordinal Data With Linear Models

44

If it is possible to divide the variables in ordinal dependent variables and (fixed) metricexogenous variables, the underlying continuous variables can be regressed on the independentvariables. Such regression is known as probit regression. Here, the distributional assumption is that the residuals of the variables given the exogenous variables will be standard multinormal distributed.

PRELIS computes not only the thresholds and the regression coefficients but also the conditional (polychoric) correlations of the residuals. Further given these residual correlation matrix, the regression coefficients, and the variances and covariances of the exogenous variables also the unconditional variances and covariances of all variables may be computed.

Additionally the asymptotic variances and covariances of this variances and covariances can be estimated:

K K

k k i k k i 1k=1 k=1

Prob(Y i) Prob Y*- X Prob Y*- X − = = β ⋅ < τ − β ⋅ < τ

∑ ∑

/*

1

ˆ ˆ ˆˆ ˆ;ˆK

X Y k kkX X

xµ β=

Β ⋅Σ ⋅Β + ΡΣ = = ⋅∑ Β ⋅Σ Σ

Probit Regression to Estimate Partial Polychoric Correlations