Lecture 6 (April 16, 2009): Principal Components and...

40
Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences Lecture 6 (April 16, 2009): Principal Components and Factor Analysis XH Andrew Zhou [email protected] Professor, Department of Biostatistics, University of Washington Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 1/4

Transcript of Lecture 6 (April 16, 2009): Principal Components and...

Page 1: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Measurement, Design, and Analytic Techniques in MentalHealth and Behavioral Sciences

Lecture 6 (April 16, 2009): PrincipalComponents and Factor Analysis

XH Andrew Zhou

[email protected]

Professor, Department of Biostatistics, University of Washington

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 1/40

Page 2: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Principal component analysis and Factor analysis

• Regression models require to identify some importantvariables as outcome variables.

• Without identifying specific variables as outcome variables,principal component analysis and factor analysis can beused to examine the relationships among a set of variables.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 2/40

Page 3: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Principal component analysis

• Principal component analysis technique tries to explain asmuch variability among a set of variables as possible interms of a few linear combinations of the variables.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 3/40

Page 4: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Factor analysis technique

• Factor analysis seeks to explain the relationships among aset of variables, expressed by their correlations orcovariances, by a few unobserved variables, which is calledfactors.

• It is hoped that few factors than the original number ofvariables will be needed to explain the relationships amongthe variables.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 4/40

Page 5: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Factor models

• Unlike measurement models for measurement error, ahypothetical construct, such as intelligence, cannot bemeasured directly. Instead, different aspects of intelligentare measured by different indicators or items, such asverbal, quantitative, and visual reasons.

• Answer to one particular item is a reflection of both generalintelligence and an item-specific aspect, referred to as thecommon and specific factors.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 5/40

Page 6: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Uni-dimensional common factor model

• A unidimensional common factor model:

yij = βi + λiηj + ǫij ,

where yij is the observed response of item i on subject j, ηj

is the common factor or latent trait for subject j, λi is afactor loading for the ith item, and ǫij is the unique orspecific factor.

• We assume that ηj and ǫij are independent.

• Let us denote ψ = V ar(ηj) and θij = V ar(ǫij).

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 6/40

Page 7: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Identification and equivalence

• A parametric model g(y | θ) is called globally identified ifthere are no two points θ1 and θ2 such thatg(y | θ1) = g(y | θ2).

• A parametric model g(y | θ) is called locally identified at θ0 ifthere exists an open neighborhood of θ0 such that there notwo points θ1 and θ2 in the neighborhood such thatg(y | θ1) = g(y | θ2).

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 7/40

Page 8: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Identifibility of the unidimensional factor model

• Recallyij = βi + λiηj + ǫij ,

where ηj ∼ N(γ, ψ), and ǫij ∼ N(0, θij).

• By making a linear transformation of the factor, fj = aηj + c,we can write the model as

yij = (βi − λic/a) + (λia)fj + ǫij = β∗i + λ∗i f

∗j + ǫij ,

where f∗j ∼ N(γ∗, ψ∗), ǫij ∼ N(0, θij).

• Here

β∗i = βi − λic/a, λ

∗i = λi/a, γ

∗ = aγ + c, ψ∗ = a2ψ.

• Hence, different parameter points generate the samereduced form distribution and the model is not identified.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 8/40

Page 9: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Identification restrictions

• We can either fix the scale of the common factor ηj byanchoring (typically fixing the first factor loading, λ1 = 1) or"factor standardization’ (fixing the factor variance to apositive constant, ψ = 1).

• Although the models from either identification restriction areequivalent, anchoring has some advantage over factorstandardization from the point of view of "factorialinvariance.

• For example, assume that the unidimensional factor modelabove holds for a population. But we consider thesubpopulation of units with negative factor values. In thiscase the original factor loadings are recovered in thesubpopulation under anchoring (with a reduced varianceestimate ψ) but not under factor standardization.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 9/40

Page 10: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Unidimensional factor models in matrix notation

• Let yj = (y1j , . . . , yIj)′, Γ = (λ1, . . . , λI)

′, ǫj = (ǫ1j , . . . , ǫIj)′,

β = (β1, . . . , βI)′, where I is the number of items.

• The unidimensional factor model:

yj = β + Γηj + ǫj .

• The covariance structure of yj is called a factor structure:

Ω = Cov(yj) = ΓψΓ′ + Θ,

where Θ is a diagonal matrix with the θii placed on thediagonal.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 10/40

Page 11: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Multidimensional factor models

• The unidimensional factor model imposes a rather restrictivestructure on the covariance. In structuring I(I + 1)/2variances and covariances only 2I parameters are used.

• A less restrictive multidimensional factor models are oftenused:

y1j = β1 + λ11η1j + . . .+ λ1MηMj + ǫ1j

. . .

yIj = βI + λI1η1j + . . .+ λIMηMj + ǫIj

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 11/40

Page 12: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Multidimensional factor models in matrix form

• Let yj = (y1j , . . . , yIj)′,

Γ =

λ11 . . . λ1M

. . . . . . . . .

λI1 . . . λIM )

,

ǫj = (ǫ1j , . . . , ǫIj)′, β = (β1, . . . , βI)′, where I is the number of items.

• The multidimensional factor model can be written as follows:

yj = β + Γyηj + ǫj ,

where β is a vector of constant (usually omitted if yi is mean-centered), Γy is afactor loading matrix, ηj is a vector of M common factors with covariance matrix Ψ

and ǫj a vector of unique factors with diagonal covariance matrix Θ.

• The covariance matrix of the responses become

Ω = ΓyΨΓ′

y + Θ.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 12/40

Page 13: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Cronbach’s α - reliability

• Cronbach’s α is a coefficient of reliability (or consistency).• Cronbach’s α measures how well a set of items (or

variables) measures a single unidimensional latentconstruct.

• When data have a multidimensional structure, Cronbach’s αwill usually be low.

• They are referring to how well their items measure a singleunidimensional latent construct.

• If you have multi-dimensional data, Cronbach’s α willgenerally be low for all items. In this case, run a factoranalysis to see which items load highest on whichdimensions, and then take the alpha of each subset of itemsseparately.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 13/40

Page 14: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Calculation of Cronbach’s α

• Cronbach’s alpha can be written as a function of thenumber of test items AND the average inter-correlationamong the items:

α =N − r

1 + (N − 1)r,

where N is equal to the number of items and r is theaverage inter-item correlation among the items.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 14/40

Page 15: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Cronbach’s α, cont

• One can see from this formula that if you increase thenumber of items, you increase Cronbach’s α. Additionally, ifthe average inter-item correlation is low, α will be low. Asthe average inter-item correlation increases, Cronbach’s αincreases as well.

• This makes sense intuitively - if the inter-item correlationsare high, then there is evidence that the items aremeasuring the same underlying construct. This is reallywhat is meant when someone says they have "high" or"good" reliability.

• Note that a reliability coefficient of .70 or higher isconsidered "acceptable" in most Social Science researchsituations)

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 15/40

Page 16: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Warnings on the use of Cronbach’s α

• The first problem: α is dependent not only on the magnitudeof the correlations among items, but also on the number ofitems in the scale.

• A scale can be made to look more ’homogenous’ simply bydoubling the number of items, even though the averagecorrelation remains the same. For example, if we have twoscales which each measure a distinct construct, andcombine them to form one long scale, α would probably behigh, although the merged scale is not a uni-dimensional.

• If α is too high, then it may suggest a high level of itemredundancy; that is, a number of items asking the samequestion in slightly different ways.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 16/40

Page 17: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

A potential project topic

• Confidence intervals for estimated Cronbach’s α.• Koning, A.J. Franses, Ph.H.B.F. (2003). Confidence

Intervals for Cronbach’s Coefficient Alpha Values,https://ep.eur.nl/handle/1765/431

• Dawn Iacobucci and Adam Duhachek (2003). AdvancingAlpha: Measuring Reliability With Confidence. Journal ofConsumer Psychology, Vol. 13, No. 4, Pages 478-487

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 17/40

Page 18: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Confirmatory factor analysis

• If prior information is available, in terms of substantivetheory, previous results, confirmative factor analysis (CFA)should be used where particular parameters are set toprescribed values, typically zero. For Γy is often specified asindependent clusters structure where each item load oneone and only one common factor.

• Confirmatory factor analysis is thus a hypotheticistprocedure designed to test hypotheses about therelationship between items and factors, whose number andinterpretation are determined.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 18/40

Page 19: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

An example of confirmatory factor analysis

• Mulaik (1988, Handbook of Multivariate ExperimentalPsychology) considered 9 subjective rating-scale variablesdesigned to measure two "dimensions" or factors inconnection with a soldier’s conception of firing a rifle incombat.

• The first factor, supposed to be "fear", had as indicators, thefour scales: "frightening", " never-shaking", "terrifying", and"upsetting".

• The second factor, "optimism about outcome", had asindicators the five scales: "useful", " hopeful", "controllable","successful", and "bearable".

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 19/40

Page 20: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

An example of confirmatory factor analysis, cont

• The loadings of variables on irrelevant factors were hypothesized to be zero,whereas the factors were expected to (negatively) correlated.

• An independent clusters two-factor model where each factor is measured by threenon-overlapping items:

y1j

y2j

y3j

y4j

y5j

y6j

=

β1

β2

β3

β4

β5

β6

+

1 0

λ21 0

λ31 0

λ31 0

0 1

0 λ52

0 λ62

η1j

η2j

+

ǫ1j

ǫ2j

ǫ3j

ǫ4j

ǫ5j

ǫ6j

,

where we have fixed the scale of each factor by setting one factor loading to 1.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 20/40

Page 21: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Exploratory factor analysis

• Exploratory factor analysis (EFA) is an inductivist methoddesigned to discover an optimal set of factors, their numberto be determined in the analysis. Each factor is theninterpreted and ’named’ according to the subset of itemshaving high loadings on the factor.

• The multidimensional factor model above is not identifiedwithout some restrictions on the parameters since we canmultiply the factors by an arbitrary nonsingulartransformation matrix R, η∗j = Rηj , and obtain the samecovariance matrix as the original model by multiplying thefactor loading matrix Γy by R−1,

Ω = (ΓyR−1)RΨR(R−1Γ′

y) + Θ.

• If R is orthogonal, the transformation is a rotation orreflection

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 21/40

Page 22: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Identifiability

• In confirmatory factor analysis, restrictions on the factorloadings serve to fix the factor rotation, and combined withconstraints for the factor scales (either fixing factorvariances or by fixing one factor loading for each factor) willoften suffice to identify the model.

• In exploratory factor analysis, a standard but arbitrary wayof identifying the model is to set the factor covariance matrixequal to the identify matrix Ψ = I, and fix the rotation forexample by requiring that Γ′

yΘΓy is diagonal.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 22/40

Page 23: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Indeterminancy of the factor space

• Something magic about factor analysis; we are estimtingcoefficients of variables that are not even observed. It ishard to imagine that one can estimate it at all.

• In fact, it is not possible to uniquely estimate the Fi, but onecan estimate the Fi up to a certain indeterminancy.

• Mathematically, the factors are unique except for possiblelinear combinations.

• Geometrically, suppose we think of the factors in a modelwith k = 2 as corresponding to values in a plane. Let usassume that this plane is a part of a three-dimensionalspace.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 23/40

Page 24: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Indeterminancy of the factor space

• Within this three-dimensional space, factor analysis woulddetermine which plane contains the two factors. However,any two perpendicular directions in the factor plane wouldcorrespond to factors that equally well fit the data in terms fexplaining the covariance or correlation between thevariables.

• Thus, we have factors identified up to a certain extent, butwe are allowed to rotate them within a "subspace".

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 24/40

Page 25: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Methods of rotations

• This indeterminancy of the factor space allows one to playwith different combinations of factors, that is, rotations, sothat the factors are considered "easy to interpret".

• One of goals in the factor analysis is to find factors thatrepresent some abstract concepts. This task is easier toaccomplish when the factors are associated with somesubset of he variables. That is, the factors have highloadings (in terms of absolute value) on some subset of thevariables and very low (near zero in absoulte value)loeadings on the rest of the variables.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 25/40

Page 26: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Methods of rotations

• In this case, the factor is closely associated with the subsetof the variables that have high loadings. If these variableshave something in common conceptually, for example theyare all measures of blood pressure, or in a psychologicalstudy they all seem to be related to aggressive behavior,one might then identify the specific factor as a bloodpressure factor or a aggression factor.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 26/40

Page 27: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Visual rotation

• When it is hard to interpret the data points in the originalfactors, one can try to rotate the original factors by an angelθ to see whether rotated factors have an easy interpretation,that is each rotated factor associated with a subset of thevariables.

• This can be done visually, called visual rotation.• When k > 2, it may not be easy to use the visual rotation

method.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 27/40

Page 28: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Varimax method

• Suppose we have the loadings λij for one selection offactors.

• Let θij be the loadings for different set of factors, (the linearcombinations of the old factors).

• Define the weighted quantities

γij = θij/

p∑

j=1

λ2ij ,

which are chosen to minimize the effect of largecommunalities.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 28/40

Page 29: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Varimax method

• The Varimax method chooses the θij to maximize thefollowing:

k∑

j=1

(1

p

p∑

i=1

γ4ij −

1

p2(

p∑

i=1

γ2ij)

2).

• A factor analysis is said to have a general factor if there is afactor which is associated with all or almost all of thevariables.

• The varimax method can be useful but does not allowgeneral factors and should not be used when such factorsmay occur.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 29/40

Page 30: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Quartimax method

• The second popular method is the quartimax method. Thismethod, in contrast to the varimax method, tends to haveone factor with large loadings on all the variables, and notmany large loadings among the rest of the factors.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 30/40

Page 31: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Steps of performing exploratory factor analysis

• The number of factors is determined based on a principalcomponent analysis of the correlation matrix. The numberof factor is typically chosen to be equal to the number ofeigenvalues that are larger than one, so calledKaiser-Guttmann criteria.

• A factor analysis is performed with the chosen number offactor. It is typically difficult to ascribe meaning to the factorsat this stage since most of items will have nonnegativeloadings on most factor.

• An orthogonal transformation matrix R is therefore used toproduce more interpretable loadings according to somecriteria such as loadings either "small" or "large".

• The final step is to retain only the "salient" loadings,interpreting as zero any loadings falling below an arbitrarythreshold, typically, 0.3 or 0.4.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 31/40

Page 32: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Item response models

• The unidimensional factor model can be extended todichotomous and ordinal responses using two differentapproaches.

• Factor analysis using a latent response formulation• Item response theory

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 32/40

Page 33: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Factor analysis with latent response formulation

• A binary or ordinal-scale response yi can be often viewedas a partial observation of a continuous latent response y∗i .We model y∗i by a unidimensional common factor model by

y∗ij = βi + λiηj + ǫ′ij

• The observed response yi takes one of S ordered responsecategories as, where s = 1, . . . , S, and the relationshipbetween observed and latent response can be written as

yij =

a1 if k0 < y∗ij ≤ a2

a2 if k1 < y∗ij ≤ a3

. . .

aS if kS−1 < yij ≤ aS ,

where k0 = −∞ and kS = ∞.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 33/40

Page 34: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Item response theory (IRT)

• Item Response Theory (aka IRT) is also sometimes called latent trait theory.

• The classical application of these IRT models is in ability testing, where item i

represents questions or problems in a test and the answers are scored as right (1)or wrong (0).

• Item response theory relates characteristics of items (item parameters) andcharacteristics of individuals (latent traits) to the probability of a positive response.

• A variety of IRT models have been developed for dichotomous and polytomousdata. In each case, the probability of answering correctly or endorsing a particularresponse category can be represented graphically by an item (option) responsefunction (IRF/ORF). These functions represent the nonlinear regression of aresponse probability on a latent trait, such as conscientiousness or verbal ability.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 34/40

Page 35: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

One-parameter Logistic Model

• P (yij = 1 | η) is the conditional probability of a positive response (yij = 1) to itemi by subject j

• The mathematical form of the 3PL model is shown below.

P (yij = 1 | ηj) =1

1 + exp(−a(ηj − bi)),

where: ηj represents the value of the latent trait (e.g., conscientiousness or cognitive

ability) for subject j, The a parameter affects the steepness of the curve; as "a" increases the

slope of the IRF increases. Larger "a" parameters provide betterdiscrimination among examinees.

The bi parameter represents the location of the IRF along the horizontal axis,ηj . It is commonly called the item difficulty, or threshold. Large values of b

indicate "difficult" items. Usually, we assume ηj has the standard normal distribution. An one-parameter logistic model is just a random intercept model for

dichotomous items without covariates.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 35/40

Page 36: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

The 2-parameter logistic model (2PL)

• The two-parameter logistic model allows the slope ordiscrimination parameter (a) to vary across items instead ofbeing assumed to be equal as in the 1PL model.

• The mathematical form of the 2PL model is shown below.

P (yij = 1 | ηj) =1

1 + exp(−Dai(ηj − bi)),

where D is a scaling constant equal to 1.702.• Here, bi can be interpreted as the item difficulty, giving a

50% chance of a correct answer when ability equalsdifficulty, whereas ai is an item discrimination parameterdetermining how well the item discriminates betweensubjects with different abilities.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 36/40

Page 37: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

The 3-parameter logistic model (3PL)

• One assumption in the two-parameter model is that theprobability of answering correctly tends to zero as abilitytends to minus infinity. However, this assumption isunrealistic if multiple choice formats are used, sinceguessing would produce a nonzero probability of answeringcorrectly. An extra parameter can be introduced into thetwo-parameter model leading to the three-parameter model.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 37/40

Page 38: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

The 3-parameter logistic model (3PL), cont

• The mathematical form of the 3PL model is shown below.

Pi(θ) = ci + (1 − ci)1

1 + exp(−Dai(θ − bi)),

where: the "c" parameter is commonly called the"pseudo-guessing" parameter, because it indicates theprobability of responding positively for examinees havingvery low θ.

• Note that the 2PL model can be obtained from 3PL besetting c=0; the 1PL model may be obtained by setting c=0and a=1.

• Unfortunately, huge sample may be needed to obtainreliable estimates of this model.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 38/40

Page 39: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

Structural equation models with latent variables

• Measurement and factor models relate the latent variablesto the observed variables. The structural equation modelsthe relationships among latent variables.

• Structural equation modeling with latent variables, oftenrefereed to as covariance structure analysis, focuses on thecovariance structure whereas the mean structure is typicallyeliminated by subtracting the mean from each variable.

• Having defined common factor models, a structural modelsspecifying relationships among latent variables can beconstructed.

• In this structural model, there could be both latentdependent and latent explanatory variables.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 39/40

Page 40: Lecture 6 (April 16, 2009): Principal Components and ...faculty.washington.edu/azhou/teaching/biost578... · 4/16/2009  · Measurement, Design, and Analytic Techniques in Mental

A structural equation example

• Consider a structural equation model for two latent dependent variables η1j andη2j and two latent latent independent variables ξ1j and ξ2j .

• The measurement model for the dependent variable is specified as anindependent clustered overlapping items, written as

yj = ΓY ηj + ǫj .

Similarly, the measurement model for the explanatory variables can be written as

xj = Γxξj + δj .

• The structural equation model is given as follows:

ηj = Bηj + Γξj + ζj .

• Here we assume that ξ1j and ξ2j are correlated whereas the ζ1j and ζ2j areuncorrelated.

Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences – p. 40/40