Académique Documents
Professionnel Documents
Culture Documents
' $ ' $
Communalities
Heywood Cases
Statistical Formultion
Joreskog (1967) solved this problem by focusing on the
Much of the discussion in the 50s were procedures for choosing concentrated fit function
communalities and estimating factor loadings. There was a need
for a statistical formulation. So in my dissertation, I suggested f () = min F (, ) , (8)
that one could estimate the covariance matrix subject to the
Slide 5 Slide 7 which could be minimized numerically. If one or more of the i2
constraint
gets close to zero, this procedure becomes unstable. Joreskog
(1977) therefore developed this procedure further by
= + (diag1 )1 (5) reparameterizing
1
FML (, ) = log + tr(S1 ) log S p , (6) FGLS (, ) = tr[(I S1 )2 ] (11)
2
where
Each of these fit functions can also be minimized by minimizing
= + 2 , (7) the corresponding concentrated fit function (8).
and 2 is the diagonal matrix of error variances.
& % & %
Karl G Joreskog Factor Analysis and Its Extensions Karl G Joreskog Factor Analysis and Its Extensions
' $ ' $
&
' %
$ &
' %
$
The basic idea of factor analysis is the following. For a given set
Exploratory Factor Analysis of response variables x1 , . . . , xp one wants to find a set of
underlying latent factors 1 , . . . , k , fewer in number than the
observed variables. These latent factors are supposed to account
Exploratory factor analysis is a technique often used to detect for the intercorrelations of the response variables in the sense that
and assess latent sources of variation and covariation in observed when the factors are partialed out from the observed variables,
measurements. It is widely recognized that exploratory factor there should no longer remain any correlations between these. If
Slide 10 analysis can be quite useful in the early stages of experimentation Slide 12 both the observed response variables and the latent factors are
or test development. Thurstones (1938) primary mental abilities, measured in deviations from the mean, this leads to the model:
Frenchs (1951) factors in aptitude and achievement tests and
Guilfords (1956) structure of intelligence are good examples of
xi = i1 1 + i2 2 + + in k + i , (16)
this. The results of an exploratory factor analysis may have
heuristic and suggestive value and may generate hypotheses where i , the unique part of xi , is assumed to be uncorrelated
which are capable of more objective testing by other multivariate with 1 , 2 , . . . , k and with j for j = i. In matrix notation (16) is
methods. x = + (17)
& % & %
Karl G Joreskog Factor Analysis and Its Extensions Karl G Joreskog Factor Analysis and Its Extensions
' $ ' $
Two-Stage Least-Squares
The unique part i consists of two components: a specific factor
si and a pure random measurement error ei . These are
indistinguishable, unless the measurements xi are designed in y = x+u, (19)
Slide 13 such a way that they can be separately identified (panel designs Slide 15
= S1
xx sxy , (20)
and multitrait-multimethod designs). The term i is often called
the measurement error in xi even though it is widely recognized = (S zx S1
zz Szx )
1
S zx S1
zz szy , (21)
that this term may also contain a specific factor as stated above. (n p)1 uu (S zx S1
zz Szx )
1
, (22)
uu = syy 2 sxy + Sxx (23)
&
' %
$ &
' %
$
where u = 2 2 1 . Each equation in (26) is of the form (19) Consider data from several groups or populations. These may be
but it is not a regression equation because u is correlated with dierent nations, states, or regions, culturally or
x1 , since 1 is correlated with x1 . socioeconomically dierent groups, groups of individuals selected
Slide 17 Let Slide 19 on the basis of some known selection variables, groups receiving
dierent treatments, and control groups, etc. In fact, they may
be any set of mutually exclusive groups of individuals that are
xi = i x1 + ui , (27)
clearly defined. It is assumed that a number of variables have
be the i-th equation in (26), where i is the i-th row of 2 , and been measured on a number of individuals from each population.
let x(i) (q 1 1) be a vector of the remaining variables in x2 . This approach is particularly useful in comparing a number of
Then ui is uncorrelated with x(i) so that x(i) can be used as treatment and control groups regardless of whether individuals
instrumental variables for estimating (27). Provided q k + 1, have been assigned to the groups randomly or not.
this can be done for each i = 1, 2, . . . , q.
&
' %
$ &
' %
$
Factorial Invariance
Confirmatory Factor Analysis Consider the situation where the same tests have been
administered in G dierent groups and the factor analysis model
applied in each group:
In a confirmatory factor analysis, the investigator has such
knowledge about the factorial nature of the variables that he/she
Slide 18 Slide 20 xg = g g + g , g = 1, 2, . . . , G (28)
is able to specify that each measure xi depends only on a few of
the factors j . If xi does not depend on j , ij = 0 in (16) (Slide The covariance matrix in group g is
12). In many applications, the latent factor j represents a
theoretical construct and the observed measures xi are designed g = g g g + 2g (29)
to be indicators of this construct. In this case there is only one
non-zero ij in each equation (16). Hypothesis of factorial invariance:
1 = 2 = = G (30)
& % & %
Karl G Joreskog Factor Analysis and Its Extensions Karl G Joreskog Factor Analysis and Its Extensions
' $ ' $
xg = g + g g + g , g = 1, 2, . . . , G (31)
Slide 21 Slide 23 t = 1, 2, . . . , N (36)
Under complete factorial invariance:
& % & %
Karl G Joreskog Factor Analysis and Its Extensions Karl G Joreskog Factor Analysis and Its Extensions
' $ ' $
-
Qk
1 x1
This LISREL model was generalized in 1971-72 to include models Q1
Q
previously developed for multiple indicators of latent variables, Q
-
(x)
for confirmatory factor analysis, for simultaneous factor analysis
2 x2 21
1
Q 11
6 J Q Q
(x)
31
1
1
y1
in several populations and more general models for covariance QQs
1
- + J
structures. The basic form of the LISREL model has remained the 3 x3
Qk
21
J 123 1
P P(y)
Q32
(x)
I P Pq
21
same ever since and is still the same model as used today. The Q ? J 6 1 y2 2
- 1 Q J I
Slide 25 general form of the LISREL model, due to its flexible specification Slide 27 4 x4
(x)
2
J 21 12 21
21
- +
constraints, has proven to be so rich that it can handle not only 5 x5
3 P (y)
P42P Pq
31 32 2
the large variety of problems studied by hundreds of behavioral ? 23
y4
R
4
y + y (I B)1 ( + )
= ,
x + x
Assumptions
y A( + )A y + y Ax +
= ,
is uncorrelated with x A y + x x +
Slide 29 is uncorrelated with Slide 31 where A = (I B)1 .
& % & %
Karl G Joreskog Factor Analysis and Its Extensions Karl G Joreskog Factor Analysis and Its Extensions
' $ ' $
Some Formulas
N with W = WNT or
mghij = (1/N ) (zag zg )(zah zh )(zai zi )(zaj zj ) W = WNNT
a=1
& % & %
Karl G Joreskog Factor Analysis and Its Extensions Karl G Joreskog Factor Analysis and Its Extensions
' $ ' $
Results
nACov() = E1 VWVE1
Models can be tested
nACov(s ) = W E1
Slide 37 Slide 39 Computer technology
with W = WNT or
Simple command language
W = WNNT
c2 = n(s ) c (c WNT c )1 c (s ) Path diagram
&
' %
$ &
' %
$
250
There has been an enormous development of
structural equation modeling in the last 30 years. 200
Slide 38 Slide 40
Proof:
150
Hundreds of dissertations
50
Numerous books
0
1994 1995 1996 1997 1998 1999 2000 2001
SEM has the power to test complex hypotheses involving causal Back to Factor Analysis
relationships among construct or latent variables
SEM unifies several multivariate methods into one analytic Latent Variable Models
framework
Slide 41 SEM specifically expresses the eects of latent variables on each Slide 43
other and the eect of latent variables on observed variables f (x) = h()g(x | )d (42)
SEM can be used to test alternative hypotheses.
p
SEM gives social and behavioral researchers powerful tools for
f (x) = h() g(xi | )d (43)
stating theories more exactly, i=1
Retraction
xi = 1, 2, . . . mi (45)
k k
Is it really that great? (i)
gi (xi = s | ) = F (s(i) ij j ) F (s1 ij j ) (46)
Slide 42 In the preface of his 1975 book O. D. Duncan said that he was Slide 44 j=1 j=1
fascinated by the formal properties of causal models but held a (i) (i) (i)
(i) (i)
= 0 < 1 < 2 < m i1
< m i
=
rather agnostic view of their utility.
We have certainly come to great strides in the formal realm but t
1 1 2
are there really any great substantive applications? NOR : F (t) = (t) = e 2 u du (47)
2
et
POM : F (t) = (t) = (48)
1 + et
& % & %