Structural Equation Models
PROC CALIS can analyze matrix models of the form
-
C = F1 P1 F1' + ... + Fm Pm Fm'
where C is a symmetric correlation or covariance
matrix, each matrix Fk, k = 1, ... ,m, is the product
of n(k) matrices Fk1, ... ,Fkn(k),
and each matrix Pk is symmetric, that is,
-
Fk = Fk1 ... Fkn(k) and Pk = Pk' , k = 1, ... ,m
The matrices Fkj and Pk in the model are
parameterized by the matrices Gkj and Qk
where you can specify the type of matrix desired.
The matrices Gkj and Qk can contain
- constant values
- parameters to be estimated
- values computed from parameters via programming statements
The parameters can be summarized in a
parameter vector X = (x1, ... , xt).
For a given covariance or correlation matrix C,
PROC CALIS computes the unweighted least-squares (ULS),
generalized least-squares (GLS), maximum likelihood (ML),
weighted least-squares (WLS), or diagonally weighted
least-squares (DWLS) estimates of the vector X.
Some Special Cases of the Generalized COSAN Model
Original COSAN (Covariance Structure Analysis) Model (McDonald 1978, 1980)
Covariance Structure:
-
C = F1 ... Fn PFn' ... F1'
RAM (Reticular Action) Model (McArdle 1980; McArdle and McDonald 1984)
Structural Equation Model:
where A is a matrix of coefficients, and
v and are vectors of random variables.
The variables in v and
can be manifest or latent variables.
The endogenous variables corresponding to the components in v
are expressed as a linear combination of the remaining variables
and a residual component in with covariance matrix P.
Covariance Structure:
-
C = J(I-A)-1 P((I-A)-1)' J'
with selection matrix J and
LINEQS (Linear Equations) Model (Bentler and Weeks 1980)
Structural Equation Model:
where and are coefficient matrices,
and and are vectors of random variables.
The components of correspond to the endogenous
variables; the components of correspond to
the exogenous variables and to error variables.
The variables in and
can be manifest or latent variables.
The endogenous variables in are
expressed as a linear combination of the remaining
endogenous variables, of the exogenous variables
of , and of a residual component in .
The coefficient matrix describes the
relationships among the endogenous variables of
, and should be nonsingular.
The coefficient matrix describes the
relationships between the endogenous variables of
and the exogenous and error variables of .
Covariance Structure:
with selection matrix J, , and
Keesling - Wiley - Jreskog LISREL (Linear Structural Relationship) Model
Structural Equation Model and Measurement Models:
where and are vectors of latent variables
(factors), and x and y are vectors of manifest variables.
The components of correspond to
endogenous latent variables; the components of
correspond to exogenous latent variables.
The endogenous and exogenous latent variables are
connected by a system of linear equations (the
structural model) with coefficient matrices
B and and an error vector .It is assumed that matrix I- B is nonsingular.
The random vectors y and x correspond to manifest
variables that are related to the latent variables
and by two systems of linear equations (the measurement
model) with coefficients and
and with measurement errors and .Covariance Structure:
with selection matrix J,
,,,and .Higher-Order Factor Analysis Models
First-order model:
-
C = F1 P1 F1' + U21
Second-order model:
-
C = F1 F2 P2 F2' F1' + F1 U22 F1' + U21
Example of McDonald (1980): k=3: Occasions of Measurement;
n=3: Variables (Tests); m=2: Common Factors
For more information on this model, see Example 19.6.
A Structural Equation Example
This example from Wheaton et al. (1977) illustrates
the relationships among the RAM, LINEQS, and LISREL
models.
Different structural models for these data are
in Jreskog and
Srbom (1985) and in
Bentler (1985, p. 28).
The data set contains covariances among six (manifest) variables
collected from 932 people in rural regions of Illinois:
- Variable 1:
- V1, y1 : Anomia 1967
- Variable 2:
- V2, y2 : Powerlessness 1967
- Variable 3:
- V3, y3 : Anomia 1971
- Variable 4:
- V4, y4 : Powerlessness 1971
- Variable 5:
- V5, x1 : Education (years of schooling)
- Variable 6:
- V6, x2 : Duncan's Socioeconomic Index (SEI)
It is assumed that anomia and powerlessness are indicators
of an alienation factor and that education and SEI
are indicators for a socioeconomic status (SES) factor.
Hence, the analysis contains three latent variables:
- Variable 7:
- F1, : Alienation 1967
- Variable 8:
- F2, : Alienation 1971
- Variable 9:
- F3, : Socioeconomic Status (SES)
The following path diagram shows the structural model
used in Bentler (1985, p. 29) and slightly modified in
Jreskog and Srbom (1985, p. 56).
In this notation for the path diagram, regression coefficients
between the variables are indicated as one-headed arrows.
Variances and covariances among the
variables are indicated as two-headed arrows.
Indicating error variances and covariances as two-headed
arrows with the same source and destination (McArdle
1988; McDonald 1985) is helpful in transforming the path
diagram to RAM model list input for the CALIS procedure.
Figure 19.1: Path Diagram of Stability and Alienation Example
Variables in Figure 19.1 are as follows:
- Variable 1:
- V1, y1 : Anomia 1967
- Variable 2:
- V2, y2 : Powerlessness 1967
- Variable 3:
- V3, y3 : Anomia 1971
- Variable 4:
- V4, y4 : Powerlessness 1971
- Variable 5:
- V5, x1 : Education (years of schooling)
- Variable 6:
- V6, x2 : Duncan's Socioeconomic Index (SEI)
- Variable 7:
- F1, : Alienation 1967
- Variable 8:
- F2, : Alienation 1971
- Variable 9:
- F3, : Socioeconomic Status (SES)
RAM Model
The vector v contains the six manifest
variables v1 = V1, ... ,v6 = V6 and the three
latent variables v7=F1, v8=F2, v9=F3.
The vector contains the corresponding error variables
u1 = E1, ... , u6 = E6 and u7=D1, u8=D2, u9=D3.
The path diagram corresponds to the following
set of structural equations of the RAM model:
This gives the matrices A and P in the RAM model:
The RAM model
input specification of this example for the CALIS procedure is
in the "RAM Model Specification" section.
LINEQS Model
The vector contains the six endogenous manifest variables
V1, ... , V6 and the two endogenous latent variables F1 and F2.
The vector contains the exogenous error variables
E1, ... , E6, D1, and D2 and the exogenous latent variable
F3. The path diagram corresponds to the following set of structural
equations of the LINEQS model:
This gives the matrices , and in
the LINEQS model:
The LINEQS model
input specification of this example for the CALIS procedure is
in the section "LINEQS Model Specification".
LISREL Model
The vector y contains the four endogenous manifest variables
y1 = V1, ... , y4 = V4, and the vector x contains the exogenous
manifest variables x1=V5 and x2=V6. The vector contains the error variables corresponding to y, and the vector contains the error
variables and corresponding to x.
The vector contains the endogenous latent variables (factors)
and , while the vector contains the
exogenous latent variable (factor) . The vector contains the errors and in the
equations (disturbance terms) corresponding to .The path diagram corresponds to the following set of structural
equations of the LISREL model:
This gives the matrices , ,B, , and in the LISREL model:
The CALIS procedure does not provide a LISREL
model input specification. However, any model that can be specified
by the LISREL model can also be specified by using the COSAN,
LINEQS, or RAM model specifications in PROC CALIS.
Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.