PROC CANDISC Statement
- PROC CANDISC < options > ;
This statement invokes the CANDISC procedure.
The options listed in the following table can
appear in the PROC CANDISC statement.
Table 21.1: CANDISC Procedure Options
Task
|
Options
|
Specify Data Sets | DATA= |
| OUT= |
| OUTSTAT= |
Control Canonical Variables | NCAN= |
| PREFIX= |
Determine Singularity | SINGULAR= |
Control Displayed Correlations | BCORR |
| PCORR |
| TCORR |
| WCORR |
Control Displayed Covariances | BCOV |
| PCOV |
| TCOV |
| WCOV |
Control Displayed SSCP Matrices | BSSCP |
| PSSCP |
| TSSCP |
| WSSCP |
Suppress Output | NOPRINT |
| SHORT |
Miscellaneous | ALL |
| ANOVA |
| DISTANCE |
| SIMPLE |
| STDMEAN |
- ALL
-
activates all of the display options.
- ANOVA
-
displays univariate statistics for testing the hypothesis that
the class means are equal in the population for each variable.
- BCORR
-
displays between-class correlations.
- BCOV
-
displays between-class covariances.
The between-class covariance matrix equals the between-class
SSCP matrix divided by n(c-1)/c, where n is the number of
observations and c is the number of classes.
The between-class covariances should be interpreted
in comparison with the total-sample and within-class
covariances, not as formal estimates of population parameters.
- BSSCP
-
displays the between-class SSCP matrix.
- DATA=SAS-data-set
-
specifies the data set to be analyzed.
The data set can be an ordinary SAS data set or one
of several specially structured data sets created
by SAS statistical procedures.
These specially structured data sets include
TYPE=CORR, COV, CSSCP, and SSCP.
If you omit the DATA= option, the procedure uses
the most recently created SAS data set.
- DISTANCE
-
displays squared Mahalanobis distances between the group means,
F statistics, and the corresponding probabilities of greater
squared Mahalanobis distances between the group means.
- NCAN=n
-
specifies the number of canonical variables to be computed.
The value of n must be less than
or equal to the number of variables.
If you specify NCAN=0, the procedure displays
the canonical correlations, but not the
canonical coefficients, structures, or means.
A negative value suppresses the canonical analysis entirely.
Let v be the number of variables in the VAR
statement and c be the number of classes.
If you omit the NCAN= option, only min(v, c-1)
canonical variables are generated; if you also specify an OUT= output data
set, v canonical variables are generated,
and the last v-(c-1) canonical variables have missing values.
- NOPRINT
-
suppresses the normal display of results. Note that this option
temporarily disables the Output Delivery System (ODS);
see Chapter 15, "Using the Output Delivery System," for more information.
- OUT=SAS-data-set
-
creates an output SAS data set containing the
original data and the canonical variable scores.
To create a permanent SAS data set, specify a
two-level name (refer to
SAS Language Reference: Concepts,
for more information on permanent SAS data sets).
- OUTSTAT=SAS-data-set
-
creates a TYPE=CORR output SAS data set that
contains various statistics including class means,
standard deviations, correlations, canonical correlations,
canonical structures, canonical coefficients, and means of
canonical variables for each class.
To create a permanent SAS data set, specify a
two-level name (refer to SAS Language Reference: Concepts,
for more information on permanent SAS data sets).
- PCORR
-
displays pooled within-class correlations (partial correlations
based on the pooled within-class covariances).
- PCOV
-
displays pooled within-class covariances.
- PREFIX=name
-
specifies a prefix for naming the canonical variables. By default the
names are Can1, Can2, Can3 and so forth. If you
specify PREFIX=Abc, the components are named Abc1, Abc2,
and so on. The number of characters in the prefix, plus the number of
digits required to designate the canonical variables, should not
exceed 32. The prefix is truncated if the combined length exceeds 32.
- PSSCP
-
displays the pooled within-class corrected SSCP matrix.
- SHORT
-
suppresses the display of canonical structures,
canonical coefficients, and class means on canonical variables;
only tables of canonical correlations and
multivariate test statistics are displayed.
- SIMPLE
-
displays simple descriptive statistics
for the total sample and within each class.
- SINGULAR=p
-
specifies the criterion for determining the singularity
of the total-sample correlation matrix and the pooled
within-class covariance matrix, where 0<p<1.
The default is SINGULAR=1E-8.
Let S be the total-sample correlation matrix. If the R2
for predicting a quantitative variable in the VAR statement from the
variables preceding it exceeds 1-p, S is considered
singular. If S is singular, the probability levels for the
multivariate test statistics and canonical correlations are adjusted
for the number of variables with R2 exceeding 1-p.
If S is considered singular and the inverse of S
(Squared Mahalanobis Distances) is required, a quasi-inverse is used
instead.
For details see the "Quasi-Inverse" section in Chapter 25, "The DISCRIM Procedure."
- STDMEAN
-
displays total-sample and pooled within-class
standardized class means.
- TCORR
-
displays total-sample correlations.
- TCOV
-
displays total-sample covariances.
- TSSCP
-
displays the total-sample corrected SSCP matrix.
- WCORR
-
displays within-class correlations for each class level.
- WCOV
-
displays within-class covariances for each class level.
- WSSCP
-
displays the within-class corrected SSCP matrix for each class level.
Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.