Introduction to Regression Procedures
After the model has been fit, predicted and residual values are usually calculated and output. The predicted values are calculated from the estimated regression equation; the residuals are calculated as actual minus predicted. Some procedures can calculate standard errors of residuals, predicted mean values, and individual predicted values.
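Consider the ith observation, where $x_i$ is the row of regressors, $b$ is the vector of parameter estimates, and $s^2$ is the mean squared error.

Let

$$h_i = x_i (X'X)^{-1} x_i'$$

be the leverage of the ith observation, where $X$ is the matrix whose rows are the regressor rows $x_i$. Then the predicted mean value and its standard error are

$$\hat{y}_i = x_i b$$

$$\mathrm{STDERR}(\hat{y}_i) = \sqrt{h_i s^2}$$

The standard error of the individual (future) predicted value $y_i$ is

$$\mathrm{STDERR}(y_i) = \sqrt{(1 + h_i) s^2}$$

The residual is defined as

$$\mathrm{RESID}_i = y_i - x_i b$$

$$\mathrm{STDERR}(\mathrm{RESID}_i) = \sqrt{(1 - h_i) s^2}$$

The ratio of the residual to its standard error, called the studentized residual, is sometimes shown as

$$\mathrm{STUDENT}_i = \frac{\mathrm{RESID}_i}{\mathrm{STDERR}(\mathrm{RESID}_i)}$$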
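As a minimal sketch of these quantities, the following Python fragment fits a straight line (an intercept plus one regressor) and computes the leverage, the three standard errors, and the studentized residuals. The data values and variable names are made up for illustration, and the sketch assumes the straight-line case, where the leverage $x_i (X'X)^{-1} x_i'$ reduces to a closed form; it is not how any particular procedure implements the calculation.

```python
import math

# Illustrative data (made up for this sketch): one regressor plus an intercept.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]

n = len(x)
k = 2  # number of parameters: intercept and slope

# Least squares estimates for the straight-line model.
x_bar = sum(x) / n
y_bar = sum(y) / n
sxx = sum((xi - x_bar) ** 2 for xi in x)
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
slope = sxy / sxx
intercept = y_bar - slope * x_bar

# Leverage h_i = x_i (X'X)^{-1} x_i'. For a straight line this reduces to
# 1/n + (x_i - x_bar)^2 / Sxx.
h = [1.0 / n + (xi - x_bar) ** 2 / sxx for xi in x]

# Predicted values, residuals (actual minus predicted), mean squared error.
pred = [intercept + slope * xi for xi in x]
resid = [yi - pi for yi, pi in zip(y, pred)]
s2 = sum(r ** 2 for r in resid) / (n - k)

# Standard errors of the predicted mean, the individual prediction,
# and the residual, followed by the studentized residual.
stderr_mean = [math.sqrt(hi * s2) for hi in h]
stderr_indiv = [math.sqrt((1 + hi) * s2) for hi in h]
stderr_resid = [math.sqrt((1 - hi) * s2) for hi in h]
student = [r / se for r, se in zip(resid, stderr_resid)]

for i in range(n):
    print(i + 1, round(pred[i], 4), round(resid[i], 4), round(student[i], 4))
```

Two quick sanity checks follow from the definitions: the leverages sum to the number of parameters, and the squared standard error of an individual prediction exceeds that of the mean by exactly $s^2$.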
There are two kinds of confidence intervals for predicted values. One type of confidence interval is an interval for the mean value of the response. The other type, sometimes called a prediction or forecasting interval, is an interval for the actual value of a response, which is the mean value plus error.
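For example, you can construct for the ith observation a confidence interval that contains the true mean value of the response with probability $1 - \alpha$. The upper and lower limits of the confidence interval for the mean value are

$$x_i b \pm t_{\alpha/2}\,\mathrm{STDERR}(\hat{y}_i)$$

where $\mathrm{STDERR}(\hat{y}_i)$ is the standard error of the predicted mean value and $t_{\alpha/2}$ is the tabulated t statistic with degrees of freedom equal to the degrees of freedom for the mean squared error.
The limits for the confidence interval for an actual individual response are

$$x_i b \pm t_{\alpha/2}\,\mathrm{STDERR}(y_i)$$

where $\mathrm{STDERR}(y_i)$ is the standard error of the individual predicted value.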
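The two kinds of interval can be compared with a short sketch. The function name `line_fit_intervals`, the data, and the straight-line model are assumptions made for illustration. The critical value is passed in as a number read from a t table (3.182 for $\alpha = 0.05$ with 3 degrees of freedom) rather than computed.

```python
import math

def line_fit_intervals(x, y, t_crit):
    """Return (mean_ci, indiv_ci): lists of (lower, upper) limits for a
    straight-line least squares fit, given a tabulated t critical value."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    pred = [intercept + slope * xi for xi in x]
    # Mean squared error with n - 2 degrees of freedom (two parameters).
    s2 = sum((yi - pi) ** 2 for yi, pi in zip(y, pred)) / (n - 2)

    mean_ci, indiv_ci = [], []
    for xi, pi in zip(x, pred):
        h = 1.0 / n + (xi - x_bar) ** 2 / sxx      # leverage
        se_mean = math.sqrt(h * s2)                # for the mean response
        se_indiv = math.sqrt((1.0 + h) * s2)       # for an individual response
        mean_ci.append((pi - t_crit * se_mean, pi + t_crit * se_mean))
        indiv_ci.append((pi - t_crit * se_indiv, pi + t_crit * se_indiv))
    return mean_ci, indiv_ci

# Illustrative data (made up for this sketch).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]
T_CRIT = 3.182  # tabulated t value: alpha = 0.05, 5 - 2 = 3 degrees of freedom

mean_ci, indiv_ci = line_fit_intervals(x, y, T_CRIT)
for m, p in zip(mean_ci, indiv_ci):
    print("mean:", m, "individual:", p)
```

Both intervals are centered on the same predicted value, and the interval for an individual response is always the wider of the two because it also accounts for the error term.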
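Influential observations are those that, according to various criteria, appear to have a large influence on the parameter estimates. One measure of influence, Cook's D, measures the change to the estimates that results from deleting each observation:

$$\mathrm{COOKD}_i = \frac{1}{k}\,\mathrm{STUDENT}_i^2 \left( \frac{\mathrm{STDERR}(\hat{y}_i)}{\mathrm{STDERR}(\mathrm{RESID}_i)} \right)^2$$

where $\mathrm{STUDENT}_i$ is the studentized residual, $\mathrm{STDERR}(\hat{y}_i)$ and $\mathrm{STDERR}(\mathrm{RESID}_i)$ are the standard errors of the predicted mean value and of the residual, and k is the number of parameters in the model (including the intercept). For more information, refer to Cook (1977, 1979).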
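The following sketch computes Cook's D two ways for a straight-line fit: from the studentized residual and leverage, and directly by deleting each observation, refitting, and measuring the resulting shift in the fitted values, scaled by $k s^2$. The function names and data are hypothetical, and the straight-line model is an assumption made to keep the example self-contained.

```python
def fit_line(x, y):
    """Least squares intercept and slope for a straight-line model."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

def mean_squared_error(x, y, b0, b1, k=2):
    return sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y)) / (len(x) - k)

def cooks_d(x, y, k=2):
    """Cook's D from the studentized residual and the leverage."""
    n = len(x)
    x_bar = sum(x) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    b0, b1 = fit_line(x, y)
    s2 = mean_squared_error(x, y, b0, b1, k)
    d = []
    for xi, yi in zip(x, y):
        h = 1.0 / n + (xi - x_bar) ** 2 / sxx
        student_sq = (yi - (b0 + b1 * xi)) ** 2 / ((1.0 - h) * s2)
        # STDERR(pred)^2 / STDERR(resid)^2 simplifies to h / (1 - h).
        d.append(student_sq * (h / (1.0 - h)) / k)
    return d

def cooks_d_by_deletion(x, y, k=2):
    """Cook's D by refitting without each observation in turn."""
    b0, b1 = fit_line(x, y)
    s2 = mean_squared_error(x, y, b0, b1, k)
    d = []
    for i in range(len(x)):
        c0, c1 = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        # Squared shift in all fitted values caused by dropping observation i.
        shift = sum(((b0 + b1 * xj) - (c0 + c1 * xj)) ** 2 for xj in x)
        d.append(shift / (k * s2))
    return d

# Illustrative data (made up for this sketch).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]
print(cooks_d(x, y))
print(cooks_d_by_deletion(x, y))
```

The two functions agree up to rounding error, which is why the statistic can be reported without actually refitting the model once per observation.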
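The predicted residual for observation i is defined as the residual for the ith observation that results from dropping the ith observation from the parameter estimates. The sum of squares of predicted residual errors is called the PRESS statistic:

$$\mathrm{PRESSRESID}_i = \frac{\mathrm{RESID}_i}{1 - h_i}$$

$$\mathrm{PRESS} = \sum_{i=1}^{n} \mathrm{PRESSRESID}_i^2$$

where $\mathrm{RESID}_i$ is the ordinary residual, $h_i = x_i (X'X)^{-1} x_i'$ is the leverage of the ith observation, and n is the number of observations.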
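As a rough illustration, the sketch below computes PRESS for a straight-line fit in two ways: by dividing each ordinary residual by one minus its leverage, and by literally dropping each observation, refitting, and predicting the dropped point. The function names and data are hypothetical and chosen only for this example.

```python
def fit_line(x, y):
    """Least squares intercept and slope for a straight-line model."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

def press_from_leverage(x, y):
    """PRESS from a single fit: sum of (residual / (1 - leverage))^2."""
    n = len(x)
    x_bar = sum(x) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    b0, b1 = fit_line(x, y)
    total = 0.0
    for xi, yi in zip(x, y):
        h = 1.0 / n + (xi - x_bar) ** 2 / sxx
        press_resid = (yi - (b0 + b1 * xi)) / (1.0 - h)
        total += press_resid ** 2
    return total

def press_by_refitting(x, y):
    """PRESS by dropping each observation and predicting it from the rest."""
    total = 0.0
    for i in range(len(x)):
        b0, b1 = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        total += (y[i] - (b0 + b1 * x[i])) ** 2
    return total

# Illustrative data (made up for this sketch).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]
print(press_from_leverage(x, y))
print(press_by_refitting(x, y))
```

Both routes give the same value up to rounding error, so the leverage form avoids fitting the model n separate times.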
Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.