STAT 350: Lecture 14
Reparametrization
Suppose X is the design matrix of a linear model and that is the design matrix of the linear model we get by imposing some linear restrictions on the model using X. A good example is on assignment 3 but here is another. Consider the one way layout, also called the K sample problem. We have K samples from K populations with means for . Suppose the ith sample size is n+i. This is a linear model, provided we are able to assume that all the population variances are the same.
The resulting design matrix is
where there are copies of the first row, and then copies of and so on. Under the restriction there is just one parameter, say , and the design matrix is just a column of 1s. Notice that is not a submatrix of the original design matrix X. However, if A is the matrix with all entries equal to 1 we have so that the column space of is a subspace (of dimension 1) of the K dimensional column space of X. The extra sum of squares principle can thus be used to test the null hypothesis by fitting the two models and computing, using the notation of ANOVA from STAT 330:
The numerator of this sum simplifies algebraically to
so that this extra Sum of Squares F test is the usual F test in this problem; this is universal.
It is actually quite common to reparametrize the full model in such a way that the null hypothesis of interest is of the form . For the 1 way ANOVA there are two such reparametrizations in common use.
The first of these defines a grand mean parameter and individual ``effects'' . This new model has K+1 parameters apparently and the corresponding design matrix, , would not have full rank; its rank would be K although it would have K+1 columns. As such the matrix would be singular and we could not find unique least squares estimates. The problem is that we have defined the parameters in such a way that there is a linear restriction on them, namely, . We get around this problem by dropping and remembering in our model equations that .
If you now write out the model equations with and the as parameters you get the design matrix
Students will have seen this matrix in 330 in the case where all the are the same and the fractions in the last rows of are all equal to -1. Notice that the hypothesis is the same as .
The other reparametrization is ``corner-point coding'' where we define new parameters by and . For this parameterization the null hypothesis of interest is . The design matrix is