Chapter Contents Previous Next
 The GLM Procedure

## Multivariate Analysis of Variance

If you fit several dependent variables to the same effects, you may want to make tests jointly involving parameters of several dependent variables. Suppose you have p dependent variables, k parameters for each dependent variable, and n observations. The models can be collected into one equation:

where Y is n ×p, X is n ×k, is k ×p, and is n ×p. Each of the p models can be estimated and tested separately. However, you may also want to consider the joint distribution and test the p models simultaneously.

For multivariate tests, you need to make some assumptions about the errors. With p dependent variables, there are n ×p errors that are independent across observations but not across dependent variables. Assume

where vec strings out by rows, denotes Kronecker product multiplication, and is p ×p. can be estimated by

S = [(e'e)/(n - r)] = [((Y - Xb)'(Y - Xb))/(n - r)]

where b = (X'X)-X'Y, r is the rank of the X matrix, and e is the matrix of residuals.

If S is scaled to unit diagonals, the values in S are called partial correlations of the Ys adjusting for the Xs. This matrix can be displayed by PROC GLM if PRINTE is specified as a MANOVA option.

The multivariate general linear hypothesis is written

You can form hypotheses for linear combinations across columns, as well as across rows of .

The MANOVA statement of the GLM procedure tests special cases where L corresponds to Type I, Type II, Type III, or Type IV tests, and M is the p ×p identity matrix. These tests are joint tests that the given type of hypothesis holds for all dependent variables in the model, and they are often sufficient to test all hypotheses of interest.

Finally, when these special cases are not appropriate, you can specify your own L and M matrices by using the CONTRAST statement before the MANOVA statement and the M= specification in the MANOVA statement, respectively. Another alternative is to use a REPEATED statement, which automatically generates a variety of M matrices useful in repeated measures analysis of variance. See the "REPEATED Statement" section and the "Repeated Measures Analysis of Variance" section for more information.

One useful way to think of a MANOVA analysis with an M matrix other than the identity is as an analysis of a set of transformed variables defined by the columns of the M matrix. You should note, however, that PROC GLM always displays the M matrix in such a way that the transformed variables are defined by the rows, not the columns, of the displayed M matrix.

All multivariate tests carried out by the GLM procedure first construct the matrices H and E corresponding to the numerator and denominator, respectively, of a univariate F-test.

The diagonal elements of H and E correspond to the hypothesis and error SS for univariate tests. When the M matrix is the identity matrix (the default), these tests are for the original dependent variables on the left-hand side of the MODEL statement. When an M matrix other than the identity is specified, the tests are for transformed variables defined by the columns of the M matrix. These tests can be studied by requesting the SUMMARY option, which produces univariate analyses for each original or transformed variable.

Four multivariate test statistics, all functions of the eigenvalues of E-1H (or (E+H)-1H), are constructed:

• Wilks' lambda = det(E)/det(H+E)
• Pillai's trace = trace(H(H + E)-1)
• Hotelling-Lawley trace = trace(E-1H)
• Roy's maximum root = , largest eigenvalue of E-1H

All four are reported with F approximations. For further details on these four statistics, see the "Multivariate Tests" section in Chapter 3, "Introduction to Regression Procedures."

 Chapter Contents Previous Next Top