|
Chapter Contents |
Previous |
Next |
| The MIXED Procedure |
An intercept is included in the fixed-effects model by default. If no fixed effects are specified, only this intercept term is fit. The intercept can be removed by using the NOINT option.
You can specify the following options in the MODEL statement after a slash (/).
model Y = A B A*B / ddf=3,.,4.7;
assigns 3 denominator degrees of freedom to A and 4.7 to A*B, while those for B remain the same.
The DDFM=CONTAIN option invokes the containment method to compute denominator degrees of freedom, and it is the default when you specify a RANDOM statement. The containment method is carried out as follows: Denote the fixed effect in question A, and search the RANDOM effect list for the effects that syntactically contain A. For example, the RANDOM effect B(A) contains A, but the RANDOM effect C does not, even if it has the same levels as B(A).
Among the RANDOM effects that contain A, compute their rank contribution to the (X Z) matrix. The DDF assigned to A is the smallest of these rank contributions. If no effects are found, the DDF for A is set equal to the residual degrees of freedom, N - rank(X Z). This choice of DDF matches the tests performed for balanced split-plot designs and should be adequate for moderately unbalanced designs.
Caution: If you have a Z matrix with a large number of columns, the overall memory requirements and the computing time after convergence can be substantial for the containment method. If it is too large, you may want to use the DDFM=BETWITHIN option.
The DDFM=BETWITHIN option is the default for REPEATED statement specifications (with no RANDOM statements). It is computed by dividing the residual degrees of freedom into between-subject and within-subject portions. PROC MIXED then checks whether a fixed effect changes within any subject. If so, it assigns the within-subject degrees of freedom to the effect; otherwise, it assigns the between-subject degrees of freedom to the effect (refer to Schluchter and Elashoff 1990). One exception is the case where you have specified no RANDOM statements but you have specified a REPEATED statement with the TYPE=UN option; in this case, all effects are assigned the between-subject degrees of freedom.
The DDFM=RESIDUAL option performs all tests using the residual degrees of freedom, n - rank(XZ), where n is the number of observations.
The DDFM=SATTERTH option performs a general Satterthwaite
approximation for the denominator degrees of freedom, computed as
follows. Let C = (X'V-1X)-, where - denotes a generalized
inverse, and let
be the vector of unknown parameters in V.
Let
and
be the corresponding estimates.
We first consider the one-dimensional case, and
consider l to be a vector defining an estimable
linear combination of
.The Satterthwaite degrees of freedom for the t-statistic


For the multi-dimensional case, let L be an estimable contrast matrix of rank q > 1. The Satterthwaite denominator degrees of freedom for the F-statistic




This method is a generalization of the techniques described in
Giesbrecht and Burns (1985), McLean and Sanders (1988), and Fai and
Cornelius (1996). The method can also include estimated random
effects. In this case, append
to
and
change
to be the inverse of the coefficient matrix in the
mixed model equations. The calculations require extra memory to hold
c matrices that are the size of the mixed model equations, where c
is the number of covariance parameters. In the notation of
Table 37.9, this is approximately 8q(p+g)(p+g)/2
bytes. Extra computing time is also required to process these
matrices. The Satterthwaite method implemented here is intended to
produce an accurate F-approximation; however, the results may
differ from those produced by PROC GLM. Also, the small sample
properties of this approximation have not been extensively
investigated for the various models available with PROC MIXED.
Predicted values are formed by using the rows from (X Z) as
L matrices. The predicted values from the original data are,
thus,
. Their
approximate standard errors of prediction are formed from the
quadratic form of L with
defined in
the "Statistical Properties" section. The L95
and U95 variables provide a t-type confidence interval for the
predicted values, and they correspond to the L95M and U95M
variables from the GLM and REG procedures for fixed-effect models.
The residuals are the observed minus the predicted values.
Predicted values for data points other than those observed can be
obtained by using missing dependent variables in your input data
set.
Specifications that have a REPEATED
statement with the SUBJECT= option and missing dependent variables
compute predicted values using empirical best
linear unbiased prediction (EBLUP).
Using hats
to
denote estimates, the EBLUP formula is

![\hat{Var}(\hat{m} - m) &=& \hat{V}_{m}-
\hat{C}_{m}\hat{V}^{-1} \hat{C}_{m}^T +...
...\hat{V}^{-1} X]
( X^T \hat{V}^{-1} X)^-
[X_{m}- \hat{C}_{m}\hat{V}^{-1} X]^T](images/mixeq67.gif)
By default, all variables from the input data set are included in the OUTP= data set. You can select a subset of these variables using the ID statement.
The output data set is of the same form as that resulting from the
OUTP= option, except that the predicted values do not incorporate
the EBLUP values
nor do they use the EBLUPs
for specifications that have a REPEATED statement with the SUBJECT=
option and missing dependent variables. The predicted values are
formed as
in the OUTPM= data set, and standard
errors are quadratic forms in the approximate variance-covariance
matrix of
as displayed by the COVB option.
By default, all variables from the input data set are included in the OUTPM= data set. You can select a subset of these variables using the ID statement.
Along with the estimates and their approximate standard errors, a t-statistic is computed as the estimate divided by its standard error. The degrees of freedom for this t-statistic matches the one appearing in the "Tests of Fixed Effects" table under the effect containing the parameter. The "Pr > |t|" column contains the two-tailed p-value corresponding to the t-statistic and associated degrees of freedom. You can use the CL option to request confidence intervals for all of the parameters; they are constructed around the estimate by using a radius of the standard error times a percentage point from the t-distribution.
|
Chapter Contents |
Previous |
Next |
Top |
Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.