|
Chapter Contents |
Previous |
Next |
| The MIXED Procedure |
The LSMEANS statement computes least-squares means (LS-means) of fixed effects. As in the GLM procedure, LS-means are predicted population margins -that is, they estimate the marginal means over a balanced population. In a sense, LS-means are to unbalanced designs as class and subclass arithmetic means are to balanced designs. The L matrix constructed to compute them is the same as the L matrix formed in PROC GLM; however, the standard errors are adjusted for the covariance parameters in the model.
Each LS-mean is computed as
where L is
the coefficient matrix associated with the least-squares mean and
is the estimate of the fixed-effects parameter
vector (see the "Estimating and in the Mixed Model" section). The approximate standard errors for
the LS-mean is computed as the square root of
.
LS-means can be computed for any effect in the MODEL statement that involves CLASS variables. You can specify multiple effects in one LSMEANS statement or in multiple LSMEANS statements, and all LSMEANS statements must appear after the MODEL statement. As in the ESTIMATE statement, the L matrix is tested for estimability, and if this test fails, PROC MIXED displays "Non-est" for the LS-means entries.
Assuming the LS-mean is estimable, PROC MIXED constructs an
approximate t-test to test the null hypothesis that the
associated population quantity equals zero. By default, the
denominator degrees of freedom for this test are the same as those
displayed for the effect in the "Tests of Fixed Effects"
table (see the "Default Output" section).
You can specify the following options in the LSMEANS statement after a slash (/).
The BON (Bonferroni) and SIDAK adjustments involve correction factors described in Chapter 28, "The GLM Procedure," and Chapter 39, "The MULTTEST Procedure"; also refer to Westfall and Young (1993). When you specify ADJUST=TUKEY and your data are unbalanced, PROC MIXED uses the approximation described in Kramer (1956). Similarly, when you specify ADJUST=DUNNETT and the LS-means are correlated, PROC MIXED uses the factor-analytic covariance approximation described in Hsu (1992). The preceding references also describe the SCHEFFE and SMM adjustments.
The SIMULATE adjustment computes adjusted p-values and
confidence limits from the simulated distribution of the maximum or
maximum absolute value of a multivariate t random vector. All
covariance parameters except the residual variance are fixed at
their estimated values throughout the simulation, potentially
resulting in some underdispersion. The simulation estimates q,
the true
th quantile, where
is the
confidence coefficient. The default
is 0.05, and you can
change this value with the ALPHA= option in the LSMEANS statement.
The number of samples is set so that the tail area
for the simulated q is within
of
with
% confidence. In equation form,

If there is an effect containing two or more covariates, the AT option sets the effect equal to the product of the individual means rather than the mean of the product (as with standard LS-means calculations). The AT MEANS option sets covariates equal to their mean values (as with standard LS-means) and incorporates this adjustment to cross products of covariates.
As an example, consider the following invocation of PROC MIXED:
proc mixed;
class A;
model Y = A X1 X2 X1*X2;
lsmeans A;
lsmeans A / at means;
lsmeans A / at X1=1.2;
lsmeans A / at (X1 X2)=(1.2 0.3);
run;
For the first two LSMEANS statements, the LS-means coefficient for
X1 is
(the mean of X1) and for
X2 is
(the mean of X2). However, for the first LSMEANS
statement, the coefficient for X1*X2 is
, but for
the second LSMEANS statement, the coefficient is
. The third LSMEANS statement
sets the coefficient for X1 equal to 1.2 and leaves it at
for X2, and the final LSMEANS statement sets these
values to 1.2 and 0.3, respectively.
If a WEIGHT variable is present, it is used in processing AT variables. Also, observations with missing dependent variables are included in computing the covariate means, unless these observations form a missing cell and the FULLX option in the MODEL statement is not in effect. You can use the E option in conjunction with the AT option to check that the modified LS-means coefficients are the ones you desire.
The AT option is disabled if you specify the BYLEVEL option.
To specify which levels of the effects are the controls, list the quoted formatted values in parentheses after the keyword CONTROL. For example, if the effects A, B, and C are class variables, each having two levels, 1 and 2, the following LSMEANS statement specifies the (1,2) level of A*B and the (2,1) level of B*C as controls:
lsmeans A*B B*C / diff=control('1' '2' '2' '1');
For multiple effects, the ordering of the list is significant, and you should check the output to make sure that the controls are correct.
Two-tailed tests and confidence limits are associated with the CONTROL difftype. For one-tailed results, use either the CONTROLL or CONTROLU difftype. The CONTROLL difftype tests whether the noncontrol levels are significantly smaller than the control; the upper confidence limits for the control minus the noncontrol levels are considered to be infinity and are displayed as missing. Conversely, the CONTROLU difftype tests whether the noncontrol levels are significantly larger than the control; the upper confidence limits for the noncontrol levels minus the control are considered to be infinity and are displayed as missing.
If you want to perform multiple comparison adjustments on the differences of LS-Means, use the ADJUST= option. For DIFF=ALL (the default), ADJUST=TUKEY is the default method, and in all other instances, the default ADJUST= option is DUNNETT. If there is a conflict between the DIFF= and ADJUST= options, the ADJUST= option takes precedence.
The differences of the LS-means are displayed in a table titled "Differences of Least Squares Means." For ODS purposes, the table name is "Diffs."
By default, OM-data-set is the same as the analysis data set. You can optionally specify another data set that describes the population for which you want to make inferences. This data set must contain all model variables except for the dependent variable (which is ignored if it is present). Specifying OM-data-set enables you to construct arbitrarily weighted LS-means.
In computing the observed margins, PROC MIXED uses all observations for which there are no missing independent variables, including those for which there are missing dependent variables. Also, if OM-data-set has a WEIGHT variable, PROC MIXED uses weighted margins to construct the LS-means coefficients. If OM-data-set is balanced, the LS-means are unchanged by the OM option.
The BYLEVEL option modifies the observed-margins LS-means. Instead of computing the margins across all of OM-data-set, PROC MIXED computes separate margins for each level of the LSMEANS effect in question. The resulting LS-means are actually equal to raw means in this case, but their estimated standard errors account for the covariance structure that you have specified. If the AT option is specified, the BYLEVEL option disables it.
You can use the E option in conjunction with either the OM or BYLEVEL option to check that the modified LS-means coefficients are the ones you desire. It is possible that the modified LS-means are not estimable when the standard ones are, or vice versa. Nonestimable LS-means are noted as "Non-est" in the output.
lsmeans A*B / slice=B;
This code tests for the simple main effects of A for B, which are calculated by extracting the appropriate rows from the coefficient matrix for the A*B LS-means and using them to form an F-test. See the "Inference and Test Statistics" section for more information on this F-test.
The SLICE option produces a table titled "Tests of Effect Slices." For ODS purposes, the table name is "Slices."
|
Chapter Contents |
Previous |
Next |
Top |
Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.