|
Chapter Contents |
Previous |
Next |
| The REG Procedure |
Table 50.2 lists the options available in the MODEL statement. Equations for the statistics available are given in the "Model Fit and Diagnostic Statistics" section.
Table 50.2: MODEL Statement Options| Option | Description |
| Model Selection and Details of Selection | |
| SELECTION= | specifies model selection method |
| BEST= | specifies maximum number of subset models displayed or output to the OUTEST= data set |
| DETAILS | produces summary statistics at each step |
| DETAILS= | specifies the display details for forward, backward, and stepwise methods |
| GROUPNAMES= | provides names for groups of variables |
| INCLUDE= | includes first n variables in the model |
| MAXSTEP= | specifies maximum number of steps that may be performed |
| NOINT | fits a model without the intercept term |
| PCOMIT= | performs incomplete principal component analysis and outputs estimates to the OUTEST= data set |
| SLE= | sets criterion for entry into model |
| RIDGE= | performs ridge regression analysis and outputs estimates to the OUTEST= data set |
| SLS= | sets criterion for staying in model |
| START= | specifies number of variables in model to begin the comparing and switching process |
| STOP= | stops selection criterion |
| Fit Statistics | |
| ADJRSQ | computes adjusted R2 |
| AIC | computes Akaike's information criterion |
| B | computes parameter estimates for each model |
| BIC | computes Sawa's Bayesian information criterion |
| CP | computes Mallows' Cp statistic |
| GMSEP | computes estimated MSE of prediction assuming multivariate normality |
| JP | computes Jp, the final prediction error |
| MSE | computes MSE for each model |
| PC | computes Amemiya's prediction criterion |
| RMSE | displays root MSE for each model |
| SBC | computes the SBC statistic |
| SP | computes Sp statistic for each model |
| SSE | computes error sum of squares for each model |
| Data Set Options | |
| EDF | outputs the number of regressors, the error degrees of freedom, and the model R2 to the OUTEST= data set |
| OUTSEB | outputs standard errors of the parameter estimates to the OUTEST= data set |
| OUTSTB | outputs standardized parameter estimates to the OUTEST= data set. Use only with the RIDGE= or PCOMIT= option. |
| OUTVIF | outputs the variance inflation factors to the OUTEST= data set. Use only with the RIDGE= or PCOMIT= option. |
| PRESS | outputs the PRESS statistic to the OUTEST= data set |
| RSQUARE | has same effect as the EDF option |
| Regression Calculations | |
| I | displays inverse of sums of squares and crossproducts |
| XPX | displays sums-of-squares and crossproducts matrix |
| Details on Estimates | |
| ACOV | displays asymptotic covariance matrix of estimates assuming heteroscedasticity |
| COLLIN | produces collinearity analysis |
| COLLINOINT | produces collinearity analysis with intercept adjusted out |
| CORRB | displays correlation matrix of estimates |
| COVB | displays covariance matrix of estimates |
| PCORR1 | displays squared partial correlation coefficients using Type I sums of squares |
| PCORR2 | displays squared partial correlation coefficients using Type II sums of squares |
| SCORR1 | displays squared semi-partial correlation coefficients using Type I sums of squares |
| SCORR2 | displays squared semi-partial correlation coefficients using Type II sums of squares |
| SEQB | displays a sequence of parameter estimates during selection process |
| SPEC | tests that first and second moments of model are correctly specified |
| SS1 | displays the sequential sums of squares |
| SS2 | displays the partial sums of squares |
| STB | displays standardized parameter estimates |
| TOL | displays tolerance values for parameter estimates |
| VIF | computes variance-inflation factors |
| Predicted and Residual Values | |
| CLB | computes |
| CLI | computes |
| CLM | computes |
| DW | computes a Durbin-Watson statistic |
| INFLUENCE | computes influence statistics |
| P | computes predicted values |
| PARTIAL | displays partial regression plots for each regressor |
| R | produces analysis of residuals |
| Display Options and Other Options | |
| ALL | requests the following options: ACOV, CLB, CLI, CLM, CORRB, COVB, I, P, PCORR1, PCORR2, R, SCORR1, SCORR2, SEQB, SPEC, SS1, SS2, STB, TOL, VIF, XPX |
| ALPHA= | sets significance value for confidence and prediction intervals and tests |
| NOPRINT | suppresses display of results |
| SIGMA= | specifies the true standard deviation of error term for computing CP and BIC |
| SINGULAR= | sets criterion for checking for singularity |
You can specify the following options in the MODEL statement after a slash (/).
If the BEST= option is used without the B option (displaying estimated regression coefficients), the variables in each MODEL are listed in order of inclusion instead of the order in which they appear in the MODEL statement.
If the BEST= option is omitted and the number of regressors is less than 11, all possible subsets are evaluated. If the BEST= option is omitted and the number of regressors is greater than 10, the number of subsets selected is, at most, equal to the number of regressors. A small value of the BEST= option greatly reduces the CPU time required for large problems.
For example,
model y={x1 x2} x3 / selection=stepwise
groupnames='x1 x2' 'x3';
As another example,
model y={ht wgt age} bodyfat / selection=forward
groupnames='htwgtage' 'bodyfat';
If you specify the PCOMIT= option, RESTRICT statements are ignored. The PCOMIT= option is ignored if you use the SELECTION= option in the MODEL statement.
Only nonnegative numbers can be specified with the RIDGE= option. Example 50.10 illustrates this option.
If you specify the RIDGE= option, RESTRICT statements are ignored. The RIDGE= option is ignored if you use the SELECTION= option in the MODEL statement.
For the RSQUARE, ADJRSQ, and CP methods, START=s specifies the smallest number of regressors to be reported in a subset model. For these methods, the default is START=1.
The START= option cannot be used with model-selection methods other than the six described here.
The default setting for the STOP= option is the number of variables in the MODEL statement. This option can be used only with the MAXR, MINR, RSQUARE, ADJRSQ and CP methods.
|
Chapter Contents |
Previous |
Next |
Top |
Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.