|
Chapter Contents |
Previous |
Next |
| The LOGISTIC Procedure |
If you use the single-trial syntax, the data set may also contain a variable named _LEVEL_, which indicates the level of the response that the given row of output is referring to. For instance, the value of the cumulative probability variable is the probability that the response variable is as large as the corresponding value of _LEVEL_. For details, see the section "OUT= Output Data Set".
The estimated linear predictor, its standard error estimate, all predicted probabilities, and the confidence limits for the cumulative probabilities are computed for all observations in which the explanatory variables have no missing values, even if the response is missing. By adding observations with missing response values to the input data set, you can compute these statistics for new observations or for settings of the explanatory variables not present in the data without affecting the model fit.
The following list explains specifications in the OUTPUT statement.
You can request any of the three given types of predicted probabilities. For example, you can request both the individual predicted probabilities and the cross-validated probabilities by specifying PREDPROBS=(I X).
When you specify the PREDPROBS= option, two automatic variables _FROM_ and _INTO_ are included for the single-trial syntax and only one variable, _INTO_, is included for the events/trials syntax. The _FROM_ variable contains the formatted value of the observed response. The variable _INTO_ contains the formatted value of the response level with the largest individual predicted probability.
If you specify PREDPROBS=INDIVIDUAL, the OUTPUT data set contains k additional variables representing the individual probabilities, one for each response level, where k is the maximum number of response levels across all BY-groups. The names of these variables have the form IP_xxx, where xxx represents the particular level. The representation depends on the following situations:
If you specify PREDPROBS=CUMULATIVE, the OUTPUT data set contains k additional variables representing the cumulative probabilities, one for each response level, where k is the maximum number of response levels across all BY-groups. The names of these variables have the form CP_xxx, where xxx represents the particular response level. The naming convention is similar to that given by PREDPROBS=INDIVIDUAL. The PREDPROB=CUMULATIVE values are the same as those output by the PREDPROB=keyword, but are arranged in variables on each output observation rather thin in multiple output observations.
If you specify PREDPROBS=CROSSVALIDATE, the OUTPUT data set contains k additional variables representing the cross-validated predicted probabilities of the k response levels, where k is the maximum number of response levels across all BY-groups.
The names of these variables have the form XP_xxx, where xxx represents the particular level. The representation is the same as that given by PREDPROBS=INDIVIDUAL except that for the events/trials syntax there are four variables for the cross-validated predicted probabilities instead of two:
The cross-validated predicted probabilities are precisely those used in the CTABLE option. Refer to the "Predicted Probability of an Event for Classification" section for details of the computation.
You can specify the following option after a slash.
|
Chapter Contents |
Previous |
Next |
Top |
Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.