|
Chapter Contents |
Previous |
Next |
| The LOGISTIC Procedure |
Two forms of the MODEL statement can be specified. The first form, referred to as single-trial syntax, is applicable to both binary response data and ordinal response data. The second form, referred to as events/trials syntax, is restricted to the case of binary response data. The single-trial syntax is used when each observation in the DATA= data set contains information on only a single trial, for instance, a single subject in an experiment. When each observation contains information on multiple binary-response trials, such as the counts of the number of subjects observed and the number responding, then events/trials syntax can be used.
In the single-trial syntax, you specify one variable (preceding the equal sign) as the response variable. This variable can be character or numeric. Values of this variable are sorted by the ORDER= option (and the DESCENDING option, if specified) in the PROC LOGISTIC statement.
In the events/trials syntax, you specify two variables that contain count data for a binomial experiment. These two variables are separated by a slash. The value of the first variable, events, is the number of positive responses (or events). The value of the second variable, trials, is the number of trials. The values of both events and (trials-events) must be nonnegative and the value of trials must be positive for the response to be valid.
For both forms of the MODEL statement, the independents following the equal sign are the explanatory variables for the model. These variables must be numeric.
Table 35.1 summarizes the options available in the MODEL statement.
Table 35.1: Model Statement OptionsThe following list describes these options.



Note that the INCLUDE= and START= options perform different tasks: the INCLUDE= option includes the first n explanatory variables in every model, whereas the START= option only requires that the first n explanatory variables appear in the first model.
A classification table for each of several cutpoints can be requested by specifying a list. For example,
pprob= (0.3, 0.5 to 0.8 by 0.1)
requests a classification of the observations for each of the cutpoints 0.3, 0.5, 0.6, 0.7, and 0.8. If the PPROB= option is not specified, the default is to display the classification for a range of probabilities from the smallest estimated probability (rounded below to the nearest 0.02) to the highest estimated probability (rounded above to the nearest 0.02) with 0.02 increments.
You can use the AGGREGATE (or AGGREGATE=) option to define the subpopulations for calculating the Pearson chi-square statistic and the deviance. In the absence of the AGGREGATE (or AGGREGATE=) option, each observation is regarded as coming from a different subpopulation. For the events/trials syntax, each observation consists of n Bernoulli trials, where n is the value of the trials variable. For single-trial syntax, each observation consists of a single response, and for this setting it is not appropriate to carry out the Pearson or deviance goodness-of-fit analysis. Thus, PROC LOGISTIC ignores specifications SCALE=P, SCALE=D, and SCALE=N when single-trial syntax is specified without the AGGREGATE (or AGGREGATE=) option.
The "Deviance and Pearson Goodness-of-Fit Statistics" table includes the Pearson chi-square statistic, the deviance, their degrees of freedom, the ratio of each statistic divided by its degrees of freedom, and the corresponding p-value. For more information, see the "Overdispersion" section.


|
Chapter Contents |
Previous |
Next |
Top |
Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.