|
Chapter Contents |
Previous |
Next |
| The CATMOD Procedure |
By default, PROC CATMOD uses the standard response functions (generalized logits, which are explained in detail in the "Understanding the Standard Response Functions" section). With these standard response functions, the default estimation method is maximum likelihood, but you can use the WLS option in the MODEL statement to request weighted least-squares estimation. With other response functions (specified in the RESPONSE statement), the default (and only) estimation method is weighted least squares.
You can specify more than one RESPONSE statement, in which case each RESPONSE statement produces a separate analysis. If the computed response functions for any population are linearly dependent (yielding a singular covariance matrix), then PROC CATMOD displays an error message and stops processing. See the "Cautions" section for methods of dealing with this.
The function specification can be any of the items in the following list. For an example of response functions generated and formulas for q (the number of response functions), see the "More on Response Functions" section.
| ALOGIT ALOGITS | specifies response functions as adjacent-category logits of the marginal probabilities for each of the dependent variables. For each dependent variable, the response functions are a set of linearly independent adjacent-category logits, obtained by taking the logarithms of the ratios of two probabilities. The denominator of the kth ratio is the marginal probability corresponding to the kth level of the variable, and the numerator is the marginal probability corresponding to the (k+1)th level. If a dependent variable has two levels, then the adjacent-category logit is the negative of the generalized logit. |
| CLOGIT CLOGITS | specifies that the response functions are cumulative logits of the marginal probabilities for each of the dependent variables. For each dependent variable, the response functions are a set of linearly independent cumulative logits, obtained by taking the logarithms of the ratios of two probabilities. The denominator of the kth ratio is the cumulative probability, ck, corresponding to the kth level of the variable, and the numerator is 1-ck (Agresti 1984, 113 -114). If a dependent variable has two levels, then PROC CATMOD computes its cumulative logit as the negative of its generalized logit. You should use cumulative logits only when the dependent variables are ordinally scaled. |
| JOINT | specifies that the response functions are the joint response probabilities. A linearly independent set is created by deleting the last response probability. For the case of one dependent variable, the JOINT and MARGINALS specifications are equivalent. |
| LOGIT LOGITS | specifies that the response functions are generalized logits of the marginal probabilities for each of the dependent variables. For each dependent variable, the response functions are a set of linearly independent generalized logits, obtained by taking the logarithms of the ratios of two probabilities. The denominator of each ratio is the marginal probability corresponding to the last observed level of the variable, and the numerators are the marginal probabilities corresponding to each of the other levels. If there is one dependent variable, then specifying LOGIT is equivalent to using the standard response functions. |
| MARGINAL MARGINALS | specifies that the response functions are marginal probabilities for each of the dependent variables in the MODEL statement. For each dependent variable, the response functions are a set of linearly independent marginals, obtained by deleting the marginal probability corresponding to the last level. |
| MEAN MEANS | specifies that the response functions are the means of the dependent variables in the MODEL statement. This specification requires that all of the dependent variables be numeric. |
| READ variables | specifies that the response functions and their covariance matrix are to be read directly from the input data set with one response function for each variable named. See the section "Inputting Response Functions and Covariances Directly" for more information. |
| transformation | specifies response functions that can be expressed by using successive applications of the four operations: LOG, EXP, * matrix literal, or + matrix literal. The operations are described in detail in the "Using a Transformation to Specify Response Functions" section. |
You can specify the following options in the RESPONSE statement after a slash.
| Value of A: | 1 | 2 | 3 |
| proportions: | p1 | p2 | p3 |
Note that
. The following table shows the
response functions generated for each population.
| Function | Value | |
| Specification | of q | Response Function |
| none* | 2 | ln( [(p1)/(p3)] ), ln( [(p2)/(p3)] ) |
| ALOGITS | 2 | ln( [(p2)/(p1)] ), ln( [(p3)/(p2)] ) |
| CLOGITS | 2 | ln( [(1-p1)/(p1)] ), ln( [(1-(p1+p2))/(p1+p2)] ) |
| JOINT | 2 | p1, p2 |
| LOGITS | 2 | ln( [(p1)/(p3)] ), ln( [(p2)/(p3)] ) |
| MARGINAL | 2 | p1, p2 |
| MEAN | 1 | 1 p1 + 2 p2 + 3 p3 |
| *Without
a function specification, the default response
functions are generalized logits. | ||
Now, suppose the dependent variables A and B each have 3 levels (valued 1, 2, and 3 each) and the response-effect in the MODEL statement is A*B. The following table shows the proportions upon which the response functions are defined.
| Value of A: | 1 | 1 | 1 | 2 | 2 | 2 | 3 | 3 | 3 |
| Value of B: | 1 | 2 | 3 | 1 | 2 | 3 | 1 | 2 | 3 |
| proportions: | p1 | p2 | p3 | p4 | p5 | p6 | p7 | p8 | p9 |
The marginal totals for the preceding table are defined as follows,

| Function | Value | |
| Specification | of q | Response Function |
| none* | 8 | ln( [(p1)/(p9)] ), ln( [(p2)/(p9)] ), ln( [(p3)/(p9)] ), ... , ln( [(p8)/(p9)] ) |
| ALOGITS | 4 | ln( [(p2 ·)/(p1 ·)] ), ln( [(p3 ·)/(p2 ·)] ), ln( [(p·2)/(p·1)] ), ln( [(p·3)/(p·2)] ) |
| CLOGITS | 4 | ln( [(1-p1 ·)/(p1 ·)] ), ln( [(1-(p1 ·+p2 ·))/(p1 ·+p2 ·)] ), ln( [(1-p·1)/(p·1)] ), ln( [(1-(p·1+p·2))/(p·1+p·2)] ) |
| JOINT | 8 | p1, p2, p3, p4, p5, p6, p7, p8 |
| LOGITS | 4 | ln( [(p1 ·)/(p3 ·)] ), ln( [(p2 ·)/(p3 ·)] ), ln( [(p·1)/(p·3)] ), ln( [(p·2)/(p·3)] ) |
| MARGINAL | 4 | p1 ·, p2 ·, p·1, p·2 |
| MEAN | 2 | 1 p1 · + 2 p2 · + 3 p3 ·, 1 p·1 + 2 p·2 + 3 p·3 |
| * Without a function specification, the default response functions are generalized logits. | ||
The READ and transformation function specifications are not shown in the preceding table. For these two situations, there is not a general response function; the response functions generated depend on what you specify.
Another important aspect of the function specification is the number of response functions generated per population, q. Let mi represent the number of levels for the ith dependent variable in the MODEL statement, and let d represent the number of dependent variables in the MODEL statement. Then, if the function specification is ALOGITS, CLOGITS, LOGITS, or MARGINALS, the number of response functions is


If the function specification is MEANS, the number of response functions per population is q=d.
| Example | Result |
response marginals; | marginals for each dependent variable |
response means; | the mean of each dependent variable |
response logits; | generalized logits of the marginal probabilities |
response clogits; | cumulative logits of the marginal probabilities |
response alogits; | adjacent-category logits of the marginal probabilities |
response joint; | the joint probabilities |
response 1 -1 log; | the logit |
response; | generalized logits |
response 1 2 3; | the mean score, with scores of 1, 2, and 3 corresponding to the three response levels |
response read b1-b4; | four response functions and their covariance matrix, read directly from the input data set |
| Operation | Specification |
| linear combination | * matrix literal |
| matrix literal | |
| logarithm | LOG |
| exponential | EXP |
| adding constant | + matrix literal |
If more than one operation is specified, then PROC CATMOD applies the operations consecutively from right to left.
A matrix literal is a matrix of numbers with each row of the matrix separated from the next by a comma. If you specify a linear combination, in most cases the * is not needed. The following statement defines the response function p1+1. The * is needed to separate the two matrix literals '1' and '1 0'.
response + 1 * 1 0;
The LOG of a vector transforms each element of the vector into its natural logarithm; the EXP of a vector transforms each element into its exponential function (antilogarithm).
In order to specify a linear response function for data that have r=3 response categories, you could specify either of the following RESPONSE statements:
response * 1 0 0 , 0 1 0; response 1 0 0 , 0 1 0;
The matrix literal in the preceding statements specifies a 2 ×3 matrix, which is applied to each population as follows:
![[ F_1 \ F_2 ] =
[ 1 & 0 & 0 \ 0 & 1 & 0
] *
[ p_1 \ p_2 \ p_3 ]](images/cateq32.gif)
As another example of the linear response function, suppose you have two dependent variables corresponding to two observers who evaluate the same subjects. If the observers grade on the same three-point scale and if all nine possible responses are observed, then the following RESPONSE statement would compute the probability that the observers agree on their assessments:
response 1 0 0 0 1 0 0 0 1;
This response function is then computed as
![F = p_{11} + p_{22} + p_{33} =
[ 1 & 0 & 0 & 0 & 1 & 0 & 0 & 0 & 1 \ ] *
[ p_{11} \ p_{12} \ p_{13} \
p_{21} \ p_{22} \ p_{23} \
p_{31} \ p_{32} \ p_{33} \ ]](images/cateq33.gif)
If the function is a compound function, requiring more than one operation to specify it, then the operations should be listed so that the first operation to be applied is on the right and the last operation to be applied is on the left. For example, if there are two response levels, the response function
response 1 -1 log;
is equivalent to the matrix expression:
![F = [ 1 & -1 \ ] *
[ \log(p_1) \ \log(p_2) \
]
= \log(p_1)-log(p_2) = \log(\frac{p_1}{p_2})](images/cateq34.gif)
Another example of a compound response function is
response exp 1 -1 * 1 0 0 1, 0 1 1 0 log;which is equivalent to the matrix expression
![A = [ 1 & -1 \ ]
{ and }
B = [ 1 & 0 & 0 & 1 \ 0 & 1 & 1 & 0 \ ]](images/cateq35.gif)
which is the odds (crossproduct) ratio for a 2 ×2 table.
response 1 0 0 -1,
0 1 0 -1,
0 0 1 -1 log;
This results in three response functions:
![F = [ F_1 \ F_2 \ F_3 \ ] =
[ \log(p_1/p_4) \ \log(p_2/p_4) \ \log(p_3/p_4) \ ]](images/cateq36.gif)


|
Chapter Contents |
Previous |
Next |
Top |
Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.