Chapter Contents
Chapter Contents
Descriptive Statistics

Computing Summary Statistics

In this task, summary statistics (such as the mean, standard deviation, and minimum and maximum values) are desired for the birth and infant mortality rates for each region. In addition, box-and-whisker plots are requested.

Request Summary Statistics

To request the Summary Statistics task, follow these steps:

  1. Select Statistics arrow Descriptive arrow Summary Statistics...
  2. Select the analysis variables birth and death from the candidate list.

You can specify a classification variable to define groups within your data. When you specify a classification variable, the Analyst Application produces summary statistics for the analysis variables at each level of the classification variable.

  1. Select region as the classification variable.

Figure 7.7 displays the Summary Statistics main dialog with birth and death specified as the analysis variables and region specified as the classification variable.

c07ss02.gif (5063 bytes)

Figure 7.7: Summary Statistics Dialog

Request Box-and-Whisker Plots

To request box-and-whisker plots, follow these steps:

  1. Click on the Plots button.
  2. Select Box-&-whisker plot.
  3. Click OK.

Figure 7.8 displays the Plots dialog with Box-&-whisker plot selected.

c07ss02b.gif (3482 bytes)

Figure 7.8: Summary Statistics: Plots Dialog

To perform the analysis, click OK in the main dialog.

Review the Results

The results are presented in the project tree under the Summary Statistics folder, as displayed in Figure 7.9. The four icons represent the summary statistics output, the box-and-whisker plots for each analysis variable, and the SAS programming statements (labeled Code) that generate the output.

c07ss02c.gif (5663 bytes)

Figure 7.9: Summary Statistics: Project Tree

Double-click on any of the icons to display the corresponding information in a separate window.

Figure 7.10 displays, for each value of the classification variable region, the number of observations, the mean, the standard deviation, and the minimum and maximum values of each analysis variable. The western region has the highest birth rate (16.89) and the southern region has the highest death rate (10.15).

c07ss04.gif (8907 bytes)

Figure 7.10: Summary Statistics: Statistics for birth and death

Figure 7.11 displays the box-and-whisker plot for the variable birth for each level of the region variable.

c07ss03.gif (9448 bytes)

Figure 7.11: Summary Statistics: Box-and-Whisker Plot for Birth Rate by Region

This plot reveals a possible outlier in the birth rate for the midwestern region (region=`MW'). The western region (region=`W') is noticeable as the region with the highest birth rate.

Chapter Contents
Chapter Contents

Copyright © 1999 by SAS Institute Inc., Cary, NC, USA. All rights reserved.