Summary Statistics for a Quantitative Variable

Carson West

AP Stats Home

Summary Statistics for a Quantitative Variable

Summary statistics provide a concise numerical description of the main features of a distribution of a quantitative variable. They help us understand the center, spread, and shape of the data without having to look at every single data point. When describing a distribution, we often focus on these key characteristics.

Measures of Center

Measures of center indicate the “typical” value of a dataset.

Mean ( $ \bar{x} $ )

The arithmetic mean, often simply called the mean, is the sum of all values divided by the number of values. It’s the most common measure of center.

$$ \bar{x} = \frac{\sum x_i}{n} $$

Median (M)

The median is the midpoint of an ordered distribution. Half of the observations are smaller than the median, and half are larger.

Comparing Mean and Median

Measures of Spread (Variability)

Measures of spread describe how much the data values vary from each other.

Range

The range is the difference between the maximum and minimum values in a dataset.

$$ \text{Range} = \text{Maximum value} - \text{Minimum value} $$

Interquartile Range (IQR)

The IQR is the range of the middle 50% of the data. It’s the difference between the third quartile (Q3) and the first quartile (Q1).

$$ IQR = Q_3 - Q_1 $$

Standard Deviation ( $ s_x $ )

The standard deviation measures the typical distance of an observation from the mean. It’s the square root of the variance.

$$ s_x = \sqrt{\frac{\sum (x_i - \bar{x})^2}{n-1}} $$

Variance ( $ s_x^2 $ )

The variance is the average of the squared deviations from the mean. It is the square of the standard deviation.

$$ s_x^2 = \frac{\sum (x_i - \bar{x})^2}{n-1} $$

The Five-Number Summary

The five-number summary provides a robust numerical description of a distribution and consists of:

  1. Minimum value
  2. First Quartile ( $ Q_1 $ )
  3. Median (M)
  4. Third Quartile ( $ Q_3 $ )
  5. Maximum value

This summary is the basis for constructing Graphical Representations of Summary Statistics#Boxplots.

Identifying Outliers

Outliers are observations that fall outside the overall pattern of the distribution. A common rule for identifying potential outliers using the IQR is:

These summary statistics are fundamental tools for Describing the Distribution of a Quantitative Variable.