Describing the Distribution of a Quantitative Variable

Carson West

AP Stats Home

Describing the Distribution of a Quantitative Variable

When analyzing a single quantitative variable, it’s crucial to describe its overall pattern and any striking deviations from that pattern. We often use a visual display like a histogram, stemplot, or boxplot (which can be found in Representing a Quantitative Variable with Graphs) to aid in this description. A common mnemonic for a complete description is SOCS: Shape, Outliers, Center, and Spread.

Shape

The shape of a distribution describes its overall form.

Outliers

Outliers are individual values that fall outside the overall pattern of the distribution. They can be due to natural variability, measurement errors, or data entry mistakes. When describing a distribution, you should always mention any apparent outliers and investigate them further.

Identifying Outliers

Center

The center of a distribution provides a typical or central value.

For more on calculating these values, refer to Summary Statistics for a Quantitative Variable.

Spread

The spread (or variability) describes how much the data values vary from each other.

For a summary of resistance to outliers:

Statistic Resistant to Outliers?
Mean No
Median Yes
Range No
Interquartile Range (IQR) Yes
Standard Deviation No

Choosing between mean/standard deviation vs. median/IQR depends on the shape:

Remember to always describe a quantitative variable in terms of its SOCS!