Statistics & Data Analysis: Mean, Median, Mode, and the Correlation Coefficient

Mean, median, and mode summarize a data set in one number — but they tell different stories. Plus correlation vs causation, and how to read a correlation coefficient.

8 分钟 TEKS 4A,4B,4C 代数1

One number to summarize many

Statistics is the art of compressing a list of numbers into a single value that captures the “typical” or “middle.” The Texas Algebra 1 CBE tests three measures of center, one measure of spread, and the basics of correlation.

Three measures of center

Mean (average)
Add all values, divide by count. Sensitive to outliers.
Median
Sort first. The middle value (or average of two middles if count is even). Resistant to outliers.
Mode
The value that appears most often. Can have multiple modes, or none.

Worked: mean

Data: 4, 8, 12, 16, 20 Sum = 60, count = 5 Mean = 60 / 5 = 12
Practice

Find the mean

Find the mean of the data set: 4, 8, 12, 16, 20.

Open the question →

Worked: median

Data: 3, 7, 9, 11, 14, 18, 22  (already sorted) Count = 7 → middle is the 4th value Median = 11 If count were even (say 8), you'd average the 4th and 5th values.
Practice

Find the median

Find the median of: 3, 7, 9, 11, 14, 18, 22.

Open the question →

Why median sometimes wins

Outliers shift the mean

Five salaries: 40k, 42k, 45k, 50k, 2,000k (the boss).
Mean ≈ $435k — misleading! Median = $45k — truer picture.
When data has extreme values, prefer median.

Correlation coefficient (r)

The correlation coefficient r measures how well a straight line fits a scatter plot. It always falls between −1 and +1.

−1 perfect negative −0.7 strong neg. 0 no correlation +0.7 strong pos. +1 perfect positive
The closer |r| is to 1, the tighter the line fits. Sign tells direction; magnitude tells strength.
Practice

Interpret r

A correlation coefficient of r = −0.95 indicates:

Open the question →

Correlation vs causation

Correlation is not causation

Two variables can move together (high correlation) without one causing the other. Ice cream sales and drowning rates both rise in summer — but ice cream doesn't cause drowning. Both are caused by hot weather. Causation requires controlled experiments, not just observation.

Practice

Define causation

Causation means:

Open the question →

3-second recap

  • Mean = average. Median = middle (after sorting). Mode = most frequent.
  • Outliers? Prefer the median.
  • r in [−1, +1]: sign = direction, |r| = strength.
  • Correlation ≠ causation.