SAT Problem-Solving: Statistics and Data Analysis
Mean vs. median (and the outlier effect), standard-deviation comparisons, scatterplots, two-way tables, and the correlation-vs-causation trap the SAT loves to set.
Statistics shows up 2–4 times on every Digital SAT — mean, median, standard deviation, scatterplots, and sampling. Most are quick if you remember which measure is sensitive to outliers and which isn't.
Measures of center
- Mean = sum / count — sensitive to outliers
- Median = middle value (sort first!) — resistant to outliers
- Mode = most frequent value — rarely tested directly
When outliers matter
SAT loves this pattern: "If the largest value is increased by 100, what happens to the mean and median?" Mean goes up by 100/n; median doesn't move (unless the largest value was already the middle).
Standard deviation = spread
You'll never compute standard deviation by hand on the SAT — but you must compare it across data sets.
Scatterplots and lines of best fit
A scatterplot question usually asks one of three things:
- Slope of the line of best fit — interpret as "y changes by m for every 1-unit change in x"
- y-intercept of the line of best fit — predicted y when x = 0
- Predict a y-value — plug an x into the line equation
SAT will trap you with "the data shows X causes Y." Reject any causation claim from observational data — only experiments support causal language.
Margin of error and sampling
The SAT tests sampling concepts loosely. Two rules:
- Larger sample → smaller margin of error. Double the sample size, error shrinks.
- Random sample = generalizable. Non-random (e.g., volunteer) samples can't generalize, no matter how large.
Basic probability
Two-way table questions
A two-way table is the SAT's favorite stats prop. The trick is reading what the question is conditioning on:
- "What percent of all students are X?" → divide cell by grand total
- "What percent of juniors are X?" → divide cell by row (or column) total
- "Given that a student is X, ..." → that's conditional probability, denominator is the X total
Finding a missing value from a known average
A staple SAT pattern: you're given an average and asked for a single missing value.
Common mistakes
- Forgetting to sort before finding the median
- Confusing standard deviation (spread) with mean (center)
- Concluding causation from a scatterplot
- Dividing by the wrong total on two-way tables
- Generalizing from a non-random sample to a whole population