Use visual displays and summary statistics to understand center, spread, and shape. Identify clusters, gaps, and outliers.
Marks each data value; great for small datasets.
Bins data into intervals; shows shape and spread.
Displays Q1, median, Q3, IQR, and potential outliers.
Scores: 85, 78, 92, 88, 90, 75, 82, 86, 95, 80. Find median and IQR.
Sorted: 75, 78, 80, 82, 85, 86, 88, 90, 92, 95
Median =
Q1 = 80, Q3 = 90 → IQR =
Quartiles: Q1 is median of lower half, Q3 is median of upper half (exclude the overall median when n is odd).
Outlier rule (Tukey): values outside are potential outliers.
Robust statistics: median and IQR are resistant to extreme values; mean and standard deviation are not.
Sample SD: ; Population SD: .
z-score: measures how many SDs a point is from the mean.
Chebyshev (any distribution): proportion within k SDs is at least for .
Data (sorted): 4, 5, 6, 7, 8, 12, 13, 14, 30
Median = 8; Q1 = 6; Q3 = 13 → IQR = 7
Fence:
30 is outside → potential outlier
Class scores: , . Alice scored 90, Bob scored 68. Compare relative standing.
Alice: (2 SDs above mean)
Bob: (1.67 SDs below mean)
At least what proportion of data lie within 3 SDs of the mean?
1) Compute mean, median, IQR, and SD for [2, 4, 6, 8].
2) For Q1=12, Q3=20, apply IQR rule. Is 35 an outlier?
3) A value has z=2.5 (using class mean & SD). Interpret in context.