Descriptive Statistics
A right-skewed salary dataset (n=40). Toggle each statistic to see it overlaid on the histogram — notice how the mean is pulled far right by a few high earners while the median stays anchored near the bulk of the data.
Measures of central tendency answer: what's typical?
- Mean is the arithmetic average. Sensitive to outliers — one billionaire moves the average household income of a small town enormously.
- Median is the middle value when data is sorted. Robust to outliers — the billionaire doesn't change the median household income.
- Mode is the most frequently occurring value. The only measure of central tendency that makes sense for purely categorical data.
When you have a skewed distribution — income, web session length, time-to-failure — the gap between mean and median tells you something. Mean substantially greater than median? Long right tail. Mean substantially less? Long left tail.
Measures of dispersion answer: how spread out is the data?
- Standard deviation is the average distance of data points from the mean.
- Variance is the square of the standard deviation. Useful in statistical derivations.
- Range is maximum minus minimum. Dominated by extremes.
- Interquartile range (IQR) is Q3 − Q1. Captures the spread of the middle half of your data, robust to outliers — which is why box plots use it.
A note on the formulas: the population standard deviation divides by N. The sample standard deviation divides by n − 1. That n − 1 is Bessel's correction — it compensates for the systematic underestimation that happens when you use the sample mean instead of the true population mean.
The Empirical Rule
If your data is approximately normal, standard deviations have a concrete meaning:
- : ~68% of data
- : ~95% of data
- : ~99.7% of data
This is why "more than 3 standard deviations from the mean" is a common outlier threshold. In a normal distribution, under 0.3% of data should be out there. If you're seeing substantially more, investigate.
Measures of distribution shape describe asymmetry and tailedness.
- Skewness measures asymmetry. Positive skewness = longer right tail (right-skewed). Negative = longer left tail. Symmetric distributions like the normal have skewness near zero.
- Kurtosis measures tailedness — how prone the distribution is to extreme values. Higher kurtosis = heavier tails = more outliers relative to a normal distribution.
Distribution explorer — adjust skewness and kurtosis sliders and see how mean vs. median diverge.
You compute descriptive statistics on a salary dataset and find: mean = $95,000, median = $62,000, standard deviation = $85,000. What is the most likely explanation, and what should you do?