Descriptive Statistics

Descriptive Statistics Explorer
$18kAnnual income$180k
mean
Mean: The mean ($47k) is pulled rightward by the handful of high earners. It's higher than the median — a tell-tale sign of right skew. In skewed distributions, mean overstates what's "typical."

A right-skewed salary dataset (n=40). Toggle each statistic to see it overlaid on the histogram — notice how the mean is pulled far right by a few high earners while the median stays anchored near the bulk of the data.

Measures of central tendency answer: what's typical?

  • Mean is the arithmetic average. Sensitive to outliers — one billionaire moves the average household income of a small town enormously.
  • Median is the middle value when data is sorted. Robust to outliers — the billionaire doesn't change the median household income.
  • Mode is the most frequently occurring value. The only measure of central tendency that makes sense for purely categorical data.

When you have a skewed distribution — income, web session length, time-to-failure — the gap between mean and median tells you something. Mean substantially greater than median? Long right tail. Mean substantially less? Long left tail.

Measures of dispersion answer: how spread out is the data?

  • Standard deviation is the average distance of data points from the mean.
  • Variance is the square of the standard deviation. Useful in statistical derivations.
  • Range is maximum minus minimum. Dominated by extremes.
  • Interquartile range (IQR) is Q3 − Q1. Captures the spread of the middle half of your data, robust to outliers — which is why box plots use it.

A note on the formulas: the population standard deviation divides by N. The sample standard deviation divides by n − 1. That n − 1 is Bessel's correction — it compensates for the systematic underestimation that happens when you use the sample mean instead of the true population mean.

The Empirical Rule

If your data is approximately normal, standard deviations have a concrete meaning:

  • pm1sigmapm 1sigma: ~68% of data
  • pm2sigmapm 2sigma: ~95% of data
  • pm3sigmapm 3sigma: ~99.7% of data

This is why "more than 3 standard deviations from the mean" is a common outlier threshold. In a normal distribution, under 0.3% of data should be out there. If you're seeing substantially more, investigate.

Measures of distribution shape describe asymmetry and tailedness.

  • Skewness measures asymmetry. Positive skewness = longer right tail (right-skewed). Negative = longer left tail. Symmetric distributions like the normal have skewness near zero.
  • Kurtosis measures tailedness — how prone the distribution is to extreme values. Higher kurtosis = heavier tails = more outliers relative to a normal distribution.
Distribution Explorer
Skewness0.0
Left-skewedRight-skewed
Symmetric
median
LowHigh
Symmetric distribution — the mean and median coincide.
Kurtosis0.0
Flat / light tailsPeaked / heavy tails
Mesokurtic (normal)
LowHigh
Mesokurtic — kurtosis close to zero, similar to a normal distribution. Tails and peak are neither unusually heavy nor light.
0.0
Skewness
0.0
Kurtosis

Distribution explorer — adjust skewness and kurtosis sliders and see how mean vs. median diverge.

Checkpoint

You compute descriptive statistics on a salary dataset and find: mean = $95,000, median = $62,000, standard deviation = $85,000. What is the most likely explanation, and what should you do?