Checking the Assumptions

Parametric tests are statistical tests that make specific assumptions about the population distribution. They have greater statistical power than nonparametric tests when assumptions hold — you need smaller samples to detect effects. The trade-off: they're only valid when the assumptions are met.

Three assumptions most parametric tests share:

Assumption 1: Independence of Observations

Each observation must be independent of the others, both within and across groups. Check this by examining your data collection process:

  • Did you measure the same subject multiple times? → Violation (use paired tests or repeated-measures designs).
  • Is your data time-ordered and likely autocorrelated? → Violation.
  • Are observations clustered (students within classrooms, patients within hospitals)? → Violation; may need hierarchical models.

Assumption 2: Normality

The data within each group should be approximately normally distributed. Three ways to check:

  • Visual: Histogram and Q-Q plot. A Q-Q plot where points fall near the diagonal indicates normality.
  • Shapiro-Wilk test: Most widely used. Null: sample came from a normal distribution. Significant p → reject normality. Best for n up to a few thousand.
  • Anderson-Darling test: More sensitive to tail deviations.

Practical note: With very large samples, even tiny departures from normality show as significant. With very small samples, the tests have low power. Always pair formal tests with visual inspection.

Assumption 3: Homogeneity of Variance

The groups being compared should have approximately equal variances. Check with Levene's test: the null is that variances are equal; a significant p-value means they're not.

Assumption Checker
Histograms with normal overlay
Group 1
Count
Group 2
Count
Q-Q plots
Group 1
Theoretical quantilesSample
Group 2
Theoretical quantilesSample

Points near the dashed line indicate normality. Curves or S-shapes suggest departures.

Formal test results (n = 40 per group)
Shapiro-Wilk (normality)H₀: data are normally distributed
Group 1
W = 0.971, p = 0.542
✓ Pass
Group 2
W = 0.968, p = 0.483
✓ Pass
Levene's test (homogeneity of variance)H₀: group variances are equal
F = 0.41, p = 0.524✓ Pass
Recommendation

All assumptions hold. A Student's t-test or one-way ANOVA is appropriate.

40.1
Mean G1
45.8
Mean G2
8.0
SD G1
9.9
SD G2
Group 1Group 2Normal fit

Given a dataset, and run Shapiro-Wilk and Levene's tests. View Q-Q plots and histogram overlays to visually assess normality.

Checkpoint

You run a Shapiro-Wilk test on each group in your dataset (n = 2,000 per group). Both return p < 0.001, suggesting non-normality. Before concluding the parametric test is invalid, what should you also check?