Chapter 5
Sampling
Every claim a data scientist makes is a claim about a population, made from a sample. This chapter covers the major sampling methods and connects them to the ML workflows where sampling decisions have direct consequences: train/test splits, cross-validation, and handling class imbalance.