Interpreting Regression Coefficients

Now that we have the OLS line for bedrooms vs. house price, let's see what it actually means:

price^=65,000×bedrooms+100,000\hat{\text{price}} = 65{,}000 \times \text{bedrooms} + 100{,}000

This is the same y = mx + b you know from algebra!

  • 65,000 is the slope (β1\beta_1): each additional bedroom is associated with a $65,000 increase in predicted price.
  • 100,000 is the intercept (β0\beta_0): the predicted price of a house with zero bedrooms.

The slope tells you how much bedrooms matter; the intercept is the baseline.

Reading the Equation

How much does a 3-bedroom house cost, according to this model?
price^=65,000×3+100,000=295,000\hat{\text{price}} = 65{,}000 \times 3 + 100{,}000 = 295{,}000

What about a 5-bedroom house?
price^=65,000×5+100,000=425,000\hat{\text{price}} = 65{,}000 \times 5 + 100{,}000 = 425{,}000

The difference — 130,000 for two extra bedrooms — is exactly 2×β12 \times \beta_1. The slope is constant: the model says every bedroom adds the same amount regardless of whether you're going from 1→2 or 4→5.

When the Intercept Doesn't Make Sense

The intercept here is $100,000 — the predicted price of a house with zero bedrooms. Does that make sense? Maybe a studio apartment or a parking space. But if your dataset contains only 2–6 bedroom houses, then zero bedrooms is an extrapolation far outside your data. The intercept is a mathematical anchor for the line, not always a meaningful real-world quantity.


Be especially careful when the intercept is negative. A model predicting sale time from listing price might give a negative time for very cheap homes — which is physically impossible. The intercept anchors the line; it isn't always interpretable.

Regression Interpreter
Predictor
01234567100200300400500ŷ = 328$kBedroomsHouse price ($k)

Orange dot = probe point. Dashed lines show the predicted value.

Probe
Probe (bd)3.5 bd
Interpretation
Slope — β₁ = +65.0 $k/bd

For each additional 1 bd of bedrooms, predicted house price increases by 65.0 $k.

Intercept — β₀ = 100 $k

When bedrooms = 0, predicted house price = 100 $k. Predicted price at 0 bedrooms (a studio or anchor point — not always interpretable).

Prediction at Bedrooms = 3.5 bd

ŷ = 65.0 × 3.5 + 100 = 328 $k

Equation
House price = 65.0 × Bedrooms + 100
Houses Regression line Probe point

Use the probe to predict prices at specific bedroom counts. Notice: what does the intercept tell you about the predicted price at zero bedrooms?

Correlation ≠ Causation

The slope of 65,000 says bedrooms and price are associated — not that buying more bedrooms causes a house to be worth more. More bedrooms also means more square footage, better neighborhoods, and other correlated features. Simple linear regression can't separate those effects.

The regression line describes the data you have. Claims about causation require study design, natural experiments, or explicit controls for confounders.

Checkpoint

A model gives: price = 60,000 × bedrooms + 80,000. A house has 4 bedrooms and sold for $310,000. What is the residual?