Useful Data Tips

Why Causal Inference is Hard

⏱️ 8 sec read 🔬 Causal Inference

Most "X caused Y" takes are just good lighting on a coincidence.

Three gremlins:

Confounding → A third thing nudges both X and Y. Heavy users see more prompts AND convert more.

Selection bias → Your sample isn't the population. Only engaged users see your experiment.

Interference → Users collide. Price test for drivers changes rider behavior.

Design beats modeling. If users interact, don't A/B—use switchbacks. Split by time, not random rows. Pre-commit your rules before you ship.

← Back to Causal Inference