Abstract
Background: The fundamental problem of causal inference is one of missing data, and specifically of missing potential outcomes: if potential outcomes were fully observed, then causal inference could be made trivially. Though often not discussed explicitly in the epidemiological literature, the connections between causal inference and missing data can provide additional intuition. Methods: We demonstrate how we can approach causal inference in ways similar to how we address all problems of missing data, using multiple imputation and the parametric g-formula. Results: We explain and demonstrate the use of these methods in example data, and discuss implications for more traditional approaches to causal inference. Conclusions: Though there are advantages and disadvantages to both multiple imputation and g-formula approaches, epidemiologists can benefit from thinking about their causal inference problems as problems of missing data, as such perspectives may lend new and clarifying insights to their analyses.
Original language | English (US) |
---|---|
Article number | dyv135 |
Pages (from-to) | 1731-1737 |
Number of pages | 7 |
Journal | International journal of epidemiology |
Volume | 44 |
Issue number | 5 |
DOIs | |
State | Published - Oct 1 2015 |
Externally published | Yes |
Keywords
- Causal inference
- g-formula
- multiple imputation
- potential outcomes
ASJC Scopus subject areas
- Epidemiology