A Bayesian shrinkage model for incomplete longitudinal binary data with application to the Breast Cancer Prevention Trial

C. Wang, M. J. Daniels, D. O. Scharfstein, S. Land

Research output: Contribution to journal (Article)


We consider inference in randomized longitudinal studies with missing data generated by skipped clinic visits and loss to follow-up. In this setting, it is well known that full data estimands are not identified unless unverified assumptions are imposed. We assume a nonfuture dependence model for the drop-out mechanism and partial ignorability for the intermittent missingness. We posit an exponential tilt model that links nonidentifiable distributions and distributions identified under partial ignorability. This exponential tilt model is indexed by nonidentified parameters, which are assumed to have an informative prior distribution, elicited from subject-matter experts. Under this model, full data estimands are shown to be expressed as functionals of the distribution of the observed data. To avoid the curse of dimensionality, we model the distribution of the observed data using a Bayesian shrinkage model. In a simulation study, we compare our approach to a fully parametric and a fully saturated model for the distribution of the observed data. Our methodology is motivated by, and applied to, data from the Breast Cancer Prevention Trial.
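
As an illustration of the exponential tilt idea mentioned in the abstract, the sketch below shows the generic textbook form of tilting a binary outcome distribution. It is not taken from the article: the function names `tilt_bernoulli` and `logit`, and the sensitivity parameter `alpha`, are hypothetical and chosen only for this example. Assuming an identified success probability `p`, tilting by `exp(alpha * y)` yields a nonidentified probability whose log-odds differ from those of `p` by exactly `alpha`, so `alpha = 0` recovers the identified distribution.

```python
import math


def logit(p):
    """Log-odds of a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))


def tilt_bernoulli(p, alpha):
    """Exponentially tilt a Bernoulli(p) distribution by exp(alpha * y).

    p     : success probability of the identified distribution (assumed known here)
    alpha : hypothetical sensitivity parameter; it cannot be estimated from
            observed data and would be supplied by a prior in a Bayesian analysis
    Returns the success probability of the tilted distribution.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")
    weight_one = p * math.exp(alpha)   # unnormalized mass at y = 1
    weight_zero = 1.0 - p              # unnormalized mass at y = 0
    return weight_one / (weight_one + weight_zero)


if __name__ == "__main__":
    # Show how the tilted probability moves as alpha varies around zero.
    for alpha in (-1.0, 0.0, 1.0):
        print(alpha, tilt_bernoulli(0.3, alpha))
```

In a sensitivity analysis of this general kind, one would typically draw `alpha` from an elicited prior and propagate each draw through the tilt, rather than fixing a single value.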

Original language: English (US)
Pages (from-to): 1333-1346
Number of pages: 14
Journal: Journal of the American Statistical Association
Issue number: 492
State: Published - Dec 1 2010



  • Informative drop-out
  • Intermittent missingness
  • Prior elicitation

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty
