Missing information principle: A unified approach for general truncated and censored survival data problems

Yifei Sun, Jing Qin, Chiung Yu Huang

Research output: Contribution to journalArticle

Abstract

It is well known that truncated survival data are subject to sampling bias, where the sampling weight depends on the underlying truncation time distribution. Recently, there has been a rising interest in developing methods to better exploit the information about the truncation time, thus the sampling weight function, to obtain more efficient estimation. In this paper, we propose to treat truncation and censoring as "missing data mechanism" and apply the missing information principle to develop a unified framework for analyzing left-truncated and right-censored data with unspecified or known truncation time distributions. Our framework is structured in a way that is easy to understand and enjoys a great flexibility for handling different types of models. Moreover, a new test for checking the independence between the underlying truncation time and survival time is derived along the same line. The proposed hypothesis testing procedure utilizes all observed data and hence can yield a much higher power than the conditional Kendall's tau test that only involves comparable pairs of observations under truncation. Simulation studies with practical sample sizes are conducted to compare the performance of the proposed method with its competitors. The proposed methodologies are applied to a dementia study and a nursing house study for illustration.

Original languageEnglish (US)
Pages (from-to)261-276
Number of pages16
JournalStatistical Science
Volume33
Issue number2
DOIs
StatePublished - May 1 2018
Externally publishedYes

Fingerprint

Censored Survival Data
Truncation
Missing Data Mechanism
Kendall's tau
Truncated Data
Dementia
Right-censored Data
Nursing
Efficient Estimation
Survival Data
Survival Time
Censoring
Hypothesis Testing
Weight Function
High Power
Missing information
Sample Size
Flexibility
Simulation Study
Methodology

Keywords

  • Inverse probability weighted estimator
  • Kendall's tau
  • Outcome-dependent sampling
  • Prevalent sampling
  • Self-consistency algorithm

ASJC Scopus subject areas

  • Statistics and Probability
  • Mathematics(all)
  • Statistics, Probability and Uncertainty

Cite this

Missing information principle : A unified approach for general truncated and censored survival data problems. / Sun, Yifei; Qin, Jing; Huang, Chiung Yu.

In: Statistical Science, Vol. 33, No. 2, 01.05.2018, p. 261-276.

Research output: Contribution to journalArticle

Sun, Yifei ; Qin, Jing ; Huang, Chiung Yu. / Missing information principle : A unified approach for general truncated and censored survival data problems. In: Statistical Science. 2018 ; Vol. 33, No. 2. pp. 261-276.
@article{d2abad9047134a8cb9b7ee178f0be993,
title = "Missing information principle: A unified approach for general truncated and censored survival data problems",
abstract = "It is well known that truncated survival data are subject to sampling bias, where the sampling weight depends on the underlying truncation time distribution. Recently, there has been a rising interest in developing methods to better exploit the information about the truncation time, thus the sampling weight function, to obtain more efficient estimation. In this paper, we propose to treat truncation and censoring as {"}missing data mechanism{"} and apply the missing information principle to develop a unified framework for analyzing left-truncated and right-censored data with unspecified or known truncation time distributions. Our framework is structured in a way that is easy to understand and enjoys a great flexibility for handling different types of models. Moreover, a new test for checking the independence between the underlying truncation time and survival time is derived along the same line. The proposed hypothesis testing procedure utilizes all observed data and hence can yield a much higher power than the conditional Kendall's tau test that only involves comparable pairs of observations under truncation. Simulation studies with practical sample sizes are conducted to compare the performance of the proposed method with its competitors. The proposed methodologies are applied to a dementia study and a nursing house study for illustration.",
keywords = "Inverse probability weighted estimator, Kendall's tau, Outcome-dependent sampling, Prevalent sampling, Self-consistency algorithm",
author = "Yifei Sun and Jing Qin and Huang, {Chiung Yu}",
year = "2018",
month = "5",
day = "1",
doi = "10.1214/17-STS638",
language = "English (US)",
volume = "33",
pages = "261--276",
journal = "Statistical Science",
issn = "0883-4237",
publisher = "Institute of Mathematical Statistics",
number = "2",

}

TY - JOUR

T1 - Missing information principle

T2 - A unified approach for general truncated and censored survival data problems

AU - Sun, Yifei

AU - Qin, Jing

AU - Huang, Chiung Yu

PY - 2018/5/1

Y1 - 2018/5/1

N2 - It is well known that truncated survival data are subject to sampling bias, where the sampling weight depends on the underlying truncation time distribution. Recently, there has been a rising interest in developing methods to better exploit the information about the truncation time, thus the sampling weight function, to obtain more efficient estimation. In this paper, we propose to treat truncation and censoring as "missing data mechanism" and apply the missing information principle to develop a unified framework for analyzing left-truncated and right-censored data with unspecified or known truncation time distributions. Our framework is structured in a way that is easy to understand and enjoys a great flexibility for handling different types of models. Moreover, a new test for checking the independence between the underlying truncation time and survival time is derived along the same line. The proposed hypothesis testing procedure utilizes all observed data and hence can yield a much higher power than the conditional Kendall's tau test that only involves comparable pairs of observations under truncation. Simulation studies with practical sample sizes are conducted to compare the performance of the proposed method with its competitors. The proposed methodologies are applied to a dementia study and a nursing house study for illustration.

AB - It is well known that truncated survival data are subject to sampling bias, where the sampling weight depends on the underlying truncation time distribution. Recently, there has been a rising interest in developing methods to better exploit the information about the truncation time, thus the sampling weight function, to obtain more efficient estimation. In this paper, we propose to treat truncation and censoring as "missing data mechanism" and apply the missing information principle to develop a unified framework for analyzing left-truncated and right-censored data with unspecified or known truncation time distributions. Our framework is structured in a way that is easy to understand and enjoys a great flexibility for handling different types of models. Moreover, a new test for checking the independence between the underlying truncation time and survival time is derived along the same line. The proposed hypothesis testing procedure utilizes all observed data and hence can yield a much higher power than the conditional Kendall's tau test that only involves comparable pairs of observations under truncation. Simulation studies with practical sample sizes are conducted to compare the performance of the proposed method with its competitors. The proposed methodologies are applied to a dementia study and a nursing house study for illustration.

KW - Inverse probability weighted estimator

KW - Kendall's tau

KW - Outcome-dependent sampling

KW - Prevalent sampling

KW - Self-consistency algorithm

UR - http://www.scopus.com/inward/record.url?scp=85046830925&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85046830925&partnerID=8YFLogxK

U2 - 10.1214/17-STS638

DO - 10.1214/17-STS638

M3 - Article

C2 - 31327890

AN - SCOPUS:85046830925

VL - 33

SP - 261

EP - 276

JO - Statistical Science

JF - Statistical Science

SN - 0883-4237

IS - 2

ER -