TY - JOUR
T1 - Composite partial likelihood estimation under length-biased sampling, with application to a prevalent cohort study of Dementia
AU - Huang, Chiung Yu
AU - Qin, Jing
N1 - Funding Information:
Chiung-Yu Huang is Mathematical Statistician, Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892 (E-mail: huangchi@niaid.nih.gov). Jing Qin is Mathematical Statistician, Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892 (E-mail: jingqin@niaid.nih.gov). The authors thank Professors Ian McDowell, Masoud Asgharian, and Christina Wolfson for kindly sharing the Canadian Study of Health and Aging data. The core study was funded by the National Health Research and Development Program (NHRDP) of Health Canada Project 6606-3954-MC(S). Additional funding was provided by Pfizer Canada Incorporated through the Medical Research Council/Pharmaceutical Manufacturers Association of Canada Health Activity Program, NHRDP Project 6603-1417-302(R), Bayer Incorporated, and the British Columbia Health Research Foundation Projects 38 (93-2) and 34 (96-1). The authors also thank the Associate Editor, the referee, Dr Dean Follmann, and Dr Michael Proschan for their comments that improved the presentation of this article.
PY - 2012
Y1 - 2012
N2 - The Canadian Study of Health and Aging (CSHA) employed a prevalent cohort design to study survival after onset of dementia, where patients with dementia were sampled and the onset time of dementia was determined retrospectively. The prevalent cohort sampling scheme favors individuals who survive longer. Thus, the observed survival times are subject to length bias. In recent years, there has been a rising interest in developing estimation procedures for prevalent cohort survival data that not only ac count for length bias but also actually exploit the incidence distribution of the disease to improve efficiency. This article considers semi parametric estimation of the Cox model for the time from dementia onset to death under a stationary assumption with respect to the disease incidence. Under the stationery condition, the semi parametric maximum likelihood estimation is expected to be fully efficient yet difficult to perform for statistical practitioners, as the likelihood depends on the baseline hazard function in a complicated way. Moreover, the asymptotic properties of the semi parametric maximum likelihood estimator are not well-studied. Motivated by the composite likelihood method (Besag 1974), we develop a composite partial likelihood method that retains the simplicity of the popular partial likelihood estimator and can be easily performed using standard statistical software. When applied to the CSHA data, the proposed method estimates a significant difference in survival between the vascular dementia group and the possible Alzheimer's disease group, while the partial likelihood method for left-truncated and right-censored data yields a greater standard error and a 95% confidence interval covering 0, thus highlighting the practical value of employing a more efficient methodology. To check the assumption of stable disease for the CSHA data, we also present new graphical and numerical tests in the article. The R code used to obtain the maximum composite partial likelihood estimator for the CSHA data is available in the online Supplementary Material, posted on the journal web site.
AB - The Canadian Study of Health and Aging (CSHA) employed a prevalent cohort design to study survival after onset of dementia, where patients with dementia were sampled and the onset time of dementia was determined retrospectively. The prevalent cohort sampling scheme favors individuals who survive longer. Thus, the observed survival times are subject to length bias. In recent years, there has been a rising interest in developing estimation procedures for prevalent cohort survival data that not only ac count for length bias but also actually exploit the incidence distribution of the disease to improve efficiency. This article considers semi parametric estimation of the Cox model for the time from dementia onset to death under a stationary assumption with respect to the disease incidence. Under the stationery condition, the semi parametric maximum likelihood estimation is expected to be fully efficient yet difficult to perform for statistical practitioners, as the likelihood depends on the baseline hazard function in a complicated way. Moreover, the asymptotic properties of the semi parametric maximum likelihood estimator are not well-studied. Motivated by the composite likelihood method (Besag 1974), we develop a composite partial likelihood method that retains the simplicity of the popular partial likelihood estimator and can be easily performed using standard statistical software. When applied to the CSHA data, the proposed method estimates a significant difference in survival between the vascular dementia group and the possible Alzheimer's disease group, while the partial likelihood method for left-truncated and right-censored data yields a greater standard error and a 95% confidence interval covering 0, thus highlighting the practical value of employing a more efficient methodology. To check the assumption of stable disease for the CSHA data, we also present new graphical and numerical tests in the article. The R code used to obtain the maximum composite partial likelihood estimator for the CSHA data is available in the online Supplementary Material, posted on the journal web site.
KW - Backward and forward recurrence time
KW - Cross-sectional sampling
KW - Random truncation
KW - Renewal processes
UR - http://www.scopus.com/inward/record.url?scp=84870696343&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84870696343&partnerID=8YFLogxK
U2 - 10.1080/01621459.2012.682544
DO - 10.1080/01621459.2012.682544
M3 - Article
C2 - 24000265
AN - SCOPUS:84870696343
SN - 0162-1459
VL - 107
SP - 946
EP - 957
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 499
ER -