Estimating treatment efficacy over time: A logistic regression model for binary longitudinal outcomes

Leena Choi; Francesca Dominici; Scott L. Zeger; Peter Ouyang

doi:10.1002/sim.2147

Estimating treatment efficacy over time: A logistic regression model for binary longitudinal outcomes

Leena Choi, Francesca Dominici, Scott L. Zeger, Peter Ouyang

Bloomberg School of Public Health

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

This paper presents a case study in longitudinal data analysis where the goal is to estimate the efficacy of a new drug for treatment of a severe chronic constipation. Data consist of long sequences of binary outcomes (relief/no relief) on each of a large number of patients randomized to treatment (low and high dose) or placebo. Data characteristics indicate: (1) the treatment effects vary non-linearly with time; (2) there is substantial heterogeneity across subjects in their responses to treatment; and (3) there is a high proportion of subjects who never experience any relief (the non-responders). To overcome these challenges, we develop a hierarchical model for binary longitudinal data with a mixture distribution on the probability of response to account for the high frequency of non-responders. While the model is specified conditionally on subject-specific latent variables, we also draw inferences on key population-average parameters for the assessment of the treatments' efficacy in a population. In addition we employ a model-checking method to compare the goodness-of-fit for our model against simpler modelling approaches for aggregated counts, such as the zero-inflated Poisson and zero-inflated negative binomial models. We estimate subject-specific and population-average rate ratios of relief for the treatment with respect to the placebo as functions of time (RR_t), and compare them with the rate ratios estimated from the models for aggregated counts. We find that: (1) the treatment is effective with respect to the placebo with higher efficacy at the beginning of the study; (2) the estimated rate ratios from the models for aggregated counts appear to be similar to the average across time of the population-average rate ratios estimated under our model; and (3) model-checking suggests that the hierarchical and zero-inflated negative binomial model fit the data best. If we are mainly interested to establish the overall efficacy (or safety) of a new drug, it is appropriate to aggregate the longitudinal data over time and analyse the count data by use of standard statistical methods. However, the models for aggregated counts cannot capture time trend of treatment such as the initial treatment benefit or the development of tolerance during the early stage of the treatment which may be important information to physicians to predict the treatment effects for their patients.

Original language	English (US)
Pages (from-to)	2789-2805
Number of pages	17
Journal	Statistics in Medicine
Volume	24
Issue number	18
DOIs	https://doi.org/10.1002/sim.2147
State	Published - Sep 30 2005

ASJC Scopus subject areas

Epidemiology
Statistics and Probability

Access to Document

10.1002/sim.2147

Cite this

@article{8fc6f8c1a3d14b89b302e6384a6d636c,

title = "Estimating treatment efficacy over time: A logistic regression model for binary longitudinal outcomes",

abstract = "This paper presents a case study in longitudinal data analysis where the goal is to estimate the efficacy of a new drug for treatment of a severe chronic constipation. Data consist of long sequences of binary outcomes (relief/no relief) on each of a large number of patients randomized to treatment (low and high dose) or placebo. Data characteristics indicate: (1) the treatment effects vary non-linearly with time; (2) there is substantial heterogeneity across subjects in their responses to treatment; and (3) there is a high proportion of subjects who never experience any relief (the non-responders). To overcome these challenges, we develop a hierarchical model for binary longitudinal data with a mixture distribution on the probability of response to account for the high frequency of non-responders. While the model is specified conditionally on subject-specific latent variables, we also draw inferences on key population-average parameters for the assessment of the treatments' efficacy in a population. In addition we employ a model-checking method to compare the goodness-of-fit for our model against simpler modelling approaches for aggregated counts, such as the zero-inflated Poisson and zero-inflated negative binomial models. We estimate subject-specific and population-average rate ratios of relief for the treatment with respect to the placebo as functions of time (RRt), and compare them with the rate ratios estimated from the models for aggregated counts. We find that: (1) the treatment is effective with respect to the placebo with higher efficacy at the beginning of the study; (2) the estimated rate ratios from the models for aggregated counts appear to be similar to the average across time of the population-average rate ratios estimated under our model; and (3) model-checking suggests that the hierarchical and zero-inflated negative binomial model fit the data best. If we are mainly interested to establish the overall efficacy (or safety) of a new drug, it is appropriate to aggregate the longitudinal data over time and analyse the count data by use of standard statistical methods. However, the models for aggregated counts cannot capture time trend of treatment such as the initial treatment benefit or the development of tolerance during the early stage of the treatment which may be important information to physicians to predict the treatment effects for their patients.",

author = "Leena Choi and Francesca Dominici and Zeger, {Scott L.} and Peter Ouyang",

year = "2005",

month = sep,

day = "30",

doi = "10.1002/sim.2147",

language = "English (US)",

volume = "24",

pages = "2789--2805",

journal = "Statistics in Medicine",

issn = "0277-6715",

publisher = "John Wiley and Sons Ltd",

number = "18",

}

TY - JOUR

T1 - Estimating treatment efficacy over time

T2 - A logistic regression model for binary longitudinal outcomes

AU - Choi, Leena

AU - Dominici, Francesca

AU - Zeger, Scott L.

AU - Ouyang, Peter

PY - 2005/9/30

Y1 - 2005/9/30

N2 - This paper presents a case study in longitudinal data analysis where the goal is to estimate the efficacy of a new drug for treatment of a severe chronic constipation. Data consist of long sequences of binary outcomes (relief/no relief) on each of a large number of patients randomized to treatment (low and high dose) or placebo. Data characteristics indicate: (1) the treatment effects vary non-linearly with time; (2) there is substantial heterogeneity across subjects in their responses to treatment; and (3) there is a high proportion of subjects who never experience any relief (the non-responders). To overcome these challenges, we develop a hierarchical model for binary longitudinal data with a mixture distribution on the probability of response to account for the high frequency of non-responders. While the model is specified conditionally on subject-specific latent variables, we also draw inferences on key population-average parameters for the assessment of the treatments' efficacy in a population. In addition we employ a model-checking method to compare the goodness-of-fit for our model against simpler modelling approaches for aggregated counts, such as the zero-inflated Poisson and zero-inflated negative binomial models. We estimate subject-specific and population-average rate ratios of relief for the treatment with respect to the placebo as functions of time (RRt), and compare them with the rate ratios estimated from the models for aggregated counts. We find that: (1) the treatment is effective with respect to the placebo with higher efficacy at the beginning of the study; (2) the estimated rate ratios from the models for aggregated counts appear to be similar to the average across time of the population-average rate ratios estimated under our model; and (3) model-checking suggests that the hierarchical and zero-inflated negative binomial model fit the data best. If we are mainly interested to establish the overall efficacy (or safety) of a new drug, it is appropriate to aggregate the longitudinal data over time and analyse the count data by use of standard statistical methods. However, the models for aggregated counts cannot capture time trend of treatment such as the initial treatment benefit or the development of tolerance during the early stage of the treatment which may be important information to physicians to predict the treatment effects for their patients.

AB - This paper presents a case study in longitudinal data analysis where the goal is to estimate the efficacy of a new drug for treatment of a severe chronic constipation. Data consist of long sequences of binary outcomes (relief/no relief) on each of a large number of patients randomized to treatment (low and high dose) or placebo. Data characteristics indicate: (1) the treatment effects vary non-linearly with time; (2) there is substantial heterogeneity across subjects in their responses to treatment; and (3) there is a high proportion of subjects who never experience any relief (the non-responders). To overcome these challenges, we develop a hierarchical model for binary longitudinal data with a mixture distribution on the probability of response to account for the high frequency of non-responders. While the model is specified conditionally on subject-specific latent variables, we also draw inferences on key population-average parameters for the assessment of the treatments' efficacy in a population. In addition we employ a model-checking method to compare the goodness-of-fit for our model against simpler modelling approaches for aggregated counts, such as the zero-inflated Poisson and zero-inflated negative binomial models. We estimate subject-specific and population-average rate ratios of relief for the treatment with respect to the placebo as functions of time (RRt), and compare them with the rate ratios estimated from the models for aggregated counts. We find that: (1) the treatment is effective with respect to the placebo with higher efficacy at the beginning of the study; (2) the estimated rate ratios from the models for aggregated counts appear to be similar to the average across time of the population-average rate ratios estimated under our model; and (3) model-checking suggests that the hierarchical and zero-inflated negative binomial model fit the data best. If we are mainly interested to establish the overall efficacy (or safety) of a new drug, it is appropriate to aggregate the longitudinal data over time and analyse the count data by use of standard statistical methods. However, the models for aggregated counts cannot capture time trend of treatment such as the initial treatment benefit or the development of tolerance during the early stage of the treatment which may be important information to physicians to predict the treatment effects for their patients.

UR - http://www.scopus.com/inward/record.url?scp=25444472592&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=25444472592&partnerID=8YFLogxK

U2 - 10.1002/sim.2147

DO - 10.1002/sim.2147

M3 - Article

C2 - 16134133

AN - SCOPUS:25444472592

SN - 0277-6715

VL - 24

SP - 2789

EP - 2805

JO - Statistics in Medicine

JF - Statistics in Medicine

IS - 18

ER -

Estimating treatment efficacy over time: A logistic regression model for binary longitudinal outcomes

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this