A partially linear tree-based regression model for multivariate outcomes

Kai Yu, William Wheeler, Qizhai Li, Andrew W. Bergen, Neil Caporaso, Nilanjan Chatterjee, Jinbo Chen

Research output: Contribution to journal › Article › peer-review

10 Scopus citations

Abstract

In genetic studies of complex traits, especially behavior-related ones such as smoking and alcoholism, several phenotypic measurements are usually obtained to describe the trait, but no single measurement can fully quantify its complicated characteristics because the underlying etiology is poorly understood. If those phenotypes share a common genetic mechanism, analyzing them jointly as a multivariate trait, rather than studying each phenotype separately, can enhance the power to identify associated genes. We propose a multilocus association test for the study of multivariate traits. The test is derived from a partially linear tree-based regression model for multiple outcomes. This novel tree-based model provides a formal statistical testing framework for evaluating the association between a multivariate outcome and a set of candidate predictors, such as markers within a gene or pathway, while accommodating adjustment for other covariates. Through simulation studies, we show that the proposed method has an acceptable type I error rate and improved power over the univariate outcome analysis, which studies each component of the complex trait separately with multiple-comparison adjustment. A candidate gene association study of multiple smoking-related phenotypes is used to demonstrate the application and advantages of the new method. The proposed method is general enough to assess the joint effect of a set of risk factors on a multivariate outcome in other biomedical research settings.

Original language: English (US)
Pages (from-to): 89-96
Number of pages: 8
Journal: Biometrics
Volume: 66
Issue number: 1
State: Published - Mar 2010
Externally published: Yes

Keywords

  • Generalized estimating equation
  • Genetic association study
  • Model selection
  • Multiple-comparison adjustment
  • Tree-based model

ASJC Scopus subject areas

  • Statistics and Probability
  • General Biochemistry, Genetics and Molecular Biology
  • General Immunology and Microbiology
  • General Agricultural and Biological Sciences
  • Applied Mathematics
