Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling.

Gang Zheng, Colin O. Wu, Minjung Kwak, Wenhua Jiang, Jungnam Joo, Joao Lima

Research output: Contribution to journalArticle

Abstract

We study the analysis of a joint association between a genetic marker with both binary (case-control) and quantitative (continuous) traits, where the quantitative trait values are only available for the cases due to data sharing and outcome-dependent sampling. Data sharing becomes common in genetic association studies, and the outcome-dependent sampling is the consequence of data sharing, under which a phenotype of interest is not measured for some subgroup. The trend test (or Pearson's test) and F-test are often, respectively, used to analyze the binary and quantitative traits. Because of the outcome-dependent sampling, the usual F-test can be applied using the subgroup with the observed quantitative traits. We propose a modified F-test by also incorporating the genotype frequencies of the subgroup whose traits are not observed. Further, a combination of this modified F-test and Pearson's test is proposed by Fisher's combination of their P-values as a joint analysis. Because of the correlation of the two analyses, we propose to use a Gamma (scaled chi-squared) distribution to fit the asymptotic null distribution for the joint analysis. The proposed modified F-test and the joint analysis can also be applied to test single trait association (either binary or quantitative trait). Through simulations, we identify the situations under which the proposed tests are more powerful than the existing ones. Application to a real dataset of rheumatoid arthritis is presented.

Original languageEnglish (US)
Pages (from-to)263-273
Number of pages11
JournalGenetic Epidemiology
Volume36
Issue number3
DOIs
StatePublished - 2012
Externally publishedYes

Fingerprint

Information Dissemination
Genetic Association Studies
Genetic Markers
Rheumatoid Arthritis
Genotype
Phenotype

ASJC Scopus subject areas

  • Epidemiology
  • Genetics(clinical)

Cite this

Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling. / Zheng, Gang; Wu, Colin O.; Kwak, Minjung; Jiang, Wenhua; Joo, Jungnam; Lima, Joao.

In: Genetic Epidemiology, Vol. 36, No. 3, 2012, p. 263-273.

Research output: Contribution to journalArticle

Zheng, Gang ; Wu, Colin O. ; Kwak, Minjung ; Jiang, Wenhua ; Joo, Jungnam ; Lima, Joao. / Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling. In: Genetic Epidemiology. 2012 ; Vol. 36, No. 3. pp. 263-273.
@article{048ee68c5c2f4e379471e4799f236019,
title = "Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling.",
abstract = "We study the analysis of a joint association between a genetic marker with both binary (case-control) and quantitative (continuous) traits, where the quantitative trait values are only available for the cases due to data sharing and outcome-dependent sampling. Data sharing becomes common in genetic association studies, and the outcome-dependent sampling is the consequence of data sharing, under which a phenotype of interest is not measured for some subgroup. The trend test (or Pearson's test) and F-test are often, respectively, used to analyze the binary and quantitative traits. Because of the outcome-dependent sampling, the usual F-test can be applied using the subgroup with the observed quantitative traits. We propose a modified F-test by also incorporating the genotype frequencies of the subgroup whose traits are not observed. Further, a combination of this modified F-test and Pearson's test is proposed by Fisher's combination of their P-values as a joint analysis. Because of the correlation of the two analyses, we propose to use a Gamma (scaled chi-squared) distribution to fit the asymptotic null distribution for the joint analysis. The proposed modified F-test and the joint analysis can also be applied to test single trait association (either binary or quantitative trait). Through simulations, we identify the situations under which the proposed tests are more powerful than the existing ones. Application to a real dataset of rheumatoid arthritis is presented.",
author = "Gang Zheng and Wu, {Colin O.} and Minjung Kwak and Wenhua Jiang and Jungnam Joo and Joao Lima",
year = "2012",
doi = "10.1002/gepi.21619",
language = "English (US)",
volume = "36",
pages = "263--273",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "Wiley-Liss Inc.",
number = "3",

}

TY - JOUR

T1 - Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling.

AU - Zheng, Gang

AU - Wu, Colin O.

AU - Kwak, Minjung

AU - Jiang, Wenhua

AU - Joo, Jungnam

AU - Lima, Joao

PY - 2012

Y1 - 2012

N2 - We study the analysis of a joint association between a genetic marker with both binary (case-control) and quantitative (continuous) traits, where the quantitative trait values are only available for the cases due to data sharing and outcome-dependent sampling. Data sharing becomes common in genetic association studies, and the outcome-dependent sampling is the consequence of data sharing, under which a phenotype of interest is not measured for some subgroup. The trend test (or Pearson's test) and F-test are often, respectively, used to analyze the binary and quantitative traits. Because of the outcome-dependent sampling, the usual F-test can be applied using the subgroup with the observed quantitative traits. We propose a modified F-test by also incorporating the genotype frequencies of the subgroup whose traits are not observed. Further, a combination of this modified F-test and Pearson's test is proposed by Fisher's combination of their P-values as a joint analysis. Because of the correlation of the two analyses, we propose to use a Gamma (scaled chi-squared) distribution to fit the asymptotic null distribution for the joint analysis. The proposed modified F-test and the joint analysis can also be applied to test single trait association (either binary or quantitative trait). Through simulations, we identify the situations under which the proposed tests are more powerful than the existing ones. Application to a real dataset of rheumatoid arthritis is presented.

AB - We study the analysis of a joint association between a genetic marker with both binary (case-control) and quantitative (continuous) traits, where the quantitative trait values are only available for the cases due to data sharing and outcome-dependent sampling. Data sharing becomes common in genetic association studies, and the outcome-dependent sampling is the consequence of data sharing, under which a phenotype of interest is not measured for some subgroup. The trend test (or Pearson's test) and F-test are often, respectively, used to analyze the binary and quantitative traits. Because of the outcome-dependent sampling, the usual F-test can be applied using the subgroup with the observed quantitative traits. We propose a modified F-test by also incorporating the genotype frequencies of the subgroup whose traits are not observed. Further, a combination of this modified F-test and Pearson's test is proposed by Fisher's combination of their P-values as a joint analysis. Because of the correlation of the two analyses, we propose to use a Gamma (scaled chi-squared) distribution to fit the asymptotic null distribution for the joint analysis. The proposed modified F-test and the joint analysis can also be applied to test single trait association (either binary or quantitative trait). Through simulations, we identify the situations under which the proposed tests are more powerful than the existing ones. Application to a real dataset of rheumatoid arthritis is presented.

UR - http://www.scopus.com/inward/record.url?scp=85028097934&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85028097934&partnerID=8YFLogxK

U2 - 10.1002/gepi.21619

DO - 10.1002/gepi.21619

M3 - Article

C2 - 22460626

AN - SCOPUS:85028097934

VL - 36

SP - 263

EP - 273

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

IS - 3

ER -