Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling.

Gang Zheng; Colin O. Wu; Minjung Kwak; Wenhua Jiang; Jungnam Joo; Joao A.C. Lima

doi:10.1002/gepi.21619

Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling.

Gang Zheng, Colin O. Wu, Minjung Kwak, Wenhua Jiang, Jungnam Joo, Joao A.C. Lima

School of Medicine

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

We study the analysis of a joint association between a genetic marker with both binary (case-control) and quantitative (continuous) traits, where the quantitative trait values are only available for the cases due to data sharing and outcome-dependent sampling. Data sharing becomes common in genetic association studies, and the outcome-dependent sampling is the consequence of data sharing, under which a phenotype of interest is not measured for some subgroup. The trend test (or Pearson's test) and F-test are often, respectively, used to analyze the binary and quantitative traits. Because of the outcome-dependent sampling, the usual F-test can be applied using the subgroup with the observed quantitative traits. We propose a modified F-test by also incorporating the genotype frequencies of the subgroup whose traits are not observed. Further, a combination of this modified F-test and Pearson's test is proposed by Fisher's combination of their P-values as a joint analysis. Because of the correlation of the two analyses, we propose to use a Gamma (scaled chi-squared) distribution to fit the asymptotic null distribution for the joint analysis. The proposed modified F-test and the joint analysis can also be applied to test single trait association (either binary or quantitative trait). Through simulations, we identify the situations under which the proposed tests are more powerful than the existing ones. Application to a real dataset of rheumatoid arthritis is presented.

Original language	English (US)
Pages (from-to)	263-273
Number of pages	11
Journal	Genetic epidemiology
Volume	36
Issue number	3
DOIs	https://doi.org/10.1002/gepi.21619
State	Published - Apr 2012

ASJC Scopus subject areas

Epidemiology
Genetics(clinical)

Access to Document

10.1002/gepi.21619

Cite this

@article{048ee68c5c2f4e379471e4799f236019,

title = "Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling.",

abstract = "We study the analysis of a joint association between a genetic marker with both binary (case-control) and quantitative (continuous) traits, where the quantitative trait values are only available for the cases due to data sharing and outcome-dependent sampling. Data sharing becomes common in genetic association studies, and the outcome-dependent sampling is the consequence of data sharing, under which a phenotype of interest is not measured for some subgroup. The trend test (or Pearson's test) and F-test are often, respectively, used to analyze the binary and quantitative traits. Because of the outcome-dependent sampling, the usual F-test can be applied using the subgroup with the observed quantitative traits. We propose a modified F-test by also incorporating the genotype frequencies of the subgroup whose traits are not observed. Further, a combination of this modified F-test and Pearson's test is proposed by Fisher's combination of their P-values as a joint analysis. Because of the correlation of the two analyses, we propose to use a Gamma (scaled chi-squared) distribution to fit the asymptotic null distribution for the joint analysis. The proposed modified F-test and the joint analysis can also be applied to test single trait association (either binary or quantitative trait). Through simulations, we identify the situations under which the proposed tests are more powerful than the existing ones. Application to a real dataset of rheumatoid arthritis is presented.",

author = "Gang Zheng and Wu, {Colin O.} and Minjung Kwak and Wenhua Jiang and Jungnam Joo and Lima, {Joao A.C.}",

year = "2012",

month = apr,

doi = "10.1002/gepi.21619",

language = "English (US)",

volume = "36",

pages = "263--273",

journal = "Genetic epidemiology",

issn = "0741-0395",

publisher = "Wiley-Liss Inc.",

number = "3",

}

TY - JOUR

T1 - Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling.

AU - Zheng, Gang

AU - Wu, Colin O.

AU - Kwak, Minjung

AU - Jiang, Wenhua

AU - Joo, Jungnam

AU - Lima, Joao A.C.

PY - 2012/4

Y1 - 2012/4

N2 - We study the analysis of a joint association between a genetic marker with both binary (case-control) and quantitative (continuous) traits, where the quantitative trait values are only available for the cases due to data sharing and outcome-dependent sampling. Data sharing becomes common in genetic association studies, and the outcome-dependent sampling is the consequence of data sharing, under which a phenotype of interest is not measured for some subgroup. The trend test (or Pearson's test) and F-test are often, respectively, used to analyze the binary and quantitative traits. Because of the outcome-dependent sampling, the usual F-test can be applied using the subgroup with the observed quantitative traits. We propose a modified F-test by also incorporating the genotype frequencies of the subgroup whose traits are not observed. Further, a combination of this modified F-test and Pearson's test is proposed by Fisher's combination of their P-values as a joint analysis. Because of the correlation of the two analyses, we propose to use a Gamma (scaled chi-squared) distribution to fit the asymptotic null distribution for the joint analysis. The proposed modified F-test and the joint analysis can also be applied to test single trait association (either binary or quantitative trait). Through simulations, we identify the situations under which the proposed tests are more powerful than the existing ones. Application to a real dataset of rheumatoid arthritis is presented.

AB - We study the analysis of a joint association between a genetic marker with both binary (case-control) and quantitative (continuous) traits, where the quantitative trait values are only available for the cases due to data sharing and outcome-dependent sampling. Data sharing becomes common in genetic association studies, and the outcome-dependent sampling is the consequence of data sharing, under which a phenotype of interest is not measured for some subgroup. The trend test (or Pearson's test) and F-test are often, respectively, used to analyze the binary and quantitative traits. Because of the outcome-dependent sampling, the usual F-test can be applied using the subgroup with the observed quantitative traits. We propose a modified F-test by also incorporating the genotype frequencies of the subgroup whose traits are not observed. Further, a combination of this modified F-test and Pearson's test is proposed by Fisher's combination of their P-values as a joint analysis. Because of the correlation of the two analyses, we propose to use a Gamma (scaled chi-squared) distribution to fit the asymptotic null distribution for the joint analysis. The proposed modified F-test and the joint analysis can also be applied to test single trait association (either binary or quantitative trait). Through simulations, we identify the situations under which the proposed tests are more powerful than the existing ones. Application to a real dataset of rheumatoid arthritis is presented.

UR - http://www.scopus.com/inward/record.url?scp=85028097934&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85028097934&partnerID=8YFLogxK

U2 - 10.1002/gepi.21619

DO - 10.1002/gepi.21619

M3 - Article

C2 - 22460626

AN - SCOPUS:85028097934

SN - 0741-0395

VL - 36

SP - 263

EP - 273

JO - Genetic epidemiology

JF - Genetic epidemiology

IS - 3

ER -

Joint analysis of binary and quantitative traits with data sharing and outcome-dependent sampling.

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this