Gene-based segregation method for identifying rare variants in family-based sequencing studies

Dandi Qiao, Christoph Lange, Nan M. Laird, Sungho Won, Craig P. Hersh, Jarrett Morrow, Brian D. Hobbs, Sharon M. Lutz, Ingo Ruczinski, Terri L Beaty, Edwin K. Silverman, Michael H. Cho

Research output: Contribution to journalArticle

Abstract

Whole-exome sequencing using family data has identified rare coding variants in Mendelian diseases or complex diseases with Mendelian subtypes, using filters based on variant novelty, functionality, and segregation with the phenotype within families. However, formal statistical approaches are limited. We propose a gene-based segregation test (GESE) that quantifies the uncertainty of the filtering approach. It is constructed using the probability of segregation events under the null hypothesis of Mendelian transmission. This test takes into account different degrees of relatedness in families, the number of functional rare variants in the gene, and their minor allele frequencies in the corresponding population. In addition, a weighted version of this test allows incorporating additional subject phenotypes to improve statistical power. We show via simulations that the GESE and weighted GESE tests maintain appropriate type I error rate, and have greater power than several commonly used region-based methods. We apply our method to whole-exome sequencing data from 49 extended pedigrees with severe, early-onset chronic obstructive pulmonary disease (COPD) in the Boston Early-Onset COPD study (BEOCOPD) and identify several promising candidate genes. Our proposed methods show great potential for identifying rare coding variants of large effect and high penetrance for family-based sequencing data. The proposed tests are implemented in an R package that is available on CRAN (https://cran.r-project.org/web/packages/GESE/).

Original languageEnglish (US)
Pages (from-to)309-319
Number of pages11
JournalGenetic Epidemiology
Volume41
Issue number4
DOIs
StatePublished - May 1 2017

Fingerprint

Genes
Exome
Phenotype
Penetrance
Pedigree
Gene Frequency
Chronic Obstructive Pulmonary Disease
Uncertainty
Population
Pulmonary Disease, Chronic Obstructive, Severe Early-Onset

Keywords

  • extended families
  • filtering approach
  • Mendelian disease
  • whole exome sequencing

ASJC Scopus subject areas

  • Epidemiology
  • Genetics(clinical)

Cite this

Qiao, D., Lange, C., Laird, N. M., Won, S., Hersh, C. P., Morrow, J., ... Cho, M. H. (2017). Gene-based segregation method for identifying rare variants in family-based sequencing studies. Genetic Epidemiology, 41(4), 309-319. https://doi.org/10.1002/gepi.22037

Gene-based segregation method for identifying rare variants in family-based sequencing studies. / Qiao, Dandi; Lange, Christoph; Laird, Nan M.; Won, Sungho; Hersh, Craig P.; Morrow, Jarrett; Hobbs, Brian D.; Lutz, Sharon M.; Ruczinski, Ingo; Beaty, Terri L; Silverman, Edwin K.; Cho, Michael H.

In: Genetic Epidemiology, Vol. 41, No. 4, 01.05.2017, p. 309-319.

Research output: Contribution to journalArticle

Qiao, D, Lange, C, Laird, NM, Won, S, Hersh, CP, Morrow, J, Hobbs, BD, Lutz, SM, Ruczinski, I, Beaty, TL, Silverman, EK & Cho, MH 2017, 'Gene-based segregation method for identifying rare variants in family-based sequencing studies', Genetic Epidemiology, vol. 41, no. 4, pp. 309-319. https://doi.org/10.1002/gepi.22037
Qiao, Dandi ; Lange, Christoph ; Laird, Nan M. ; Won, Sungho ; Hersh, Craig P. ; Morrow, Jarrett ; Hobbs, Brian D. ; Lutz, Sharon M. ; Ruczinski, Ingo ; Beaty, Terri L ; Silverman, Edwin K. ; Cho, Michael H. / Gene-based segregation method for identifying rare variants in family-based sequencing studies. In: Genetic Epidemiology. 2017 ; Vol. 41, No. 4. pp. 309-319.
@article{1e91c0b4099944718b2f613f2a94759f,
title = "Gene-based segregation method for identifying rare variants in family-based sequencing studies",
abstract = "Whole-exome sequencing using family data has identified rare coding variants in Mendelian diseases or complex diseases with Mendelian subtypes, using filters based on variant novelty, functionality, and segregation with the phenotype within families. However, formal statistical approaches are limited. We propose a gene-based segregation test (GESE) that quantifies the uncertainty of the filtering approach. It is constructed using the probability of segregation events under the null hypothesis of Mendelian transmission. This test takes into account different degrees of relatedness in families, the number of functional rare variants in the gene, and their minor allele frequencies in the corresponding population. In addition, a weighted version of this test allows incorporating additional subject phenotypes to improve statistical power. We show via simulations that the GESE and weighted GESE tests maintain appropriate type I error rate, and have greater power than several commonly used region-based methods. We apply our method to whole-exome sequencing data from 49 extended pedigrees with severe, early-onset chronic obstructive pulmonary disease (COPD) in the Boston Early-Onset COPD study (BEOCOPD) and identify several promising candidate genes. Our proposed methods show great potential for identifying rare coding variants of large effect and high penetrance for family-based sequencing data. The proposed tests are implemented in an R package that is available on CRAN (https://cran.r-project.org/web/packages/GESE/).",
keywords = "extended families, filtering approach, Mendelian disease, whole exome sequencing",
author = "Dandi Qiao and Christoph Lange and Laird, {Nan M.} and Sungho Won and Hersh, {Craig P.} and Jarrett Morrow and Hobbs, {Brian D.} and Lutz, {Sharon M.} and Ingo Ruczinski and Beaty, {Terri L} and Silverman, {Edwin K.} and Cho, {Michael H.}",
year = "2017",
month = "5",
day = "1",
doi = "10.1002/gepi.22037",
language = "English (US)",
volume = "41",
pages = "309--319",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "Wiley-Liss Inc.",
number = "4",

}

TY - JOUR

T1 - Gene-based segregation method for identifying rare variants in family-based sequencing studies

AU - Qiao, Dandi

AU - Lange, Christoph

AU - Laird, Nan M.

AU - Won, Sungho

AU - Hersh, Craig P.

AU - Morrow, Jarrett

AU - Hobbs, Brian D.

AU - Lutz, Sharon M.

AU - Ruczinski, Ingo

AU - Beaty, Terri L

AU - Silverman, Edwin K.

AU - Cho, Michael H.

PY - 2017/5/1

Y1 - 2017/5/1

N2 - Whole-exome sequencing using family data has identified rare coding variants in Mendelian diseases or complex diseases with Mendelian subtypes, using filters based on variant novelty, functionality, and segregation with the phenotype within families. However, formal statistical approaches are limited. We propose a gene-based segregation test (GESE) that quantifies the uncertainty of the filtering approach. It is constructed using the probability of segregation events under the null hypothesis of Mendelian transmission. This test takes into account different degrees of relatedness in families, the number of functional rare variants in the gene, and their minor allele frequencies in the corresponding population. In addition, a weighted version of this test allows incorporating additional subject phenotypes to improve statistical power. We show via simulations that the GESE and weighted GESE tests maintain appropriate type I error rate, and have greater power than several commonly used region-based methods. We apply our method to whole-exome sequencing data from 49 extended pedigrees with severe, early-onset chronic obstructive pulmonary disease (COPD) in the Boston Early-Onset COPD study (BEOCOPD) and identify several promising candidate genes. Our proposed methods show great potential for identifying rare coding variants of large effect and high penetrance for family-based sequencing data. The proposed tests are implemented in an R package that is available on CRAN (https://cran.r-project.org/web/packages/GESE/).

AB - Whole-exome sequencing using family data has identified rare coding variants in Mendelian diseases or complex diseases with Mendelian subtypes, using filters based on variant novelty, functionality, and segregation with the phenotype within families. However, formal statistical approaches are limited. We propose a gene-based segregation test (GESE) that quantifies the uncertainty of the filtering approach. It is constructed using the probability of segregation events under the null hypothesis of Mendelian transmission. This test takes into account different degrees of relatedness in families, the number of functional rare variants in the gene, and their minor allele frequencies in the corresponding population. In addition, a weighted version of this test allows incorporating additional subject phenotypes to improve statistical power. We show via simulations that the GESE and weighted GESE tests maintain appropriate type I error rate, and have greater power than several commonly used region-based methods. We apply our method to whole-exome sequencing data from 49 extended pedigrees with severe, early-onset chronic obstructive pulmonary disease (COPD) in the Boston Early-Onset COPD study (BEOCOPD) and identify several promising candidate genes. Our proposed methods show great potential for identifying rare coding variants of large effect and high penetrance for family-based sequencing data. The proposed tests are implemented in an R package that is available on CRAN (https://cran.r-project.org/web/packages/GESE/).

KW - extended families

KW - filtering approach

KW - Mendelian disease

KW - whole exome sequencing

UR - http://www.scopus.com/inward/record.url?scp=85012950037&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85012950037&partnerID=8YFLogxK

U2 - 10.1002/gepi.22037

DO - 10.1002/gepi.22037

M3 - Article

C2 - 28191685

AN - SCOPUS:85012950037

VL - 41

SP - 309

EP - 319

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

IS - 4

ER -