Identity-by-descent filtering as a tool for the identification of disease alleles in exome sequence data from distant relatives

Nirmala Akula; Sevilla Detera-Wadleigh; Yin Yao Shugart; Michael Nalls; Jo Steele; Francis J. McMahon

doi:10.1186/1753-6561-5-S9-S76

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

Large-scale, deep resequencing may be the next logical step in the genetic investigation of common complex diseases. Because each individual is likely to carry many thousands of variants, the identification of causal alleles requires an efficient strategy to reduce the number of candidate variants. Under many genetic models, causal alleles can be expected to reside within identity-by-descent (IBD) regions shared by affected relatives. In distant relatives, IBD regions constitute a small portion of the genome and can thus greatly reduce the search space for causal alleles. However, the effectiveness of this strategy is unknown. We test the simulated mini-exome data set in extended pedigrees provided by Genetic Analysis Workshop 17. At the fourth- and fifth-degree level of relatedness, case-case pairs shared between 1% and 9% of the genome identical by descent. As expected, no genes were shared identical by descent by all case subjects, but 43 genes were shared by many case subjects across at least 50 replicates. We filtered variants in these genes based on population frequency, function, informativeness, and evidence of association using the family-based association test. This analysis highlighted five genes previously implicated in triglyceride, lipid, and cholesterol metabolism. Comparison with the list of true risk alleles revealed that strict IBD filtering followed by association testing of the rarest alleles was the most sensitive strategy. IBD filtering may be a useful strategy for narrowing down the list of candidate variants in exome data, but the optimal degree of relatedness of affected pairs will depend on the genetic architecture of the disease under study.
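The sharing range reported in the abstract can be compared with the textbook expectation that relatives of degree d share on average (1/2)^d of the genome identical by descent. The sketch below computes that expectation and applies a toy gene-level IBD filter. It is an illustration only and not the authors' pipeline: the function names, the pair labels, the gene names, and the sharing threshold are all hypothetical, and the expectation assumes an outbred pedigree with a single line of relationship.

```python
from collections import Counter


def expected_ibd_fraction(degree: int) -> float:
    """Expected proportion of the genome shared IBD by relatives of the
    given degree, assuming an outbred pedigree (illustrative assumption)."""
    if degree < 1:
        raise ValueError("degree of relatedness must be >= 1")
    return 0.5 ** degree


def genes_shared_ibd(ibd_genes_by_pair: dict, min_fraction: float) -> set:
    """Return the genes that lie in IBD regions for at least `min_fraction`
    of the affected (case-case) pairs.

    `ibd_genes_by_pair` maps a pair label to the set of genes inside that
    pair's IBD segments; both the layout and the threshold are hypothetical.
    """
    n_pairs = len(ibd_genes_by_pair)
    if n_pairs == 0:
        return set()
    counts = Counter(
        gene for genes in ibd_genes_by_pair.values() for gene in genes
    )
    return {gene for gene, c in counts.items() if c / n_pairs >= min_fraction}


# Toy input with made-up pair labels and gene names.
toy_pairs = {
    ("case1", "case2"): {"GENE_A", "GENE_B"},
    ("case1", "case3"): {"GENE_A"},
    ("case2", "case3"): {"GENE_A", "GENE_C"},
}

fourth_degree = expected_ibd_fraction(4)  # 1/16 of the genome
fifth_degree = expected_ibd_fraction(5)   # 1/32 of the genome
candidates = genes_shared_ibd(toy_pairs, min_fraction=2 / 3)
```

Under these assumptions the expected sharing is 6.25% for fourth-degree and 3.125% for fifth-degree relatives, both inside the 1% to 9% range reported above. Lowering `min_fraction` trades specificity for sensitivity, in line with the abstract's observation that no gene was shared identical by descent by all case subjects.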

Original language	English (US)
Article number	S76
Journal	BMC Proceedings
Volume	5
Issue number	SUPPL. 9
DOIs	https://doi.org/10.1186/1753-6561-5-S9-S76
State	Published - 2011
Externally published	Yes

ASJC Scopus subject areas

General Biochemistry, Genetics and Molecular Biology

Cite this

@article{20a10322882049e082d61989528e8834,

title = "Identity-by-descent filtering as a tool for the identification of disease alleles in exome sequence data from distant relatives",

abstract = "Large-scale, deep resequencing may be the next logical step in the genetic investigation of common complex diseases. Because each individual is likely to carry many thousands of variants, the identification of causal alleles requires an efficient strategy to reduce the number of candidate variants. Under many genetic models, causal alleles can be expected to reside within identity-by-descent (IBD) regions shared by affected relatives. In distant relatives, IBD regions constitute a small portion of the genome and can thus greatly reduce the search space for causal alleles. However, the effectiveness of this strategy is unknown. We test the simulated mini-exome data set in extended pedigrees provided by Genetic Analysis Workshop 17. At the fourth- and fifth-degree level of relatedness, case-case pairs shared between 1\% and 9\% of the genome identical by descent. As expected, no genes were shared identical by descent by all case subjects, but 43 genes were shared by many case subjects across at least 50 replicates. We filtered variants in these genes based on population frequency, function, informativeness, and evidence of association using the family-based association test. This analysis highlighted five genes previously implicated in triglyceride, lipid, and cholesterol metabolism. Comparison with the list of true risk alleles revealed that strict IBD filtering followed by association testing of the rarest alleles was the most sensitive strategy. IBD filtering may be a useful strategy for narrowing down the list of candidate variants in exome data, but the optimal degree of relatedness of affected pairs will depend on the genetic architecture of the disease under study.",

author = "Akula, Nirmala and Detera-Wadleigh, Sevilla and Shugart, {Yin Yao} and Nalls, Michael and Steele, Jo and McMahon, {Francis J.}",

note = "Funding Information: The Genetic Analysis Workshop is supported by National Institutes of Health grant R01 GM031575. This work was supported by the National Institute of Mental Health (NIMH) Intramural Research Program. The computational analysis was performed on the Helix server at the National Institutes of Health. This article has been published as part of BMC Proceedings Volume 5 Supplement 9, 2011: Genetic Analysis Workshop 17. The full contents of the supplement are available online at http://www.biomedcentral.com/1753-6561/5?issue=S9.",

year = "2011",

doi = "10.1186/1753-6561-5-S9-S76",

language = "English (US)",

volume = "5",

journal = "BMC Proceedings",

issn = "1753-6561",

publisher = "BioMed Central",

number = "SUPPL. 9",

}

TY - JOUR

T1 - Identity-by-descent filtering as a tool for the identification of disease alleles in exome sequence data from distant relatives

AU - Akula, Nirmala

AU - Detera-Wadleigh, Sevilla

AU - Shugart, Yin Yao

AU - Nalls, Michael

AU - Steele, Jo

AU - McMahon, Francis J.

N1 - Funding Information: The Genetic Analysis Workshop is supported by National Institutes of Health grant R01 GM031575. This work was supported by the National Institute of Mental Health (NIMH) Intramural Research Program. The computational analysis was performed on the Helix server at the National Institutes of Health. This article has been published as part of BMC Proceedings Volume 5 Supplement 9, 2011: Genetic Analysis Workshop 17. The full contents of the supplement are available online at http://www.biomedcentral.com/1753-6561/5?issue=S9.

PY - 2011

Y1 - 2011

AB - Large-scale, deep resequencing may be the next logical step in the genetic investigation of common complex diseases. Because each individual is likely to carry many thousands of variants, the identification of causal alleles requires an efficient strategy to reduce the number of candidate variants. Under many genetic models, causal alleles can be expected to reside within identity-by-descent (IBD) regions shared by affected relatives. In distant relatives, IBD regions constitute a small portion of the genome and can thus greatly reduce the search space for causal alleles. However, the effectiveness of this strategy is unknown. We test the simulated mini-exome data set in extended pedigrees provided by Genetic Analysis Workshop 17. At the fourth- and fifth-degree level of relatedness, case-case pairs shared between 1% and 9% of the genome identical by descent. As expected, no genes were shared identical by descent by all case subjects, but 43 genes were shared by many case subjects across at least 50 replicates. We filtered variants in these genes based on population frequency, function, informativeness, and evidence of association using the family-based association test. This analysis highlighted five genes previously implicated in triglyceride, lipid, and cholesterol metabolism. Comparison with the list of true risk alleles revealed that strict IBD filtering followed by association testing of the rarest alleles was the most sensitive strategy. IBD filtering may be a useful strategy for narrowing down the list of candidate variants in exome data, but the optimal degree of relatedness of affected pairs will depend on the genetic architecture of the disease under study.

UR - http://www.scopus.com/inward/record.url?scp=82455194217&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=82455194217&partnerID=8YFLogxK

U2 - 10.1186/1753-6561-5-S9-S76

DO - 10.1186/1753-6561-5-S9-S76

M3 - Article

C2 - 22373213

AN - SCOPUS:82455194217

SN - 1753-6561

VL - 5

JO - BMC Proceedings

JF - BMC Proceedings

IS - SUPPL. 9

M1 - S76

ER -
