TY - JOUR
T1 - Inferring disease risk genes from sequencing data in multiplex pedigrees through sharing of rare variants
AU - Bureau, Alexandre
AU - Begum, Ferdouse
AU - Taub, Margaret A.
AU - Hetmanski, Jacqueline B.
AU - Parker, Margaret M.
AU - Albacha-Hejazi, Hasan
AU - Scott, Alan F.
AU - Murray, Jeffrey C.
AU - Marazita, Mary L.
AU - Bailey-Wilson, Joan E.
AU - Beaty, Terri H.
AU - Ruczinski, Ingo
N1 - Publisher Copyright:
© 2018 Wiley Periodicals, Inc.
PY - 2019/2
Y1 - 2019/2
N2 - We previously demonstrated how sharing of rare variants (RVs) in distant affected relatives can be used to identify variants causing a complex and heterogeneous disease. This approach tested whether single RVs were shared by all sequenced affected family members. However, as with other study designs, joint analysis of several RVs (e.g., within genes) is sometimes required to obtain sufficient statistical power. Further, phenocopies can lead to false negatives for some causal RVs if complete sharing among affected is required. Here, we extend our methodology (Rare Variant Sharing, RVS) to address these issues. Specifically, we introduce gene-based analyses, a partial sharing test based on RV sharing probabilities for subsets of affected relatives and a haplotype-based RV definition. RVS also has the desirable feature of not requiring external estimates of variant frequency or control samples, provides functionality to assess and address violations of key assumptions, and is available as open source software for genome-wide analysis. Simulations including phenocopies, based on the families of an oral cleft study, revealed the partial and complete sharing versions of RVS achieved similar statistical power compared with alternative methods (RareIBD and the Gene-Based Segregation Test), and had superior power compared with the pedigree Variant Annotation, Analysis, and Search Tool (pVAAST) linkage statistic. In studies of multiplex cleft families, analysis of rare single nucleotide variants in the exome of 151 affected relatives from 54 families revealed no significant excess sharing in any one gene, but highlighted different patterns of sharing revealed by the complete and partial sharing tests.
AB - We previously demonstrated how sharing of rare variants (RVs) in distant affected relatives can be used to identify variants causing a complex and heterogeneous disease. This approach tested whether single RVs were shared by all sequenced affected family members. However, as with other study designs, joint analysis of several RVs (e.g., within genes) is sometimes required to obtain sufficient statistical power. Further, phenocopies can lead to false negatives for some causal RVs if complete sharing among affected is required. Here, we extend our methodology (Rare Variant Sharing, RVS) to address these issues. Specifically, we introduce gene-based analyses, a partial sharing test based on RV sharing probabilities for subsets of affected relatives and a haplotype-based RV definition. RVS also has the desirable feature of not requiring external estimates of variant frequency or control samples, provides functionality to assess and address violations of key assumptions, and is available as open source software for genome-wide analysis. Simulations including phenocopies, based on the families of an oral cleft study, revealed the partial and complete sharing versions of RVS achieved similar statistical power compared with alternative methods (RareIBD and the Gene-Based Segregation Test), and had superior power compared with the pedigree Variant Annotation, Analysis, and Search Tool (pVAAST) linkage statistic. In studies of multiplex cleft families, analysis of rare single nucleotide variants in the exome of 151 affected relatives from 54 families revealed no significant excess sharing in any one gene, but highlighted different patterns of sharing revealed by the complete and partial sharing tests.
KW - family studies
KW - identity by descent
KW - oral clefts
KW - variant sharing
UR - http://www.scopus.com/inward/record.url?scp=85053721411&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85053721411&partnerID=8YFLogxK
U2 - 10.1002/gepi.22155
DO - 10.1002/gepi.22155
M3 - Article
C2 - 30246882
AN - SCOPUS:85053721411
SN - 0741-0395
VL - 43
SP - 37
EP - 49
JO - Genetic epidemiology
JF - Genetic epidemiology
IS - 1
ER -