TY - JOUR
T1 - Imputation of Variants from the 1000 Genomes Project Modestly Improves Known Associations and Can Identify Low-frequency Variant - Phenotype Associations Undetected by HapMap Based Imputation
AU - Wood, Andrew R.
AU - Perry, John R B
AU - Tanaka, Toshiko
AU - Hernandez, Dena G.
AU - Zheng, Hou Feng
AU - Melzer, David
AU - Gibbs, J. Raphael
AU - Nalls, Michael A.
AU - Weedon, Michael N.
AU - Spector, Tim D.
AU - Richards, J. Brent
AU - Bandinelli, Stefania
AU - Ferrucci, Luigi
AU - Singleton, Andrew B.
AU - Frayling, Timothy M.
PY - 2013/5/16
Y1 - 2013/5/16
N2 - Genome-wide association (GWA) studies have been limited by the reliance on common variants present on microarrays or imputable from the HapMap Project data. More recently, the completion of the 1000 Genomes Project has provided variant and haplotype information for several million variants derived from sequencing over 1,000 individuals. To help understand the extent to which more variants (including low frequency (1% ≤ MAF -8 based on HapMap and 1000 Genomes imputation, respectively, and 9 and 11 that reached a stricter, likely conservative, threshold of P-11 respectively. Imputation of 1000 Genomes genotype data modestly improved the strength of known associations. Of 20 associations detected at P-8 in both analyses (17 of which represent well replicated signals in the NHGRI catalogue), six were captured by the same index SNP, five were nominally more strongly associated in 1000 Genomes imputed data and one was nominally more strongly associated in HapMap imputed data. We also detected an association between a low frequency variant and phenotype that was previously missed by HapMap based imputation approaches. An association between rs112635299 and alpha-1 globulin near the SERPINA gene represented the known association between rs28929474 (MAF = 0.007) and alpha1-antitrypsin that predisposes to emphysema (P = 2.5×10-12). Our data provide important proof of principle that 1000 Genomes imputation will detect novel, low frequency-large effect associations.
AB - Genome-wide association (GWA) studies have been limited by the reliance on common variants present on microarrays or imputable from the HapMap Project data. More recently, the completion of the 1000 Genomes Project has provided variant and haplotype information for several million variants derived from sequencing over 1,000 individuals. To help understand the extent to which more variants (including low frequency (1% ≤ MAF -8 based on HapMap and 1000 Genomes imputation, respectively, and 9 and 11 that reached a stricter, likely conservative, threshold of P-11 respectively. Imputation of 1000 Genomes genotype data modestly improved the strength of known associations. Of 20 associations detected at P-8 in both analyses (17 of which represent well replicated signals in the NHGRI catalogue), six were captured by the same index SNP, five were nominally more strongly associated in 1000 Genomes imputed data and one was nominally more strongly associated in HapMap imputed data. We also detected an association between a low frequency variant and phenotype that was previously missed by HapMap based imputation approaches. An association between rs112635299 and alpha-1 globulin near the SERPINA gene represented the known association between rs28929474 (MAF = 0.007) and alpha1-antitrypsin that predisposes to emphysema (P = 2.5×10-12). Our data provide important proof of principle that 1000 Genomes imputation will detect novel, low frequency-large effect associations.
UR - http://www.scopus.com/inward/record.url?scp=84877816544&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84877816544&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0064343
DO - 10.1371/journal.pone.0064343
M3 - Article
C2 - 23696881
AN - SCOPUS:84877816544
SN - 1932-6203
VL - 8
JO - PLoS One
JF - PLoS One
IS - 5
M1 - e64343
ER -