The uniform-score gene set analysis for identifying common pathways associated with different diabetes traits

Hao Mei, Lianna Li, Shijian Liu, Fan Jiang, Michael Griswold, Thomas Mosley

Research output: Contribution to journalArticle

Abstract

Background: Genetic heritability and expression study have shown that different diabetes traits have common genetic components and pathways. A computationally efficient pathway analysis of GWAS results will benefit post-GWAS study of SNP associations and identification of common genetic pathways from diabetes GWAS can help to improve understanding of the disease pathogenesis. Results: We proposed a uniform-score gene-set analysis (USGSA) with implemented package to unify different gene measures by a uniform score for identifying pathways from GWAS data, and use a pre-generated permutation distribution table to quickly obtain multiple-testing adjusted p-value. Simulation studies of uniform score for four gene measures (minP, 2ndP, simP and fishP) have shown that USGSA has strictly controlled family-wise error rate. The power depends on types of gene measure. USGSA with a two-stage study strategy was applied to identify common pathways associated with diabetes traits based on public dbGaP GWAS results. The study identified 7 gene sets that contain binding motifs at promoter region of component genes for 5 transcription factors (TFs) of FOXO4, TCF3, NFAT, VSX1 and POU2F1, and 1 microRNA of mir-218. These gene sets include 25 common genes that are among top 5% of the gene associations over genome for all GWAS. Previous evidences showed that nearly all of these genes are mainly expressed in the brain. Conclusions: USGSA is a computationally efficient approach for pathway analysis of GWAS data with promoted interpretability and comparability. The pathway analysis suggested that different diabetes traits share common pathways and component genes are potentially regulated by common TFs and microRNA. The result also indicated that the central nervous system has a critical role in diabetes pathogenesis. The findings will be important in formulating novel hypotheses for guiding follow-up studies.

Original languageEnglish (US)
Article number336
JournalBMC Genomics
Volume16
Issue number1
DOIs
StatePublished - Apr 23 2015
Externally publishedYes

Fingerprint

Genome-Wide Association Study
Genes
Gene Components
MicroRNAs
Transcription Factors
Genetic Promoter Regions
Single Nucleotide Polymorphism
Central Nervous System
Genome
Brain

Keywords

  • Diabetes
  • GWAS
  • Pathway

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Cite this

The uniform-score gene set analysis for identifying common pathways associated with different diabetes traits. / Mei, Hao; Li, Lianna; Liu, Shijian; Jiang, Fan; Griswold, Michael; Mosley, Thomas.

In: BMC Genomics, Vol. 16, No. 1, 336, 23.04.2015.

Research output: Contribution to journalArticle

Mei, Hao ; Li, Lianna ; Liu, Shijian ; Jiang, Fan ; Griswold, Michael ; Mosley, Thomas. / The uniform-score gene set analysis for identifying common pathways associated with different diabetes traits. In: BMC Genomics. 2015 ; Vol. 16, No. 1.
@article{c25879acbae84396ba2062f8c8bdf712,
title = "The uniform-score gene set analysis for identifying common pathways associated with different diabetes traits",
abstract = "Background: Genetic heritability and expression study have shown that different diabetes traits have common genetic components and pathways. A computationally efficient pathway analysis of GWAS results will benefit post-GWAS study of SNP associations and identification of common genetic pathways from diabetes GWAS can help to improve understanding of the disease pathogenesis. Results: We proposed a uniform-score gene-set analysis (USGSA) with implemented package to unify different gene measures by a uniform score for identifying pathways from GWAS data, and use a pre-generated permutation distribution table to quickly obtain multiple-testing adjusted p-value. Simulation studies of uniform score for four gene measures (minP, 2ndP, simP and fishP) have shown that USGSA has strictly controlled family-wise error rate. The power depends on types of gene measure. USGSA with a two-stage study strategy was applied to identify common pathways associated with diabetes traits based on public dbGaP GWAS results. The study identified 7 gene sets that contain binding motifs at promoter region of component genes for 5 transcription factors (TFs) of FOXO4, TCF3, NFAT, VSX1 and POU2F1, and 1 microRNA of mir-218. These gene sets include 25 common genes that are among top 5{\%} of the gene associations over genome for all GWAS. Previous evidences showed that nearly all of these genes are mainly expressed in the brain. Conclusions: USGSA is a computationally efficient approach for pathway analysis of GWAS data with promoted interpretability and comparability. The pathway analysis suggested that different diabetes traits share common pathways and component genes are potentially regulated by common TFs and microRNA. The result also indicated that the central nervous system has a critical role in diabetes pathogenesis. The findings will be important in formulating novel hypotheses for guiding follow-up studies.",
keywords = "Diabetes, GWAS, Pathway",
author = "Hao Mei and Lianna Li and Shijian Liu and Fan Jiang and Michael Griswold and Thomas Mosley",
year = "2015",
month = "4",
day = "23",
doi = "10.1186/s12864-015-1515-3",
language = "English (US)",
volume = "16",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - The uniform-score gene set analysis for identifying common pathways associated with different diabetes traits

AU - Mei, Hao

AU - Li, Lianna

AU - Liu, Shijian

AU - Jiang, Fan

AU - Griswold, Michael

AU - Mosley, Thomas

PY - 2015/4/23

Y1 - 2015/4/23

N2 - Background: Genetic heritability and expression study have shown that different diabetes traits have common genetic components and pathways. A computationally efficient pathway analysis of GWAS results will benefit post-GWAS study of SNP associations and identification of common genetic pathways from diabetes GWAS can help to improve understanding of the disease pathogenesis. Results: We proposed a uniform-score gene-set analysis (USGSA) with implemented package to unify different gene measures by a uniform score for identifying pathways from GWAS data, and use a pre-generated permutation distribution table to quickly obtain multiple-testing adjusted p-value. Simulation studies of uniform score for four gene measures (minP, 2ndP, simP and fishP) have shown that USGSA has strictly controlled family-wise error rate. The power depends on types of gene measure. USGSA with a two-stage study strategy was applied to identify common pathways associated with diabetes traits based on public dbGaP GWAS results. The study identified 7 gene sets that contain binding motifs at promoter region of component genes for 5 transcription factors (TFs) of FOXO4, TCF3, NFAT, VSX1 and POU2F1, and 1 microRNA of mir-218. These gene sets include 25 common genes that are among top 5% of the gene associations over genome for all GWAS. Previous evidences showed that nearly all of these genes are mainly expressed in the brain. Conclusions: USGSA is a computationally efficient approach for pathway analysis of GWAS data with promoted interpretability and comparability. The pathway analysis suggested that different diabetes traits share common pathways and component genes are potentially regulated by common TFs and microRNA. The result also indicated that the central nervous system has a critical role in diabetes pathogenesis. The findings will be important in formulating novel hypotheses for guiding follow-up studies.

AB - Background: Genetic heritability and expression study have shown that different diabetes traits have common genetic components and pathways. A computationally efficient pathway analysis of GWAS results will benefit post-GWAS study of SNP associations and identification of common genetic pathways from diabetes GWAS can help to improve understanding of the disease pathogenesis. Results: We proposed a uniform-score gene-set analysis (USGSA) with implemented package to unify different gene measures by a uniform score for identifying pathways from GWAS data, and use a pre-generated permutation distribution table to quickly obtain multiple-testing adjusted p-value. Simulation studies of uniform score for four gene measures (minP, 2ndP, simP and fishP) have shown that USGSA has strictly controlled family-wise error rate. The power depends on types of gene measure. USGSA with a two-stage study strategy was applied to identify common pathways associated with diabetes traits based on public dbGaP GWAS results. The study identified 7 gene sets that contain binding motifs at promoter region of component genes for 5 transcription factors (TFs) of FOXO4, TCF3, NFAT, VSX1 and POU2F1, and 1 microRNA of mir-218. These gene sets include 25 common genes that are among top 5% of the gene associations over genome for all GWAS. Previous evidences showed that nearly all of these genes are mainly expressed in the brain. Conclusions: USGSA is a computationally efficient approach for pathway analysis of GWAS data with promoted interpretability and comparability. The pathway analysis suggested that different diabetes traits share common pathways and component genes are potentially regulated by common TFs and microRNA. The result also indicated that the central nervous system has a critical role in diabetes pathogenesis. The findings will be important in formulating novel hypotheses for guiding follow-up studies.

KW - Diabetes

KW - GWAS

KW - Pathway

UR - http://www.scopus.com/inward/record.url?scp=84928522831&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84928522831&partnerID=8YFLogxK

U2 - 10.1186/s12864-015-1515-3

DO - 10.1186/s12864-015-1515-3

M3 - Article

C2 - 25898945

AN - SCOPUS:84928522831

VL - 16

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

IS - 1

M1 - 336

ER -