Pathway analysis software: Annotation errors and solutions

Nicole K. Henderson-MacLennan, Jeanette C. Papp, C. Conover Talbot, Edward R.B. McCabe, Angela P. Presson

Research output: Contribution to journalArticlepeer-review

Abstract

Genetic databases contain a variety of annotation errors that often go unnoticed due to the large size of modern genetic data sets. Interpretation of these data sets requires bioinformatics tools that may contribute to this problem. While providing gene symbol annotations for identifiers (IDs) such as microarray probe set, RefSeq, GenBank, and Entrez Gene is seemingly trivial, the accuracy is fundamental to any subsequent conclusions. We examine gene symbol annotations and results from three commercial pathway analysis software (PAS) packages: Ingenuity Pathways Analysis, GeneGO, and Pathway Studio. We compare gene symbol annotations and canonical pathway results over time and among different input ID types. We find that PAS results can be affected by variation in gene symbol annotations across software releases and the input ID type analyzed. As a result, we offer suggestions for using commercial PAS and reporting microarray results to improve research quality. We propose a wiki type website to facilitate communication of bioinformatics software problems within the scientific community.

Original languageEnglish (US)
Pages (from-to)134-140
Number of pages7
JournalMolecular genetics and metabolism
Volume101
Issue number2-3
DOIs
StatePublished - Oct 2010

Keywords

  • Annotation
  • Bioinformatics
  • Gene identifiers
  • Microarray
  • Systems biology

ASJC Scopus subject areas

  • Endocrinology, Diabetes and Metabolism
  • Biochemistry
  • Molecular Biology
  • Genetics
  • Endocrinology

Fingerprint Dive into the research topics of 'Pathway analysis software: Annotation errors and solutions'. Together they form a unique fingerprint.

Cite this