Computational gene prediction using multiple sources of evidence

Research output: Contribution to journalArticle

Abstract

This article describes a computational method to construct gene models by using evidence generated from a diverse set of sources, including those typical of a genome annotation pipeline. The program, called Combiner, takes as input a genomic sequence and the locations of gene predictions from ab initio gene finders, protein sequence alignments, expressed sequence tag and cDNA alignments, splice site predictions, and other evidence. Three different algorithms for combining evidence in the Combiner were implemented and tested on 1783 confirmed genes in Arabidopsis thaliana. Our results show that combining gene prediction evidence consistently outperforms even the best individual gene finder and, in some cases, can produce dramatic improvements in sensitivity and specificity.

Original languageEnglish (US)
Pages (from-to)142-148
Number of pages7
JournalGenome research
Volume14
Issue number1
DOIs
StatePublished - Jan 2004

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Fingerprint Dive into the research topics of 'Computational gene prediction using multiple sources of evidence'. Together they form a unique fingerprint.

Cite this