Operon structure is an important organization feature of bacterial genomes. Many sets of genes occur in the same order on multiple genomes; these conserved gene groupings represent candidate operons. This study describes a computational method to estimate the likelihood that such conserved gene sets form operons. The method was used to analyze 34 bacterial and archaeal genomes, and yielded more than 7600 pairs of genes that are highly likely (P ≥ 0.98) to belong to the same operon. The sensitivity of our method is 30-50% for the Escherichia coli genome. The predicted gene pairs are available from our World Wide Web site http:// www.tigr.org/tigr-scripts/operons/operons.cgi.
ASJC Scopus subject areas