Mugsy: Fast multiple alignment of closely related whole genomes

Samuel V. Angiuoli, Steven L. Salzberg

Research output: Contribution to journalArticle

Abstract

Motivation: The relative ease and low cost of current generation sequencing technologies has led to a dramatic increase in the number of sequenced genomes for species across the tree of life. This increasing volume of data requires tools that can quickly compare multiple whole-genome sequences, millions of base pairs in length, to aid in the study of populations, pan-genomes, and genome evolution. Results: We present a new multiple alignment tool for whole genomes named Mugsy. Mugsy is computationally efficient and can align 31 Streptococcus pneumoniae genomes in less than 2 hours producing alignments that compare favorably to other tools. Mugsy is also the fastest program evaluated for the multiple alignment of assembled human chromosome sequences from four individuals. Mugsy does not require a reference sequence, can align mixtures of assembled draft and completed genome data, and is robust in identifying a rich complement of genetic variation including duplications, rearrangements, and large-scale gain and loss of sequence.

Original languageEnglish (US)
Article numberbtq665
Pages (from-to)334-342
Number of pages9
JournalBioinformatics
Volume27
Issue number3
DOIs
StatePublished - Feb 1 2011
Externally publishedYes

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Fingerprint Dive into the research topics of 'Mugsy: Fast multiple alignment of closely related whole genomes'. Together they form a unique fingerprint.

  • Cite this