Mugsy: fast multiple alignment of closely related whole genomes

Bibliographic Details
Main authors: Angiuoli, Samuel V., Salzberg, Steven L.
Format: Text
Language: English
Published: Oxford University Press 2011
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3031037/
https://www.ncbi.nlm.nih.gov/pubmed/21148543
http://dx.doi.org/10.1093/bioinformatics/btq665
_version_ 1782197313196785664
author Angiuoli, Samuel V.
Salzberg, Steven L.
author_facet Angiuoli, Samuel V.
Salzberg, Steven L.
author_sort Angiuoli, Samuel V.
collection PubMed
description Motivation: The relative ease and low cost of current generation sequencing technologies has led to a dramatic increase in the number of sequenced genomes for species across the tree of life. This increasing volume of data requires tools that can quickly compare multiple whole-genome sequences, millions of base pairs in length, to aid in the study of populations, pan-genomes, and genome evolution. Results: We present a new multiple alignment tool for whole genomes named Mugsy. Mugsy is computationally efficient and can align 31 Streptococcus pneumoniae genomes in less than 2 hours producing alignments that compare favorably to other tools. Mugsy is also the fastest program evaluated for the multiple alignment of assembled human chromosome sequences from four individuals. Mugsy does not require a reference sequence, can align mixtures of assembled draft and completed genome data, and is robust in identifying a rich complement of genetic variation including duplications, rearrangements, and large-scale gain and loss of sequence. Availability: Mugsy is free, open-source software available from http://mugsy.sf.net. Contact: angiuoli@cs.umd.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-3031037
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30310372011-02-02 Mugsy: fast multiple alignment of closely related whole genomes Angiuoli, Samuel V. Salzberg, Steven L. Bioinformatics Original Papers Motivation: The relative ease and low cost of current generation sequencing technologies has led to a dramatic increase in the number of sequenced genomes for species across the tree of life. This increasing volume of data requires tools that can quickly compare multiple whole-genome sequences, millions of base pairs in length, to aid in the study of populations, pan-genomes, and genome evolution. Results: We present a new multiple alignment tool for whole genomes named Mugsy. Mugsy is computationally efficient and can align 31 Streptococcus pneumoniae genomes in less than 2 hours producing alignments that compare favorably to other tools. Mugsy is also the fastest program evaluated for the multiple alignment of assembled human chromosome sequences from four individuals. Mugsy does not require a reference sequence, can align mixtures of assembled draft and completed genome data, and is robust in identifying a rich complement of genetic variation including duplications, rearrangements, and large-scale gain and loss of sequence. Availability: Mugsy is free, open-source software available from http://mugsy.sf.net. Contact: angiuoli@cs.umd.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-02-01 2010-12-09 /pmc/articles/PMC3031037/ /pubmed/21148543 http://dx.doi.org/10.1093/bioinformatics/btq665 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Angiuoli, Samuel V.
Salzberg, Steven L.
Mugsy: fast multiple alignment of closely related whole genomes
title Mugsy: fast multiple alignment of closely related whole genomes
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3031037/
https://www.ncbi.nlm.nih.gov/pubmed/21148543
http://dx.doi.org/10.1093/bioinformatics/btq665