Cargando…

MUSCLE: a multiple sequence alignment method with reduced time and space complexity

BACKGROUND: In a previous paper, we introduced MUSCLE, a new program for creating multiple alignments of protein sequences, giving a brief summary of the algorithm and showing MUSCLE to achieve the highest scores reported to date on four alignment accuracy benchmarks. Here we present a more complete...

Descripción completa

Detalles Bibliográficos
Autor principal: Edgar, Robert C
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC517706/
https://www.ncbi.nlm.nih.gov/pubmed/15318951
http://dx.doi.org/10.1186/1471-2105-5-113
_version_ 1782121783897358336
author Edgar, Robert C
author_facet Edgar, Robert C
author_sort Edgar, Robert C
collection PubMed
description BACKGROUND: In a previous paper, we introduced MUSCLE, a new program for creating multiple alignments of protein sequences, giving a brief summary of the algorithm and showing MUSCLE to achieve the highest scores reported to date on four alignment accuracy benchmarks. Here we present a more complete discussion of the algorithm, describing several previously unpublished techniques that improve biological accuracy and / or computational complexity. We introduce a new option, MUSCLE-fast, designed for high-throughput applications. We also describe a new protocol for evaluating objective functions that align two profiles. RESULTS: We compare the speed and accuracy of MUSCLE with CLUSTALW, Progressive POA and the MAFFT script FFTNS1, the fastest previously published program known to the author. Accuracy is measured using four benchmarks: BAliBASE, PREFAB, SABmark and SMART. We test three variants that offer highest accuracy (MUSCLE with default settings), highest speed (MUSCLE-fast), and a carefully chosen compromise between the two (MUSCLE-prog). We find MUSCLE-fast to be the fastest algorithm on all test sets, achieving average alignment accuracy similar to CLUSTALW in times that are typically two to three orders of magnitude less. MUSCLE-fast is able to align 1,000 sequences of average length 282 in 21 seconds on a current desktop computer. CONCLUSIONS: MUSCLE offers a range of options that provide improved speed and / or alignment accuracy compared with currently available programs. MUSCLE is freely available at .
format Text
id pubmed-517706
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5177062004-09-19 MUSCLE: a multiple sequence alignment method with reduced time and space complexity Edgar, Robert C BMC Bioinformatics Software BACKGROUND: In a previous paper, we introduced MUSCLE, a new program for creating multiple alignments of protein sequences, giving a brief summary of the algorithm and showing MUSCLE to achieve the highest scores reported to date on four alignment accuracy benchmarks. Here we present a more complete discussion of the algorithm, describing several previously unpublished techniques that improve biological accuracy and / or computational complexity. We introduce a new option, MUSCLE-fast, designed for high-throughput applications. We also describe a new protocol for evaluating objective functions that align two profiles. RESULTS: We compare the speed and accuracy of MUSCLE with CLUSTALW, Progressive POA and the MAFFT script FFTNS1, the fastest previously published program known to the author. Accuracy is measured using four benchmarks: BAliBASE, PREFAB, SABmark and SMART. We test three variants that offer highest accuracy (MUSCLE with default settings), highest speed (MUSCLE-fast), and a carefully chosen compromise between the two (MUSCLE-prog). We find MUSCLE-fast to be the fastest algorithm on all test sets, achieving average alignment accuracy similar to CLUSTALW in times that are typically two to three orders of magnitude less. MUSCLE-fast is able to align 1,000 sequences of average length 282 in 21 seconds on a current desktop computer. CONCLUSIONS: MUSCLE offers a range of options that provide improved speed and / or alignment accuracy compared with currently available programs. MUSCLE is freely available at . BioMed Central 2004-08-19 /pmc/articles/PMC517706/ /pubmed/15318951 http://dx.doi.org/10.1186/1471-2105-5-113 Text en Copyright © 2004 Edgar; licensee BioMed Central Ltd.
spellingShingle Software
Edgar, Robert C
MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_full MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_fullStr MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_full_unstemmed MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_short MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_sort muscle: a multiple sequence alignment method with reduced time and space complexity
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC517706/
https://www.ncbi.nlm.nih.gov/pubmed/15318951
http://dx.doi.org/10.1186/1471-2105-5-113
work_keys_str_mv AT edgarrobertc muscleamultiplesequencealignmentmethodwithreducedtimeandspacecomplexity