Cargando…

Parallelization of the MAFFT multiple sequence alignment program

Summary: Multiple sequence alignment (MSA) is an important step in comparative sequence analyses. Parallelization is a key technique for reducing the time required for large-scale sequence analyses. The three calculation stages, all-to-all comparison, progressive alignment and iterative refinement,...

Descripción completa

Detalles Bibliográficos
Autores principales: Katoh, Kazutaka, Toh, Hiroyuki
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905546/
https://www.ncbi.nlm.nih.gov/pubmed/20427515
http://dx.doi.org/10.1093/bioinformatics/btq224
Descripción
Sumario:Summary: Multiple sequence alignment (MSA) is an important step in comparative sequence analyses. Parallelization is a key technique for reducing the time required for large-scale sequence analyses. The three calculation stages, all-to-all comparison, progressive alignment and iterative refinement, of the MAFFT MSA program were parallelized using the POSIX Threads library. Two natural parallelization strategies (best-first and simple hill-climbing) were implemented for the iterative refinement stage. Based on comparisons of the objective scores and benchmark scores between the two approaches, we selected a simple hill-climbing approach as the default. Availability: The parallelized version of MAFFT is available at http://mafft.cbrc.jp/alignment/software/. This version currently supports the Linux operating system only. Contact: kazutaka.katoh@aist.go.jp Supplementary information: Supplementary data are available at Bioinformatics online.