Cargando…

Optimization strategies for fast detection of positive selection on phylogenetic trees

Motivation: The detection of positive selection is widely used to study gene and genome evolution, but its application remains limited by the high computational cost of existing implementations. We present a series of computational optimizations for more efficient estimation of the likelihood functi...

Descripción completa

Detalles Bibliográficos
Autores principales: Valle, Mario, Schabauer, Hannes, Pacher, Christoph, Stockinger, Heinz, Stamatakis, Alexandros, Robinson-Rechavi, Marc, Salamin, Nicolas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3982156/
https://www.ncbi.nlm.nih.gov/pubmed/24389654
http://dx.doi.org/10.1093/bioinformatics/btt760
_version_ 1782311145780019200
author Valle, Mario
Schabauer, Hannes
Pacher, Christoph
Stockinger, Heinz
Stamatakis, Alexandros
Robinson-Rechavi, Marc
Salamin, Nicolas
author_facet Valle, Mario
Schabauer, Hannes
Pacher, Christoph
Stockinger, Heinz
Stamatakis, Alexandros
Robinson-Rechavi, Marc
Salamin, Nicolas
author_sort Valle, Mario
collection PubMed
description Motivation: The detection of positive selection is widely used to study gene and genome evolution, but its application remains limited by the high computational cost of existing implementations. We present a series of computational optimizations for more efficient estimation of the likelihood function on large-scale phylogenetic problems. We illustrate our approach using the branch-site model of codon evolution. Results: We introduce novel optimization techniques that substantially outperform both CodeML from the PAML package and our previously optimized sequential version SlimCodeML. These techniques can also be applied to other likelihood-based phylogeny software. Our implementation scales well for large numbers of codons and/or species. It can therefore analyse substantially larger datasets than CodeML. We evaluated FastCodeML on different platforms and measured average sequential speedups of FastCodeML (single-threaded) versus CodeML of up to 5.8, average speedups of FastCodeML (multi-threaded) versus CodeML on a single node (shared memory) of up to 36.9 for 12 CPU cores, and average speedups of the distributed FastCodeML versus CodeML of up to 170.9 on eight nodes (96 CPU cores in total). Availability and implementation: ftp://ftp.vital-it.ch/tools/FastCodeML/. Contact: selectome@unil.ch or nicolas.salamin@unil.ch
format Online
Article
Text
id pubmed-3982156
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-39821562014-05-14 Optimization strategies for fast detection of positive selection on phylogenetic trees Valle, Mario Schabauer, Hannes Pacher, Christoph Stockinger, Heinz Stamatakis, Alexandros Robinson-Rechavi, Marc Salamin, Nicolas Bioinformatics Original Papers Motivation: The detection of positive selection is widely used to study gene and genome evolution, but its application remains limited by the high computational cost of existing implementations. We present a series of computational optimizations for more efficient estimation of the likelihood function on large-scale phylogenetic problems. We illustrate our approach using the branch-site model of codon evolution. Results: We introduce novel optimization techniques that substantially outperform both CodeML from the PAML package and our previously optimized sequential version SlimCodeML. These techniques can also be applied to other likelihood-based phylogeny software. Our implementation scales well for large numbers of codons and/or species. It can therefore analyse substantially larger datasets than CodeML. We evaluated FastCodeML on different platforms and measured average sequential speedups of FastCodeML (single-threaded) versus CodeML of up to 5.8, average speedups of FastCodeML (multi-threaded) versus CodeML on a single node (shared memory) of up to 36.9 for 12 CPU cores, and average speedups of the distributed FastCodeML versus CodeML of up to 170.9 on eight nodes (96 CPU cores in total). Availability and implementation: ftp://ftp.vital-it.ch/tools/FastCodeML/. Contact: selectome@unil.ch or nicolas.salamin@unil.ch Oxford University Press 2014-04-15 2014-01-02 /pmc/articles/PMC3982156/ /pubmed/24389654 http://dx.doi.org/10.1093/bioinformatics/btt760 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Valle, Mario
Schabauer, Hannes
Pacher, Christoph
Stockinger, Heinz
Stamatakis, Alexandros
Robinson-Rechavi, Marc
Salamin, Nicolas
Optimization strategies for fast detection of positive selection on phylogenetic trees
title Optimization strategies for fast detection of positive selection on phylogenetic trees
title_full Optimization strategies for fast detection of positive selection on phylogenetic trees
title_fullStr Optimization strategies for fast detection of positive selection on phylogenetic trees
title_full_unstemmed Optimization strategies for fast detection of positive selection on phylogenetic trees
title_short Optimization strategies for fast detection of positive selection on phylogenetic trees
title_sort optimization strategies for fast detection of positive selection on phylogenetic trees
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3982156/
https://www.ncbi.nlm.nih.gov/pubmed/24389654
http://dx.doi.org/10.1093/bioinformatics/btt760
work_keys_str_mv AT vallemario optimizationstrategiesforfastdetectionofpositiveselectiononphylogenetictrees
AT schabauerhannes optimizationstrategiesforfastdetectionofpositiveselectiononphylogenetictrees
AT pacherchristoph optimizationstrategiesforfastdetectionofpositiveselectiononphylogenetictrees
AT stockingerheinz optimizationstrategiesforfastdetectionofpositiveselectiononphylogenetictrees
AT stamatakisalexandros optimizationstrategiesforfastdetectionofpositiveselectiononphylogenetictrees
AT robinsonrechavimarc optimizationstrategiesforfastdetectionofpositiveselectiononphylogenetictrees
AT salaminnicolas optimizationstrategiesforfastdetectionofpositiveselectiononphylogenetictrees