Cargando…

PUmPER: phylogenies updated perpetually

Summary: New sequence data useful for phylogenetic and evolutionary analyses continues to be added to public databases. The construction of multiple sequence alignments and inference of huge phylogenies comprising large taxonomic groups are expensive tasks, both in terms of man hours and computation...

Descripción completa

Detalles Bibliográficos
Autores principales: Izquierdo-Carrasco, Fernando, Cazes, John, Smith, Stephen A., Stamatakis, Alexandros
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4016711/
https://www.ncbi.nlm.nih.gov/pubmed/24478338
http://dx.doi.org/10.1093/bioinformatics/btu053
_version_ 1782315553602404352
author Izquierdo-Carrasco, Fernando
Cazes, John
Smith, Stephen A.
Stamatakis, Alexandros
author_facet Izquierdo-Carrasco, Fernando
Cazes, John
Smith, Stephen A.
Stamatakis, Alexandros
author_sort Izquierdo-Carrasco, Fernando
collection PubMed
description Summary: New sequence data useful for phylogenetic and evolutionary analyses continues to be added to public databases. The construction of multiple sequence alignments and inference of huge phylogenies comprising large taxonomic groups are expensive tasks, both in terms of man hours and computational resources. Therefore, maintaining comprehensive phylogenies, based on representative and up-to-date molecular sequences, is challenging. PUmPER is a framework that can perpetually construct multi-gene alignments (with PHLAWD) and phylogenetic trees (with ExaML or RAxML-Light) for a given NCBI taxonomic group. When sufficient numbers of new gene sequences for the selected taxonomic group have accumulated in GenBank, PUmPER automatically extends the alignment and infers extended phylogenetic trees by using previously inferred smaller trees as starting topologies. Using our framework, large phylogenetic trees can be perpetually updated without human intervention. Importantly, resulting phylogenies are not statistically significantly worse than trees inferred from scratch. Availability and implementation: PUmPER can run in stand-alone mode on a single server, or offload the computationally expensive phylogenetic searches to a parallel computing cluster. Source code, documentation, and tutorials are available at https://github.com/fizquierdo/perpetually-updated-trees. Contact: Fernando.Izquierdo@h-its.org Supplementary information: Supplementary Material is available at Bioinformatics online.
format Online
Article
Text
id pubmed-4016711
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40167112014-05-12 PUmPER: phylogenies updated perpetually Izquierdo-Carrasco, Fernando Cazes, John Smith, Stephen A. Stamatakis, Alexandros Bioinformatics Applications Notes Summary: New sequence data useful for phylogenetic and evolutionary analyses continues to be added to public databases. The construction of multiple sequence alignments and inference of huge phylogenies comprising large taxonomic groups are expensive tasks, both in terms of man hours and computational resources. Therefore, maintaining comprehensive phylogenies, based on representative and up-to-date molecular sequences, is challenging. PUmPER is a framework that can perpetually construct multi-gene alignments (with PHLAWD) and phylogenetic trees (with ExaML or RAxML-Light) for a given NCBI taxonomic group. When sufficient numbers of new gene sequences for the selected taxonomic group have accumulated in GenBank, PUmPER automatically extends the alignment and infers extended phylogenetic trees by using previously inferred smaller trees as starting topologies. Using our framework, large phylogenetic trees can be perpetually updated without human intervention. Importantly, resulting phylogenies are not statistically significantly worse than trees inferred from scratch. Availability and implementation: PUmPER can run in stand-alone mode on a single server, or offload the computationally expensive phylogenetic searches to a parallel computing cluster. Source code, documentation, and tutorials are available at https://github.com/fizquierdo/perpetually-updated-trees. Contact: Fernando.Izquierdo@h-its.org Supplementary information: Supplementary Material is available at Bioinformatics online. Oxford University Press 2014-05-15 2014-02-28 /pmc/articles/PMC4016711/ /pubmed/24478338 http://dx.doi.org/10.1093/bioinformatics/btu053 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Notes
Izquierdo-Carrasco, Fernando
Cazes, John
Smith, Stephen A.
Stamatakis, Alexandros
PUmPER: phylogenies updated perpetually
title PUmPER: phylogenies updated perpetually
title_full PUmPER: phylogenies updated perpetually
title_fullStr PUmPER: phylogenies updated perpetually
title_full_unstemmed PUmPER: phylogenies updated perpetually
title_short PUmPER: phylogenies updated perpetually
title_sort pumper: phylogenies updated perpetually
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4016711/
https://www.ncbi.nlm.nih.gov/pubmed/24478338
http://dx.doi.org/10.1093/bioinformatics/btu053
work_keys_str_mv AT izquierdocarrascofernando pumperphylogeniesupdatedperpetually
AT cazesjohn pumperphylogeniesupdatedperpetually
AT smithstephena pumperphylogeniesupdatedperpetually
AT stamatakisalexandros pumperphylogeniesupdatedperpetually