Cargando…

Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice

The presence of rogue taxa (rogues) in a set of trees can frequently have a negative impact on the results of a bootstrap analysis (e.g., the overall support in consensus trees). We introduce an efficient graph-based algorithm for rogue taxon identification as well as an interactive webservice imple...

Descripción completa

Detalles Bibliográficos
Autores principales: Aberer, Andre J., Krompass, Denis, Stamatakis, Alexandros
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526802/
https://www.ncbi.nlm.nih.gov/pubmed/22962004
http://dx.doi.org/10.1093/sysbio/sys078
_version_ 1782253626417217536
author Aberer, Andre J.
Krompass, Denis
Stamatakis, Alexandros
author_facet Aberer, Andre J.
Krompass, Denis
Stamatakis, Alexandros
author_sort Aberer, Andre J.
collection PubMed
description The presence of rogue taxa (rogues) in a set of trees can frequently have a negative impact on the results of a bootstrap analysis (e.g., the overall support in consensus trees). We introduce an efficient graph-based algorithm for rogue taxon identification as well as an interactive webservice implementing this algorithm. Compared with our previous method, the new algorithm is up to 4 orders of magnitude faster, while returning qualitatively identical results. Because of this significant improvement in scalability, the new algorithm can now identify substantially more complex and compute-intensive rogue taxon constellations. On a large and diverse collection of real-world data sets, we show that our method yields better supported reduced/pruned consensus trees than any competing rogue taxon identification method. Using the parallel version of our open-source code, we successfully identified rogue taxa in a set of 100 trees with 116 334 taxa each. For simulated data sets, we show that when removing/pruning rogue taxa with our method from a tree set, we consistently obtain bootstrap consensus trees as well as maximum-likelihood trees that are topologically closer to the respective true trees.
format Online
Article
Text
id pubmed-3526802
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35268022012-12-20 Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice Aberer, Andre J. Krompass, Denis Stamatakis, Alexandros Syst Biol Software for Systematics and Evolution The presence of rogue taxa (rogues) in a set of trees can frequently have a negative impact on the results of a bootstrap analysis (e.g., the overall support in consensus trees). We introduce an efficient graph-based algorithm for rogue taxon identification as well as an interactive webservice implementing this algorithm. Compared with our previous method, the new algorithm is up to 4 orders of magnitude faster, while returning qualitatively identical results. Because of this significant improvement in scalability, the new algorithm can now identify substantially more complex and compute-intensive rogue taxon constellations. On a large and diverse collection of real-world data sets, we show that our method yields better supported reduced/pruned consensus trees than any competing rogue taxon identification method. Using the parallel version of our open-source code, we successfully identified rogue taxa in a set of 100 trees with 116 334 taxa each. For simulated data sets, we show that when removing/pruning rogue taxa with our method from a tree set, we consistently obtain bootstrap consensus trees as well as maximum-likelihood trees that are topologically closer to the respective true trees. Oxford University Press 2013-01 2012-11-08 /pmc/articles/PMC3526802/ /pubmed/22962004 http://dx.doi.org/10.1093/sysbio/sys078 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software for Systematics and Evolution
Aberer, Andre J.
Krompass, Denis
Stamatakis, Alexandros
Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice
title Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice
title_full Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice
title_fullStr Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice
title_full_unstemmed Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice
title_short Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice
title_sort pruning rogue taxa improves phylogenetic accuracy: an efficient algorithm and webservice
topic Software for Systematics and Evolution
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526802/
https://www.ncbi.nlm.nih.gov/pubmed/22962004
http://dx.doi.org/10.1093/sysbio/sys078
work_keys_str_mv AT abererandrej pruningroguetaxaimprovesphylogeneticaccuracyanefficientalgorithmandwebservice
AT krompassdenis pruningroguetaxaimprovesphylogeneticaccuracyanefficientalgorithmandwebservice
AT stamatakisalexandros pruningroguetaxaimprovesphylogeneticaccuracyanefficientalgorithmandwebservice