Cargando…
Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice
The presence of rogue taxa (rogues) in a set of trees can frequently have a negative impact on the results of a bootstrap analysis (e.g., the overall support in consensus trees). We introduce an efficient graph-based algorithm for rogue taxon identification as well as an interactive webservice imple...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526802/ https://www.ncbi.nlm.nih.gov/pubmed/22962004 http://dx.doi.org/10.1093/sysbio/sys078 |
_version_ | 1782253626417217536 |
---|---|
author | Aberer, Andre J. Krompass, Denis Stamatakis, Alexandros |
author_facet | Aberer, Andre J. Krompass, Denis Stamatakis, Alexandros |
author_sort | Aberer, Andre J. |
collection | PubMed |
description | The presence of rogue taxa (rogues) in a set of trees can frequently have a negative impact on the results of a bootstrap analysis (e.g., the overall support in consensus trees). We introduce an efficient graph-based algorithm for rogue taxon identification as well as an interactive webservice implementing this algorithm. Compared with our previous method, the new algorithm is up to 4 orders of magnitude faster, while returning qualitatively identical results. Because of this significant improvement in scalability, the new algorithm can now identify substantially more complex and compute-intensive rogue taxon constellations. On a large and diverse collection of real-world data sets, we show that our method yields better supported reduced/pruned consensus trees than any competing rogue taxon identification method. Using the parallel version of our open-source code, we successfully identified rogue taxa in a set of 100 trees with 116 334 taxa each. For simulated data sets, we show that when removing/pruning rogue taxa with our method from a tree set, we consistently obtain bootstrap consensus trees as well as maximum-likelihood trees that are topologically closer to the respective true trees. |
format | Online Article Text |
id | pubmed-3526802 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-35268022012-12-20 Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice Aberer, Andre J. Krompass, Denis Stamatakis, Alexandros Syst Biol Software for Systematics and Evolution The presence of rogue taxa (rogues) in a set of trees can frequently have a negative impact on the results of a bootstrap analysis (e.g., the overall support in consensus trees). We introduce an efficient graph-based algorithm for rogue taxon identification as well as an interactive webservice implementing this algorithm. Compared with our previous method, the new algorithm is up to 4 orders of magnitude faster, while returning qualitatively identical results. Because of this significant improvement in scalability, the new algorithm can now identify substantially more complex and compute-intensive rogue taxon constellations. On a large and diverse collection of real-world data sets, we show that our method yields better supported reduced/pruned consensus trees than any competing rogue taxon identification method. Using the parallel version of our open-source code, we successfully identified rogue taxa in a set of 100 trees with 116 334 taxa each. For simulated data sets, we show that when removing/pruning rogue taxa with our method from a tree set, we consistently obtain bootstrap consensus trees as well as maximum-likelihood trees that are topologically closer to the respective true trees. Oxford University Press 2013-01 2012-11-08 /pmc/articles/PMC3526802/ /pubmed/22962004 http://dx.doi.org/10.1093/sysbio/sys078 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software for Systematics and Evolution Aberer, Andre J. Krompass, Denis Stamatakis, Alexandros Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice |
title | Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice |
title_full | Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice |
title_fullStr | Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice |
title_full_unstemmed | Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice |
title_short | Pruning Rogue Taxa Improves Phylogenetic Accuracy: An Efficient Algorithm and Webservice |
title_sort | pruning rogue taxa improves phylogenetic accuracy: an efficient algorithm and webservice |
topic | Software for Systematics and Evolution |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526802/ https://www.ncbi.nlm.nih.gov/pubmed/22962004 http://dx.doi.org/10.1093/sysbio/sys078 |
work_keys_str_mv | AT abererandrej pruningroguetaxaimprovesphylogeneticaccuracyanefficientalgorithmandwebservice AT krompassdenis pruningroguetaxaimprovesphylogeneticaccuracyanefficientalgorithmandwebservice AT stamatakisalexandros pruningroguetaxaimprovesphylogeneticaccuracyanefficientalgorithmandwebservice |