Cargando…
Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees im...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5553649/ https://www.ncbi.nlm.nih.gov/pubmed/28800608 http://dx.doi.org/10.1371/journal.pone.0182238 |
_version_ | 1783256651134926848 |
---|---|
author | Mai, Uyen Sayyari, Erfan Mirarab, Siavash |
author_facet | Mai, Uyen Sayyari, Erfan Mirarab, Siavash |
author_sort | Mai, Uyen |
collection | PubMed |
description | Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods. |
format | Online Article Text |
id | pubmed-5553649 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-55536492017-08-25 Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction Mai, Uyen Sayyari, Erfan Mirarab, Siavash PLoS One Research Article Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods. Public Library of Science 2017-08-11 /pmc/articles/PMC5553649/ /pubmed/28800608 http://dx.doi.org/10.1371/journal.pone.0182238 Text en © 2017 Mai et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Mai, Uyen Sayyari, Erfan Mirarab, Siavash Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction |
title | Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction |
title_full | Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction |
title_fullStr | Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction |
title_full_unstemmed | Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction |
title_short | Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction |
title_sort | minimum variance rooting of phylogenetic trees and implications for species tree reconstruction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5553649/ https://www.ncbi.nlm.nih.gov/pubmed/28800608 http://dx.doi.org/10.1371/journal.pone.0182238 |
work_keys_str_mv | AT maiuyen minimumvariancerootingofphylogenetictreesandimplicationsforspeciestreereconstruction AT sayyarierfan minimumvariancerootingofphylogenetictreesandimplicationsforspeciestreereconstruction AT mirarabsiavash minimumvariancerootingofphylogenetictreesandimplicationsforspeciestreereconstruction |