Cargando…

Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction

Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees im...

Descripción completa

Detalles Bibliográficos
Autores principales: Mai, Uyen, Sayyari, Erfan, Mirarab, Siavash
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5553649/
https://www.ncbi.nlm.nih.gov/pubmed/28800608
http://dx.doi.org/10.1371/journal.pone.0182238
_version_ 1783256651134926848
author Mai, Uyen
Sayyari, Erfan
Mirarab, Siavash
author_facet Mai, Uyen
Sayyari, Erfan
Mirarab, Siavash
author_sort Mai, Uyen
collection PubMed
description Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods.
format Online
Article
Text
id pubmed-5553649
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55536492017-08-25 Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction Mai, Uyen Sayyari, Erfan Mirarab, Siavash PLoS One Research Article Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods. Public Library of Science 2017-08-11 /pmc/articles/PMC5553649/ /pubmed/28800608 http://dx.doi.org/10.1371/journal.pone.0182238 Text en © 2017 Mai et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mai, Uyen
Sayyari, Erfan
Mirarab, Siavash
Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
title Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
title_full Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
title_fullStr Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
title_full_unstemmed Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
title_short Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
title_sort minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5553649/
https://www.ncbi.nlm.nih.gov/pubmed/28800608
http://dx.doi.org/10.1371/journal.pone.0182238
work_keys_str_mv AT maiuyen minimumvariancerootingofphylogenetictreesandimplicationsforspeciestreereconstruction
AT sayyarierfan minimumvariancerootingofphylogenetictreesandimplicationsforspeciestreereconstruction
AT mirarabsiavash minimumvariancerootingofphylogenetictreesandimplicationsforspeciestreereconstruction