Cargando…

Supertrees Based on the Subtree Prune-and-Regraft Distance

Supertree methods reconcile a set of phylogenetic trees into a single structure that is often interpreted as a branching history of species. A key challenge is combining conflicting evolutionary histories that are due to artifacts of phylogenetic reconstruction and phenomena such as lateral gene tra...

Descripción completa

Detalles Bibliográficos
Autores principales: Whidden, Christopher, Zeh, Norbert, Beiko, Robert G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4055872/
https://www.ncbi.nlm.nih.gov/pubmed/24695589
http://dx.doi.org/10.1093/sysbio/syu023
_version_ 1782320742848790528
author Whidden, Christopher
Zeh, Norbert
Beiko, Robert G.
author_facet Whidden, Christopher
Zeh, Norbert
Beiko, Robert G.
author_sort Whidden, Christopher
collection PubMed
description Supertree methods reconcile a set of phylogenetic trees into a single structure that is often interpreted as a branching history of species. A key challenge is combining conflicting evolutionary histories that are due to artifacts of phylogenetic reconstruction and phenomena such as lateral gene transfer (LGT). Many supertree approaches use optimality criteria that do not reflect underlying processes, have known biases, and may be unduly influenced by LGT. We present the first method to construct supertrees by using the subtree prune-and-regraft (SPR) distance as an optimality criterion. Although calculating the rooted SPR distance between a pair of trees is NP-hard, our new maximum agreement forest-based methods can reconcile trees with hundreds of taxa and > 50 transfers in fractions of a second, which enables repeated calculations during the course of an iterative search. Our approach can accommodate trees in which uncertain relationships have been collapsed to multifurcating nodes. Using a series of benchmark datasets simulated under plausible rates of LGT, we show that SPR supertrees are more similar to correct species histories than supertrees based on parsimony or Robinson–Foulds distance criteria. We successfully constructed an SPR supertree from a phylogenomic dataset of 40,631 gene trees that covered 244 genomes representing several major bacterial phyla. Our SPR-based approach also allowed direct inference of highways of gene transfer between bacterial classes and genera. A Small number of these highways connect genera in different phyla and can highlight specific genes implicated in long-distance LGT. [Lateral gene transfer; matrix representation with parsimony; phylogenomics; prokaryotic phylogeny; Robinson–Foulds; subtree prune-and-regraft; supertrees.]
format Online
Article
Text
id pubmed-4055872
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40558722014-06-13 Supertrees Based on the Subtree Prune-and-Regraft Distance Whidden, Christopher Zeh, Norbert Beiko, Robert G. Syst Biol Regular Articles Supertree methods reconcile a set of phylogenetic trees into a single structure that is often interpreted as a branching history of species. A key challenge is combining conflicting evolutionary histories that are due to artifacts of phylogenetic reconstruction and phenomena such as lateral gene transfer (LGT). Many supertree approaches use optimality criteria that do not reflect underlying processes, have known biases, and may be unduly influenced by LGT. We present the first method to construct supertrees by using the subtree prune-and-regraft (SPR) distance as an optimality criterion. Although calculating the rooted SPR distance between a pair of trees is NP-hard, our new maximum agreement forest-based methods can reconcile trees with hundreds of taxa and > 50 transfers in fractions of a second, which enables repeated calculations during the course of an iterative search. Our approach can accommodate trees in which uncertain relationships have been collapsed to multifurcating nodes. Using a series of benchmark datasets simulated under plausible rates of LGT, we show that SPR supertrees are more similar to correct species histories than supertrees based on parsimony or Robinson–Foulds distance criteria. We successfully constructed an SPR supertree from a phylogenomic dataset of 40,631 gene trees that covered 244 genomes representing several major bacterial phyla. Our SPR-based approach also allowed direct inference of highways of gene transfer between bacterial classes and genera. A Small number of these highways connect genera in different phyla and can highlight specific genes implicated in long-distance LGT. [Lateral gene transfer; matrix representation with parsimony; phylogenomics; prokaryotic phylogeny; Robinson–Foulds; subtree prune-and-regraft; supertrees.] Oxford University Press 2014-07 2014-04-02 /pmc/articles/PMC4055872/ /pubmed/24695589 http://dx.doi.org/10.1093/sysbio/syu023 Text en © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regular Articles
Whidden, Christopher
Zeh, Norbert
Beiko, Robert G.
Supertrees Based on the Subtree Prune-and-Regraft Distance
title Supertrees Based on the Subtree Prune-and-Regraft Distance
title_full Supertrees Based on the Subtree Prune-and-Regraft Distance
title_fullStr Supertrees Based on the Subtree Prune-and-Regraft Distance
title_full_unstemmed Supertrees Based on the Subtree Prune-and-Regraft Distance
title_short Supertrees Based on the Subtree Prune-and-Regraft Distance
title_sort supertrees based on the subtree prune-and-regraft distance
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4055872/
https://www.ncbi.nlm.nih.gov/pubmed/24695589
http://dx.doi.org/10.1093/sysbio/syu023
work_keys_str_mv AT whiddenchristopher supertreesbasedonthesubtreepruneandregraftdistance
AT zehnorbert supertreesbasedonthesubtreepruneandregraftdistance
AT beikorobertg supertreesbasedonthesubtreepruneandregraftdistance