Cargando…

Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees

The widely used model for evolutionary relationships is a bifurcating tree with all taxa/observations placed at the leaves. This is not appropriate if the taxa have been densely sampled across evolutionary time and may be in a direct ancestral relationship, or if there is not enough information to f...

Descripción completa

Detalles Bibliográficos
Autores principales: Kalaghatgi, Prabhav, Pfeifer, Nico, Lengauer, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5026249/
https://www.ncbi.nlm.nih.gov/pubmed/27436007
http://dx.doi.org/10.1093/molbev/msw123
_version_ 1782454098388320256
author Kalaghatgi, Prabhav
Pfeifer, Nico
Lengauer, Thomas
author_facet Kalaghatgi, Prabhav
Pfeifer, Nico
Lengauer, Thomas
author_sort Kalaghatgi, Prabhav
collection PubMed
description The widely used model for evolutionary relationships is a bifurcating tree with all taxa/observations placed at the leaves. This is not appropriate if the taxa have been densely sampled across evolutionary time and may be in a direct ancestral relationship, or if there is not enough information to fully resolve all the branching points in the evolutionary tree. In this article, we present a fast distance-based agglomeration method called family-joining (FJ) for constructing so-called generally labeled trees in which taxa may be placed at internal vertices and the tree may contain polytomies. FJ constructs such trees on the basis of pairwise distances and a distance threshold. We tested three methods for threshold selection, FJ-AIC, FJ-BIC, and FJ-CV, which minimize Akaike information criterion, Bayesian information criterion, and cross-validation error, respectively. When compared with related methods on simulated data, FJ-BIC was among the best at reconstructing the correct tree across a wide range of simulation scenarios. FJ-BIC was applied to HIV sequences sampled from individuals involved in a known transmission chain. The FJ-BIC tree was found to be compatible with almost all transmission events. On average, internal branches in the FJ-BIC tree have higher bootstrap support than branches in the leaf-labeled bifurcating tree constructed using RAxML. 36% and 25% of the internal branches in the FJ-BIC tree and RAxML tree, respectively, have bootstrap support greater than 70%. To the best of our knowledge the method presented here is the first attempt at modeling evolutionary relationships using generally labeled trees.
format Online
Article
Text
id pubmed-5026249
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-50262492016-09-20 Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees Kalaghatgi, Prabhav Pfeifer, Nico Lengauer, Thomas Mol Biol Evol Methods The widely used model for evolutionary relationships is a bifurcating tree with all taxa/observations placed at the leaves. This is not appropriate if the taxa have been densely sampled across evolutionary time and may be in a direct ancestral relationship, or if there is not enough information to fully resolve all the branching points in the evolutionary tree. In this article, we present a fast distance-based agglomeration method called family-joining (FJ) for constructing so-called generally labeled trees in which taxa may be placed at internal vertices and the tree may contain polytomies. FJ constructs such trees on the basis of pairwise distances and a distance threshold. We tested three methods for threshold selection, FJ-AIC, FJ-BIC, and FJ-CV, which minimize Akaike information criterion, Bayesian information criterion, and cross-validation error, respectively. When compared with related methods on simulated data, FJ-BIC was among the best at reconstructing the correct tree across a wide range of simulation scenarios. FJ-BIC was applied to HIV sequences sampled from individuals involved in a known transmission chain. The FJ-BIC tree was found to be compatible with almost all transmission events. On average, internal branches in the FJ-BIC tree have higher bootstrap support than branches in the leaf-labeled bifurcating tree constructed using RAxML. 36% and 25% of the internal branches in the FJ-BIC tree and RAxML tree, respectively, have bootstrap support greater than 70%. To the best of our knowledge the method presented here is the first attempt at modeling evolutionary relationships using generally labeled trees. Oxford University Press 2016-10 2016-07-19 /pmc/articles/PMC5026249/ /pubmed/27436007 http://dx.doi.org/10.1093/molbev/msw123 Text en © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Kalaghatgi, Prabhav
Pfeifer, Nico
Lengauer, Thomas
Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees
title Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees
title_full Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees
title_fullStr Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees
title_full_unstemmed Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees
title_short Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees
title_sort family-joining: a fast distance-based method for constructing generally labeled trees
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5026249/
https://www.ncbi.nlm.nih.gov/pubmed/27436007
http://dx.doi.org/10.1093/molbev/msw123
work_keys_str_mv AT kalaghatgiprabhav familyjoiningafastdistancebasedmethodforconstructinggenerallylabeledtrees
AT pfeifernico familyjoiningafastdistancebasedmethodforconstructinggenerallylabeledtrees
AT lengauerthomas familyjoiningafastdistancebasedmethodforconstructinggenerallylabeledtrees