Cargando…
Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees
The widely used model for evolutionary relationships is a bifurcating tree with all taxa/observations placed at the leaves. This is not appropriate if the taxa have been densely sampled across evolutionary time and may be in a direct ancestral relationship, or if there is not enough information to f...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5026249/ https://www.ncbi.nlm.nih.gov/pubmed/27436007 http://dx.doi.org/10.1093/molbev/msw123 |
_version_ | 1782454098388320256 |
---|---|
author | Kalaghatgi, Prabhav Pfeifer, Nico Lengauer, Thomas |
author_facet | Kalaghatgi, Prabhav Pfeifer, Nico Lengauer, Thomas |
author_sort | Kalaghatgi, Prabhav |
collection | PubMed |
description | The widely used model for evolutionary relationships is a bifurcating tree with all taxa/observations placed at the leaves. This is not appropriate if the taxa have been densely sampled across evolutionary time and may be in a direct ancestral relationship, or if there is not enough information to fully resolve all the branching points in the evolutionary tree. In this article, we present a fast distance-based agglomeration method called family-joining (FJ) for constructing so-called generally labeled trees in which taxa may be placed at internal vertices and the tree may contain polytomies. FJ constructs such trees on the basis of pairwise distances and a distance threshold. We tested three methods for threshold selection, FJ-AIC, FJ-BIC, and FJ-CV, which minimize Akaike information criterion, Bayesian information criterion, and cross-validation error, respectively. When compared with related methods on simulated data, FJ-BIC was among the best at reconstructing the correct tree across a wide range of simulation scenarios. FJ-BIC was applied to HIV sequences sampled from individuals involved in a known transmission chain. The FJ-BIC tree was found to be compatible with almost all transmission events. On average, internal branches in the FJ-BIC tree have higher bootstrap support than branches in the leaf-labeled bifurcating tree constructed using RAxML. 36% and 25% of the internal branches in the FJ-BIC tree and RAxML tree, respectively, have bootstrap support greater than 70%. To the best of our knowledge the method presented here is the first attempt at modeling evolutionary relationships using generally labeled trees. |
format | Online Article Text |
id | pubmed-5026249 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-50262492016-09-20 Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees Kalaghatgi, Prabhav Pfeifer, Nico Lengauer, Thomas Mol Biol Evol Methods The widely used model for evolutionary relationships is a bifurcating tree with all taxa/observations placed at the leaves. This is not appropriate if the taxa have been densely sampled across evolutionary time and may be in a direct ancestral relationship, or if there is not enough information to fully resolve all the branching points in the evolutionary tree. In this article, we present a fast distance-based agglomeration method called family-joining (FJ) for constructing so-called generally labeled trees in which taxa may be placed at internal vertices and the tree may contain polytomies. FJ constructs such trees on the basis of pairwise distances and a distance threshold. We tested three methods for threshold selection, FJ-AIC, FJ-BIC, and FJ-CV, which minimize Akaike information criterion, Bayesian information criterion, and cross-validation error, respectively. When compared with related methods on simulated data, FJ-BIC was among the best at reconstructing the correct tree across a wide range of simulation scenarios. FJ-BIC was applied to HIV sequences sampled from individuals involved in a known transmission chain. The FJ-BIC tree was found to be compatible with almost all transmission events. On average, internal branches in the FJ-BIC tree have higher bootstrap support than branches in the leaf-labeled bifurcating tree constructed using RAxML. 36% and 25% of the internal branches in the FJ-BIC tree and RAxML tree, respectively, have bootstrap support greater than 70%. To the best of our knowledge the method presented here is the first attempt at modeling evolutionary relationships using generally labeled trees. Oxford University Press 2016-10 2016-07-19 /pmc/articles/PMC5026249/ /pubmed/27436007 http://dx.doi.org/10.1093/molbev/msw123 Text en © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Kalaghatgi, Prabhav Pfeifer, Nico Lengauer, Thomas Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees |
title | Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees |
title_full | Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees |
title_fullStr | Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees |
title_full_unstemmed | Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees |
title_short | Family-Joining: A Fast Distance-Based Method for Constructing Generally Labeled Trees |
title_sort | family-joining: a fast distance-based method for constructing generally labeled trees |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5026249/ https://www.ncbi.nlm.nih.gov/pubmed/27436007 http://dx.doi.org/10.1093/molbev/msw123 |
work_keys_str_mv | AT kalaghatgiprabhav familyjoiningafastdistancebasedmethodforconstructinggenerallylabeledtrees AT pfeifernico familyjoiningafastdistancebasedmethodforconstructinggenerallylabeledtrees AT lengauerthomas familyjoiningafastdistancebasedmethodforconstructinggenerallylabeledtrees |