Cargando…
Polytomy identification in microbial phylogenetic reconstruction
BACKGROUND: A phylogenetic tree, showing ancestral relations among organisms, is commonly represented as a rooted tree with sets of bifurcating branches (dichotomies) for simplicity, although polytomies (multifurcating branches) may reflect more accurate evolutionary relationships. To represent the...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287570/ https://www.ncbi.nlm.nih.gov/pubmed/22784621 http://dx.doi.org/10.1186/1752-0509-5-S3-S2 |
_version_ | 1782224693560868864 |
---|---|
author | Lin, Guan Ning Zhang, Chao Xu, Dong |
author_facet | Lin, Guan Ning Zhang, Chao Xu, Dong |
author_sort | Lin, Guan Ning |
collection | PubMed |
description | BACKGROUND: A phylogenetic tree, showing ancestral relations among organisms, is commonly represented as a rooted tree with sets of bifurcating branches (dichotomies) for simplicity, although polytomies (multifurcating branches) may reflect more accurate evolutionary relationships. To represent the true evolutionary relationships, it is important to systematically identify the polytomies from a bifurcating tree and generate a taxonomy-compatible multifurcating tree. For this purpose we propose a novel approach, "PolyPhy", which would classify a set of bifurcating branches of a phylogenetic tree into a set of branches with dichotomies and polytomies by considering genome distances among genomes and tree topological properties. RESULTS: PolyPhy employs a machine learning technique, BLR (Bayesian logistic regression) classifier, to identify possible bifurcating subtrees as polytomies from the trees resulted from ComPhy. Other than considering genome-scale distances between all pairs of species, PolyPhy also takes into account different properties of tree topology between dichotomy and polytomy, such as long-branch retraction and short-branch contraction, and quantifies these properties into comparable rates among different sub-branches. We extract three tree topological features, 'LR' (Leaf rate), 'IntraR' (Intra-subset branch rate) and 'InterR' (Inter-subset branch rate), all of which are calculated from bifurcating tree branch sets for classification. We have achieved F-measure (balanced measure between precision and recall) of 81% with about 0.9 area under the curve (AUC) of ROC. CONCLUSIONS: PolyPhy is a fast and robust method to identify polytomies from phylogenetic trees based on genome-wide inference of evolutionary relationships among genomes. The software package and test data can be downloaded from http://digbio.missouri.edu/ComPhy/phyloTreeBiNonBi-1.0.zip. |
format | Online Article Text |
id | pubmed-3287570 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32875702012-03-01 Polytomy identification in microbial phylogenetic reconstruction Lin, Guan Ning Zhang, Chao Xu, Dong BMC Syst Biol Research Article BACKGROUND: A phylogenetic tree, showing ancestral relations among organisms, is commonly represented as a rooted tree with sets of bifurcating branches (dichotomies) for simplicity, although polytomies (multifurcating branches) may reflect more accurate evolutionary relationships. To represent the true evolutionary relationships, it is important to systematically identify the polytomies from a bifurcating tree and generate a taxonomy-compatible multifurcating tree. For this purpose we propose a novel approach, "PolyPhy", which would classify a set of bifurcating branches of a phylogenetic tree into a set of branches with dichotomies and polytomies by considering genome distances among genomes and tree topological properties. RESULTS: PolyPhy employs a machine learning technique, BLR (Bayesian logistic regression) classifier, to identify possible bifurcating subtrees as polytomies from the trees resulted from ComPhy. Other than considering genome-scale distances between all pairs of species, PolyPhy also takes into account different properties of tree topology between dichotomy and polytomy, such as long-branch retraction and short-branch contraction, and quantifies these properties into comparable rates among different sub-branches. We extract three tree topological features, 'LR' (Leaf rate), 'IntraR' (Intra-subset branch rate) and 'InterR' (Inter-subset branch rate), all of which are calculated from bifurcating tree branch sets for classification. We have achieved F-measure (balanced measure between precision and recall) of 81% with about 0.9 area under the curve (AUC) of ROC. CONCLUSIONS: PolyPhy is a fast and robust method to identify polytomies from phylogenetic trees based on genome-wide inference of evolutionary relationships among genomes. The software package and test data can be downloaded from http://digbio.missouri.edu/ComPhy/phyloTreeBiNonBi-1.0.zip. BioMed Central 2011-12-23 /pmc/articles/PMC3287570/ /pubmed/22784621 http://dx.doi.org/10.1186/1752-0509-5-S3-S2 Text en Copyright ©2011 Lin et al. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Lin, Guan Ning Zhang, Chao Xu, Dong Polytomy identification in microbial phylogenetic reconstruction |
title | Polytomy identification in microbial phylogenetic reconstruction |
title_full | Polytomy identification in microbial phylogenetic reconstruction |
title_fullStr | Polytomy identification in microbial phylogenetic reconstruction |
title_full_unstemmed | Polytomy identification in microbial phylogenetic reconstruction |
title_short | Polytomy identification in microbial phylogenetic reconstruction |
title_sort | polytomy identification in microbial phylogenetic reconstruction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287570/ https://www.ncbi.nlm.nih.gov/pubmed/22784621 http://dx.doi.org/10.1186/1752-0509-5-S3-S2 |
work_keys_str_mv | AT linguanning polytomyidentificationinmicrobialphylogeneticreconstruction AT zhangchao polytomyidentificationinmicrobialphylogeneticreconstruction AT xudong polytomyidentificationinmicrobialphylogeneticreconstruction |