Cargando…

Polytomy identification in microbial phylogenetic reconstruction

BACKGROUND: A phylogenetic tree, showing ancestral relations among organisms, is commonly represented as a rooted tree with sets of bifurcating branches (dichotomies) for simplicity, although polytomies (multifurcating branches) may reflect more accurate evolutionary relationships. To represent the...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Guan Ning, Zhang, Chao, Xu, Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287570/
https://www.ncbi.nlm.nih.gov/pubmed/22784621
http://dx.doi.org/10.1186/1752-0509-5-S3-S2
_version_ 1782224693560868864
author Lin, Guan Ning
Zhang, Chao
Xu, Dong
author_facet Lin, Guan Ning
Zhang, Chao
Xu, Dong
author_sort Lin, Guan Ning
collection PubMed
description BACKGROUND: A phylogenetic tree, showing ancestral relations among organisms, is commonly represented as a rooted tree with sets of bifurcating branches (dichotomies) for simplicity, although polytomies (multifurcating branches) may reflect more accurate evolutionary relationships. To represent the true evolutionary relationships, it is important to systematically identify the polytomies from a bifurcating tree and generate a taxonomy-compatible multifurcating tree. For this purpose we propose a novel approach, "PolyPhy", which would classify a set of bifurcating branches of a phylogenetic tree into a set of branches with dichotomies and polytomies by considering genome distances among genomes and tree topological properties. RESULTS: PolyPhy employs a machine learning technique, BLR (Bayesian logistic regression) classifier, to identify possible bifurcating subtrees as polytomies from the trees resulted from ComPhy. Other than considering genome-scale distances between all pairs of species, PolyPhy also takes into account different properties of tree topology between dichotomy and polytomy, such as long-branch retraction and short-branch contraction, and quantifies these properties into comparable rates among different sub-branches. We extract three tree topological features, 'LR' (Leaf rate), 'IntraR' (Intra-subset branch rate) and 'InterR' (Inter-subset branch rate), all of which are calculated from bifurcating tree branch sets for classification. We have achieved F-measure (balanced measure between precision and recall) of 81% with about 0.9 area under the curve (AUC) of ROC. CONCLUSIONS: PolyPhy is a fast and robust method to identify polytomies from phylogenetic trees based on genome-wide inference of evolutionary relationships among genomes. The software package and test data can be downloaded from http://digbio.missouri.edu/ComPhy/phyloTreeBiNonBi-1.0.zip.
format Online
Article
Text
id pubmed-3287570
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32875702012-03-01 Polytomy identification in microbial phylogenetic reconstruction Lin, Guan Ning Zhang, Chao Xu, Dong BMC Syst Biol Research Article BACKGROUND: A phylogenetic tree, showing ancestral relations among organisms, is commonly represented as a rooted tree with sets of bifurcating branches (dichotomies) for simplicity, although polytomies (multifurcating branches) may reflect more accurate evolutionary relationships. To represent the true evolutionary relationships, it is important to systematically identify the polytomies from a bifurcating tree and generate a taxonomy-compatible multifurcating tree. For this purpose we propose a novel approach, "PolyPhy", which would classify a set of bifurcating branches of a phylogenetic tree into a set of branches with dichotomies and polytomies by considering genome distances among genomes and tree topological properties. RESULTS: PolyPhy employs a machine learning technique, BLR (Bayesian logistic regression) classifier, to identify possible bifurcating subtrees as polytomies from the trees resulted from ComPhy. Other than considering genome-scale distances between all pairs of species, PolyPhy also takes into account different properties of tree topology between dichotomy and polytomy, such as long-branch retraction and short-branch contraction, and quantifies these properties into comparable rates among different sub-branches. We extract three tree topological features, 'LR' (Leaf rate), 'IntraR' (Intra-subset branch rate) and 'InterR' (Inter-subset branch rate), all of which are calculated from bifurcating tree branch sets for classification. We have achieved F-measure (balanced measure between precision and recall) of 81% with about 0.9 area under the curve (AUC) of ROC. CONCLUSIONS: PolyPhy is a fast and robust method to identify polytomies from phylogenetic trees based on genome-wide inference of evolutionary relationships among genomes. The software package and test data can be downloaded from http://digbio.missouri.edu/ComPhy/phyloTreeBiNonBi-1.0.zip. BioMed Central 2011-12-23 /pmc/articles/PMC3287570/ /pubmed/22784621 http://dx.doi.org/10.1186/1752-0509-5-S3-S2 Text en Copyright ©2011 Lin et al. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lin, Guan Ning
Zhang, Chao
Xu, Dong
Polytomy identification in microbial phylogenetic reconstruction
title Polytomy identification in microbial phylogenetic reconstruction
title_full Polytomy identification in microbial phylogenetic reconstruction
title_fullStr Polytomy identification in microbial phylogenetic reconstruction
title_full_unstemmed Polytomy identification in microbial phylogenetic reconstruction
title_short Polytomy identification in microbial phylogenetic reconstruction
title_sort polytomy identification in microbial phylogenetic reconstruction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287570/
https://www.ncbi.nlm.nih.gov/pubmed/22784621
http://dx.doi.org/10.1186/1752-0509-5-S3-S2
work_keys_str_mv AT linguanning polytomyidentificationinmicrobialphylogeneticreconstruction
AT zhangchao polytomyidentificationinmicrobialphylogeneticreconstruction
AT xudong polytomyidentificationinmicrobialphylogeneticreconstruction