Cargando…

An improved probability mapping approach to assess genome mosaicism

BACKGROUND: Maximum likelihood and posterior probability mapping are useful visualization techniques that are used to ascertain the mosaic nature of prokaryotic genomes. However, posterior probabilities, especially when calculated for four-taxon cases, tend to overestimate the support for tree topol...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhaxybayeva, Olga, Gogarten, J Peter
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2003
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC222983/
https://www.ncbi.nlm.nih.gov/pubmed/12974984
http://dx.doi.org/10.1186/1471-2164-4-37
_version_ 1782120977615814656
author Zhaxybayeva, Olga
Gogarten, J Peter
author_facet Zhaxybayeva, Olga
Gogarten, J Peter
author_sort Zhaxybayeva, Olga
collection PubMed
description BACKGROUND: Maximum likelihood and posterior probability mapping are useful visualization techniques that are used to ascertain the mosaic nature of prokaryotic genomes. However, posterior probabilities, especially when calculated for four-taxon cases, tend to overestimate the support for tree topologies. Furthermore, because of poor taxon sampling four-taxon analyses suffer from sensitivity to the long branch attraction artifact. Here we extend the probability mapping approach by improving taxon sampling of the analyzed datasets, and by using bootstrap support values, a more conservative tool to assess reliability. RESULTS: Quartets of orthologous proteins were complemented with homologs from selected reference genomes. The mapping of bootstrap support values from these extended datasets gives results similar to the original maximum likelihood and posterior probability mapping. The more conservative nature of the plotted support values allows to focus further analyses on those protein families that strongly disagree with the majority or plurality of genes present in the analyzed genomes. CONCLUSION: Posterior probability is a non-conservative measure for support, and posterior probability mapping only provides a quick estimation of phylogenetic information content of four genomes. This approach can be utilized as a pre-screen to select genes that might have been horizontally transferred. Better taxon sampling combined with subtree analyses prevents the inconsistencies associated with four-taxon analyses, but retains the power of visual representation. Nevertheless, a case-by-case inspection of individual multi-taxon phylogenies remains necessary to differentiate unrecognized paralogy and shared phylogenetic reconstruction artifacts from horizontal gene transfer events.
format Text
id pubmed-222983
institution National Center for Biotechnology Information
language English
publishDate 2003
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2229832003-10-24 An improved probability mapping approach to assess genome mosaicism Zhaxybayeva, Olga Gogarten, J Peter BMC Genomics Methodology Article BACKGROUND: Maximum likelihood and posterior probability mapping are useful visualization techniques that are used to ascertain the mosaic nature of prokaryotic genomes. However, posterior probabilities, especially when calculated for four-taxon cases, tend to overestimate the support for tree topologies. Furthermore, because of poor taxon sampling four-taxon analyses suffer from sensitivity to the long branch attraction artifact. Here we extend the probability mapping approach by improving taxon sampling of the analyzed datasets, and by using bootstrap support values, a more conservative tool to assess reliability. RESULTS: Quartets of orthologous proteins were complemented with homologs from selected reference genomes. The mapping of bootstrap support values from these extended datasets gives results similar to the original maximum likelihood and posterior probability mapping. The more conservative nature of the plotted support values allows to focus further analyses on those protein families that strongly disagree with the majority or plurality of genes present in the analyzed genomes. CONCLUSION: Posterior probability is a non-conservative measure for support, and posterior probability mapping only provides a quick estimation of phylogenetic information content of four genomes. This approach can be utilized as a pre-screen to select genes that might have been horizontally transferred. Better taxon sampling combined with subtree analyses prevents the inconsistencies associated with four-taxon analyses, but retains the power of visual representation. Nevertheless, a case-by-case inspection of individual multi-taxon phylogenies remains necessary to differentiate unrecognized paralogy and shared phylogenetic reconstruction artifacts from horizontal gene transfer events. BioMed Central 2003-09-15 /pmc/articles/PMC222983/ /pubmed/12974984 http://dx.doi.org/10.1186/1471-2164-4-37 Text en Copyright © 2003 Zhaxybayeva and Gogarten; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Methodology Article
Zhaxybayeva, Olga
Gogarten, J Peter
An improved probability mapping approach to assess genome mosaicism
title An improved probability mapping approach to assess genome mosaicism
title_full An improved probability mapping approach to assess genome mosaicism
title_fullStr An improved probability mapping approach to assess genome mosaicism
title_full_unstemmed An improved probability mapping approach to assess genome mosaicism
title_short An improved probability mapping approach to assess genome mosaicism
title_sort improved probability mapping approach to assess genome mosaicism
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC222983/
https://www.ncbi.nlm.nih.gov/pubmed/12974984
http://dx.doi.org/10.1186/1471-2164-4-37
work_keys_str_mv AT zhaxybayevaolga animprovedprobabilitymappingapproachtoassessgenomemosaicism
AT gogartenjpeter animprovedprobabilitymappingapproachtoassessgenomemosaicism
AT zhaxybayevaolga improvedprobabilitymappingapproachtoassessgenomemosaicism
AT gogartenjpeter improvedprobabilitymappingapproachtoassessgenomemosaicism