Cargando…

Concatenated alignments and the case of the disappearing tree

BACKGROUND: Analyzed individually, gene trees for a given taxon set tend to harbour incongruent or conflicting signals. One popular approach to deal with this circumstance is to use concatenated data. But especially in prokaryotes, where lateral gene transfer (LGT) is a natural mechanism of generati...

Descripción completa

Detalles Bibliográficos
Autores principales: Thiergart, Thorsten, Landan, Giddy, Martin, William F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4302582/
https://www.ncbi.nlm.nih.gov/pubmed/25547755
http://dx.doi.org/10.1186/s12862-014-0266-0
_version_ 1782353829376819200
author Thiergart, Thorsten
Landan, Giddy
Martin, William F
author_facet Thiergart, Thorsten
Landan, Giddy
Martin, William F
author_sort Thiergart, Thorsten
collection PubMed
description BACKGROUND: Analyzed individually, gene trees for a given taxon set tend to harbour incongruent or conflicting signals. One popular approach to deal with this circumstance is to use concatenated data. But especially in prokaryotes, where lateral gene transfer (LGT) is a natural mechanism of generating genetic diversity, there are open questions as to whether concatenation amplifies or averages phylogenetic signals residing in individual genes. Here we investigate concatenations of prokaryotic and eukaryotic datasets to investigate possible sources of incongruence in phylogenetic trees and to examine the level of overlap between individual and concatenated alignments. RESULTS: We analyzed prokaryotic datasets comprising 248 invidual gene trees from 315 genomes at three taxonomic depths spanning gammaproteobacteria, proteobacteria, and prokaryotes (bacteria plus archaea), and eukaryotic datasets comprising 279 invidual gene trees from 85 genomes at two taxonomic depths: across plants-animals-fungi and within fungi. Consistent with previous findings, the branches in trees made from concatenated alignments are, in general, not supported by any of their underlying individual gene trees, even though the concatenation trees tend to possess high bootstrap proportions values. For the prokaryote data, this observation is independent of phylogenetic depth and sequence conservation. The eukaryotic data show much better agreement between concatenation and single gene trees. LGT frequencies in trees were estimated using established methods. Sequence length in individual alignments, but not sequence divergence, was found to correlate with the generation of branches that correspond to the concatenated tree. CONCLUSIONS: The weak correspondence of concatenation trees with single gene trees gives rise to the question where the phylogenetic signal in concatenated trees is coming from. The eukaryote data reveals a better correspondence between individual and concatenation trees than the prokaryote data. The question of whether the lack of correspondence between individual genes and the concatenation tree in the prokaryotic data is due to LGT or phylogenetic artefacts remains unanswered. If LGT is the cause of incongruence between concatenation and individual trees, we would have expected to see greater degrees of incongruence for more divergent prokaryotic data sets, which was not observed, although estimated rates of LGT suggest that LGT is responsible for at least some of the observed incongruence. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-014-0266-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4302582
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43025822015-01-23 Concatenated alignments and the case of the disappearing tree Thiergart, Thorsten Landan, Giddy Martin, William F BMC Evol Biol Research Article BACKGROUND: Analyzed individually, gene trees for a given taxon set tend to harbour incongruent or conflicting signals. One popular approach to deal with this circumstance is to use concatenated data. But especially in prokaryotes, where lateral gene transfer (LGT) is a natural mechanism of generating genetic diversity, there are open questions as to whether concatenation amplifies or averages phylogenetic signals residing in individual genes. Here we investigate concatenations of prokaryotic and eukaryotic datasets to investigate possible sources of incongruence in phylogenetic trees and to examine the level of overlap between individual and concatenated alignments. RESULTS: We analyzed prokaryotic datasets comprising 248 invidual gene trees from 315 genomes at three taxonomic depths spanning gammaproteobacteria, proteobacteria, and prokaryotes (bacteria plus archaea), and eukaryotic datasets comprising 279 invidual gene trees from 85 genomes at two taxonomic depths: across plants-animals-fungi and within fungi. Consistent with previous findings, the branches in trees made from concatenated alignments are, in general, not supported by any of their underlying individual gene trees, even though the concatenation trees tend to possess high bootstrap proportions values. For the prokaryote data, this observation is independent of phylogenetic depth and sequence conservation. The eukaryotic data show much better agreement between concatenation and single gene trees. LGT frequencies in trees were estimated using established methods. Sequence length in individual alignments, but not sequence divergence, was found to correlate with the generation of branches that correspond to the concatenated tree. CONCLUSIONS: The weak correspondence of concatenation trees with single gene trees gives rise to the question where the phylogenetic signal in concatenated trees is coming from. The eukaryote data reveals a better correspondence between individual and concatenation trees than the prokaryote data. The question of whether the lack of correspondence between individual genes and the concatenation tree in the prokaryotic data is due to LGT or phylogenetic artefacts remains unanswered. If LGT is the cause of incongruence between concatenation and individual trees, we would have expected to see greater degrees of incongruence for more divergent prokaryotic data sets, which was not observed, although estimated rates of LGT suggest that LGT is responsible for at least some of the observed incongruence. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-014-0266-0) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-30 /pmc/articles/PMC4302582/ /pubmed/25547755 http://dx.doi.org/10.1186/s12862-014-0266-0 Text en © Thiergart et al.; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Thiergart, Thorsten
Landan, Giddy
Martin, William F
Concatenated alignments and the case of the disappearing tree
title Concatenated alignments and the case of the disappearing tree
title_full Concatenated alignments and the case of the disappearing tree
title_fullStr Concatenated alignments and the case of the disappearing tree
title_full_unstemmed Concatenated alignments and the case of the disappearing tree
title_short Concatenated alignments and the case of the disappearing tree
title_sort concatenated alignments and the case of the disappearing tree
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4302582/
https://www.ncbi.nlm.nih.gov/pubmed/25547755
http://dx.doi.org/10.1186/s12862-014-0266-0
work_keys_str_mv AT thiergartthorsten concatenatedalignmentsandthecaseofthedisappearingtree
AT landangiddy concatenatedalignmentsandthecaseofthedisappearingtree
AT martinwilliamf concatenatedalignmentsandthecaseofthedisappearingtree