Cargando…

Multiple consensus trees: a method to separate divergent genes

BACKGROUND: It is generally admitted that the species tree cannot be inferred from the genetic sequences of a single gene because the evolution of different genes, and thus the gene tree topologies, may vary substantially. Gene trees can differ, for example, because of horizontal transfer events or...

Descripción completa

Detalles Bibliográficos
Autor principal: Guénoche, Alain
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599424/
https://www.ncbi.nlm.nih.gov/pubmed/23394478
http://dx.doi.org/10.1186/1471-2105-14-46
_version_ 1782262960205332480
author Guénoche, Alain
author_facet Guénoche, Alain
author_sort Guénoche, Alain
collection PubMed
description BACKGROUND: It is generally admitted that the species tree cannot be inferred from the genetic sequences of a single gene because the evolution of different genes, and thus the gene tree topologies, may vary substantially. Gene trees can differ, for example, because of horizontal transfer events or because some of them correspond to paralogous instead of orthologous sequences. A variety of methods has been proposed to tackle the problem of the reconciliation of gene trees in order to reconstruct a species tree. When the taxa in all the trees are identical, the problem can be stated as a consensus tree problem. RESULTS: In this paper we define a new method for deciding whether a unique consensus tree or multiple consensus trees can best represent a set of given phylogenetic trees. If the given trees are all congruent, they should be compatible into a single consensus tree. Otherwise, several consensus trees corresponding to divergent genetic patterns can be identified. We introduce a method optimizing the generalized score, over a set of tree partitions in order to decide whether the given set of gene trees is homogeneous or not. CONCLUSIONS: The proposed method has been validated with simulated data (random trees organized in three topological groups) as well as with real data (bootstrap trees, homogeneous set of trees, and a set of non homogeneous gene trees of 30 E. Coli strains; it is worth noting that some of the latter genes underwent horizontal gene transfers). A computer program, MCT - Multiple Consensus Trees, written in C was made freely available for the research community (it can be downloaded from http://bioinformatics.lif.univ-mrs.fr/consensus/index.html). It handles trees in a standard Newick format, builds three hierarchies corresponding to RF and QS similarities between trees and the greedy ascending algorithm. The generalized score values of all tree partitions are computed.
format Online
Article
Text
id pubmed-3599424
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35994242013-03-23 Multiple consensus trees: a method to separate divergent genes Guénoche, Alain BMC Bioinformatics Methodology Article BACKGROUND: It is generally admitted that the species tree cannot be inferred from the genetic sequences of a single gene because the evolution of different genes, and thus the gene tree topologies, may vary substantially. Gene trees can differ, for example, because of horizontal transfer events or because some of them correspond to paralogous instead of orthologous sequences. A variety of methods has been proposed to tackle the problem of the reconciliation of gene trees in order to reconstruct a species tree. When the taxa in all the trees are identical, the problem can be stated as a consensus tree problem. RESULTS: In this paper we define a new method for deciding whether a unique consensus tree or multiple consensus trees can best represent a set of given phylogenetic trees. If the given trees are all congruent, they should be compatible into a single consensus tree. Otherwise, several consensus trees corresponding to divergent genetic patterns can be identified. We introduce a method optimizing the generalized score, over a set of tree partitions in order to decide whether the given set of gene trees is homogeneous or not. CONCLUSIONS: The proposed method has been validated with simulated data (random trees organized in three topological groups) as well as with real data (bootstrap trees, homogeneous set of trees, and a set of non homogeneous gene trees of 30 E. Coli strains; it is worth noting that some of the latter genes underwent horizontal gene transfers). A computer program, MCT - Multiple Consensus Trees, written in C was made freely available for the research community (it can be downloaded from http://bioinformatics.lif.univ-mrs.fr/consensus/index.html). It handles trees in a standard Newick format, builds three hierarchies corresponding to RF and QS similarities between trees and the greedy ascending algorithm. The generalized score values of all tree partitions are computed. BioMed Central 2013-02-09 /pmc/articles/PMC3599424/ /pubmed/23394478 http://dx.doi.org/10.1186/1471-2105-14-46 Text en Copyright ©2013 Guénoche; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Guénoche, Alain
Multiple consensus trees: a method to separate divergent genes
title Multiple consensus trees: a method to separate divergent genes
title_full Multiple consensus trees: a method to separate divergent genes
title_fullStr Multiple consensus trees: a method to separate divergent genes
title_full_unstemmed Multiple consensus trees: a method to separate divergent genes
title_short Multiple consensus trees: a method to separate divergent genes
title_sort multiple consensus trees: a method to separate divergent genes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599424/
https://www.ncbi.nlm.nih.gov/pubmed/23394478
http://dx.doi.org/10.1186/1471-2105-14-46
work_keys_str_mv AT guenochealain multipleconsensustreesamethodtoseparatedivergentgenes