Cargando…

Efficient Exploration of the Space of Reconciled Gene Trees

Gene trees record the combination of gene-level events, such as duplication, transfer and loss (DTL), and species-level events, such as speciation and extinction. Gene tree–species tree reconciliation methods model these processes by drawing gene trees into the species tree using a series of gene an...

Descripción completa

Detalles Bibliográficos
Autores principales: Szöllősi, Gergely J., Rosikiewicz, Wojciech, Boussau, Bastien, Tannier, Eric, Daubin, Vincent
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3797637/
https://www.ncbi.nlm.nih.gov/pubmed/23925510
http://dx.doi.org/10.1093/sysbio/syt054
_version_ 1782287641655377920
author Szöllősi, Gergely J.
Rosikiewicz, Wojciech
Boussau, Bastien
Tannier, Eric
Daubin, Vincent
author_facet Szöllősi, Gergely J.
Rosikiewicz, Wojciech
Boussau, Bastien
Tannier, Eric
Daubin, Vincent
author_sort Szöllősi, Gergely J.
collection PubMed
description Gene trees record the combination of gene-level events, such as duplication, transfer and loss (DTL), and species-level events, such as speciation and extinction. Gene tree–species tree reconciliation methods model these processes by drawing gene trees into the species tree using a series of gene and species-level events. The reconstruction of gene trees based on sequence alone almost always involves choosing between statistically equivalent or weakly distinguishable relationships that could be much better resolved based on a putative species tree. To exploit this potential for accurate reconstruction of gene trees, the space of reconciled gene trees must be explored according to a joint model of sequence evolution and gene tree–species tree reconciliation. Here we present amalgamated likelihood estimation (ALE), a probabilistic approach to exhaustively explore all reconciled gene trees that can be amalgamated as a combination of clades observed in a sample of gene trees. We implement the ALE approach in the context of a reconciliation model (Szöllősi et al. 2013), which allows for the DTL of genes. We use ALE to efficiently approximate the sum of the joint likelihood over amalgamations and to find the reconciled gene tree that maximizes the joint likelihood among all such trees. We demonstrate using simulations that gene trees reconstructed using the joint likelihood are substantially more accurate than those reconstructed using sequence alone. Using realistic gene tree topologies, branch lengths, and alignment sizes, we demonstrate that ALE produces more accurate gene trees even if the model of sequence evolution is greatly simplified. Finally, examining 1099 gene families from 36 cyanobacterial genomes we find that joint likelihood-based inference results in a striking reduction in apparent phylogenetic discord, with respectively. 24%, 59%, and 46% reductions in the mean numbers of duplications, transfers, and losses per gene family. The open source implementation of ALE is available from https://github.com/ssolo/ALE.git. [amalgamation; gene tree reconciliation; gene tree reconstruction; lateral gene transfer; phylogeny.]
format Online
Article
Text
id pubmed-3797637
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-37976372013-10-17 Efficient Exploration of the Space of Reconciled Gene Trees Szöllősi, Gergely J. Rosikiewicz, Wojciech Boussau, Bastien Tannier, Eric Daubin, Vincent Syst Biol Regular Articles Gene trees record the combination of gene-level events, such as duplication, transfer and loss (DTL), and species-level events, such as speciation and extinction. Gene tree–species tree reconciliation methods model these processes by drawing gene trees into the species tree using a series of gene and species-level events. The reconstruction of gene trees based on sequence alone almost always involves choosing between statistically equivalent or weakly distinguishable relationships that could be much better resolved based on a putative species tree. To exploit this potential for accurate reconstruction of gene trees, the space of reconciled gene trees must be explored according to a joint model of sequence evolution and gene tree–species tree reconciliation. Here we present amalgamated likelihood estimation (ALE), a probabilistic approach to exhaustively explore all reconciled gene trees that can be amalgamated as a combination of clades observed in a sample of gene trees. We implement the ALE approach in the context of a reconciliation model (Szöllősi et al. 2013), which allows for the DTL of genes. We use ALE to efficiently approximate the sum of the joint likelihood over amalgamations and to find the reconciled gene tree that maximizes the joint likelihood among all such trees. We demonstrate using simulations that gene trees reconstructed using the joint likelihood are substantially more accurate than those reconstructed using sequence alone. Using realistic gene tree topologies, branch lengths, and alignment sizes, we demonstrate that ALE produces more accurate gene trees even if the model of sequence evolution is greatly simplified. Finally, examining 1099 gene families from 36 cyanobacterial genomes we find that joint likelihood-based inference results in a striking reduction in apparent phylogenetic discord, with respectively. 24%, 59%, and 46% reductions in the mean numbers of duplications, transfers, and losses per gene family. The open source implementation of ALE is available from https://github.com/ssolo/ALE.git. [amalgamation; gene tree reconciliation; gene tree reconstruction; lateral gene transfer; phylogeny.] Oxford University Press 2013-11 2013-09-18 /pmc/articles/PMC3797637/ /pubmed/23925510 http://dx.doi.org/10.1093/sysbio/syt054 Text en © The Author(s) 2013. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Regular Articles
Szöllősi, Gergely J.
Rosikiewicz, Wojciech
Boussau, Bastien
Tannier, Eric
Daubin, Vincent
Efficient Exploration of the Space of Reconciled Gene Trees
title Efficient Exploration of the Space of Reconciled Gene Trees
title_full Efficient Exploration of the Space of Reconciled Gene Trees
title_fullStr Efficient Exploration of the Space of Reconciled Gene Trees
title_full_unstemmed Efficient Exploration of the Space of Reconciled Gene Trees
title_short Efficient Exploration of the Space of Reconciled Gene Trees
title_sort efficient exploration of the space of reconciled gene trees
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3797637/
https://www.ncbi.nlm.nih.gov/pubmed/23925510
http://dx.doi.org/10.1093/sysbio/syt054
work_keys_str_mv AT szollosigergelyj efficientexplorationofthespaceofreconciledgenetrees
AT rosikiewiczwojciech efficientexplorationofthespaceofreconciledgenetrees
AT boussaubastien efficientexplorationofthespaceofreconciledgenetrees
AT tanniereric efficientexplorationofthespaceofreconciledgenetrees
AT daubinvincent efficientexplorationofthespaceofreconciledgenetrees