Cargando…

Joint amalgamation of most parsimonious reconciled gene trees

Motivation: Traditionally, gene phylogenies have been reconstructed solely on the basis of molecular sequences; this, however, often does not provide enough information to distinguish between statistically equivalent relationships. To address this problem, several recent methods have incorporated in...

Descripción completa

Detalles Bibliográficos
Autores principales: Scornavacca, Celine, Jacox, Edwin, Szöllősi, Gergely J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4380024/
https://www.ncbi.nlm.nih.gov/pubmed/25380957
http://dx.doi.org/10.1093/bioinformatics/btu728
_version_ 1782364276792492032
author Scornavacca, Celine
Jacox, Edwin
Szöllősi, Gergely J.
author_facet Scornavacca, Celine
Jacox, Edwin
Szöllősi, Gergely J.
author_sort Scornavacca, Celine
collection PubMed
description Motivation: Traditionally, gene phylogenies have been reconstructed solely on the basis of molecular sequences; this, however, often does not provide enough information to distinguish between statistically equivalent relationships. To address this problem, several recent methods have incorporated information on the species phylogeny in gene tree reconstruction, leading to dramatic improvements in accuracy. Although probabilistic methods are able to estimate all model parameters but are computationally expensive, parsimony methods—generally computationally more efficient—require a prior estimate of parameters and of the statistical support. Results: Here, we present the Tree Estimation using Reconciliation (TERA) algorithm, a parsimony based, species tree aware method for gene tree reconstruction based on a scoring scheme combining duplication, transfer and loss costs with an estimate of the sequence likelihood. TERA explores all reconciled gene trees that can be amalgamated from a sample of gene trees. Using a large scale simulated dataset, we demonstrate that TERA achieves the same accuracy as the corresponding probabilistic method while being faster, and outperforms other parsimony-based methods in both accuracy and speed. Running TERA on a set of 1099 homologous gene families from complete cyanobacterial genomes, we find that incorporating knowledge of the species tree results in a two thirds reduction in the number of apparent transfer events. Availability and implementation: The algorithm is implemented in our program TERA, which is freely available from http://mbb.univ-montp2.fr/MBB/download_sources/16__TERA. Contact: celine.scornavacca@univ-montp2.fr, ssolo@angel.elte.hu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4380024
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43800242015-04-15 Joint amalgamation of most parsimonious reconciled gene trees Scornavacca, Celine Jacox, Edwin Szöllősi, Gergely J. Bioinformatics Original Papers Motivation: Traditionally, gene phylogenies have been reconstructed solely on the basis of molecular sequences; this, however, often does not provide enough information to distinguish between statistically equivalent relationships. To address this problem, several recent methods have incorporated information on the species phylogeny in gene tree reconstruction, leading to dramatic improvements in accuracy. Although probabilistic methods are able to estimate all model parameters but are computationally expensive, parsimony methods—generally computationally more efficient—require a prior estimate of parameters and of the statistical support. Results: Here, we present the Tree Estimation using Reconciliation (TERA) algorithm, a parsimony based, species tree aware method for gene tree reconstruction based on a scoring scheme combining duplication, transfer and loss costs with an estimate of the sequence likelihood. TERA explores all reconciled gene trees that can be amalgamated from a sample of gene trees. Using a large scale simulated dataset, we demonstrate that TERA achieves the same accuracy as the corresponding probabilistic method while being faster, and outperforms other parsimony-based methods in both accuracy and speed. Running TERA on a set of 1099 homologous gene families from complete cyanobacterial genomes, we find that incorporating knowledge of the species tree results in a two thirds reduction in the number of apparent transfer events. Availability and implementation: The algorithm is implemented in our program TERA, which is freely available from http://mbb.univ-montp2.fr/MBB/download_sources/16__TERA. Contact: celine.scornavacca@univ-montp2.fr, ssolo@angel.elte.hu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-03-15 2014-11-06 /pmc/articles/PMC4380024/ /pubmed/25380957 http://dx.doi.org/10.1093/bioinformatics/btu728 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Scornavacca, Celine
Jacox, Edwin
Szöllősi, Gergely J.
Joint amalgamation of most parsimonious reconciled gene trees
title Joint amalgamation of most parsimonious reconciled gene trees
title_full Joint amalgamation of most parsimonious reconciled gene trees
title_fullStr Joint amalgamation of most parsimonious reconciled gene trees
title_full_unstemmed Joint amalgamation of most parsimonious reconciled gene trees
title_short Joint amalgamation of most parsimonious reconciled gene trees
title_sort joint amalgamation of most parsimonious reconciled gene trees
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4380024/
https://www.ncbi.nlm.nih.gov/pubmed/25380957
http://dx.doi.org/10.1093/bioinformatics/btu728
work_keys_str_mv AT scornavaccaceline jointamalgamationofmostparsimoniousreconciledgenetrees
AT jacoxedwin jointamalgamationofmostparsimoniousreconciledgenetrees
AT szollosigergelyj jointamalgamationofmostparsimoniousreconciledgenetrees