Cargando…

Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem

BACKGROUND: Evolutionary methods are increasingly challenged by the wealth of fast growing resources of genomic sequence information. Evolutionary events, like gene duplication, loss, and deep coalescence, account more then ever for incongruence between gene trees and the actual species tree. Gene t...

Descripción completa

Detalles Bibliográficos
Autores principales: Górecki, Pawel, Eulenstein, Oliver
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3382441/
https://www.ncbi.nlm.nih.gov/pubmed/22759419
http://dx.doi.org/10.1186/1471-2105-13-S10-S14
_version_ 1782236500709081088
author Górecki, Pawel
Eulenstein, Oliver
author_facet Górecki, Pawel
Eulenstein, Oliver
author_sort Górecki, Pawel
collection PubMed
description BACKGROUND: Evolutionary methods are increasingly challenged by the wealth of fast growing resources of genomic sequence information. Evolutionary events, like gene duplication, loss, and deep coalescence, account more then ever for incongruence between gene trees and the actual species tree. Gene tree reconciliation is addressing this fundamental problem by invoking the minimum number of gene duplication and losses that reconcile a rooted gene tree with a rooted species tree. However, the reconciliation process is highly sensitive to topological error or wrong rooting of the gene tree, a condition that is not met by most gene trees in practice. Thus, despite the promises of gene tree reconciliation, its applicability in practice is severely limited. RESULTS: We introduce the problem of reconciling unrooted and erroneous gene trees by simultaneously rooting and error-correcting them, and describe an efficient algorithm for this problem. Moreover, we introduce an error-corrected version of the gene duplication problem, a standard application of gene tree reconciliation. We introduce an effective heuristic for our error-corrected version of the gene duplication problem, given that the original version of this problem is NP-hard. Our experimental results suggest that our error-correcting approaches for unrooted input trees can significantly improve on the accuracy of gene tree reconciliation, and the species tree inference under the gene duplication problem. Furthermore, the efficiency of our algorithm for error-correcting reconciliation is capable of handling truly large-scale phylogenetic studies. CONCLUSIONS: Our presented error-correction approach is a crucial step towards making gene tree reconciliation more robust, and thus to improve on the accuracy of applications that fundamentally rely on gene tree reconciliation, like the inference of gene-duplication supertrees.
format Online
Article
Text
id pubmed-3382441
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33824412012-06-28 Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem Górecki, Pawel Eulenstein, Oliver BMC Bioinformatics Proceedings BACKGROUND: Evolutionary methods are increasingly challenged by the wealth of fast growing resources of genomic sequence information. Evolutionary events, like gene duplication, loss, and deep coalescence, account more then ever for incongruence between gene trees and the actual species tree. Gene tree reconciliation is addressing this fundamental problem by invoking the minimum number of gene duplication and losses that reconcile a rooted gene tree with a rooted species tree. However, the reconciliation process is highly sensitive to topological error or wrong rooting of the gene tree, a condition that is not met by most gene trees in practice. Thus, despite the promises of gene tree reconciliation, its applicability in practice is severely limited. RESULTS: We introduce the problem of reconciling unrooted and erroneous gene trees by simultaneously rooting and error-correcting them, and describe an efficient algorithm for this problem. Moreover, we introduce an error-corrected version of the gene duplication problem, a standard application of gene tree reconciliation. We introduce an effective heuristic for our error-corrected version of the gene duplication problem, given that the original version of this problem is NP-hard. Our experimental results suggest that our error-correcting approaches for unrooted input trees can significantly improve on the accuracy of gene tree reconciliation, and the species tree inference under the gene duplication problem. Furthermore, the efficiency of our algorithm for error-correcting reconciliation is capable of handling truly large-scale phylogenetic studies. CONCLUSIONS: Our presented error-correction approach is a crucial step towards making gene tree reconciliation more robust, and thus to improve on the accuracy of applications that fundamentally rely on gene tree reconciliation, like the inference of gene-duplication supertrees. BioMed Central 2012-06-25 /pmc/articles/PMC3382441/ /pubmed/22759419 http://dx.doi.org/10.1186/1471-2105-13-S10-S14 Text en Copyright ©2012 Górecki and Eulenstein; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Górecki, Pawel
Eulenstein, Oliver
Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem
title Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem
title_full Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem
title_fullStr Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem
title_full_unstemmed Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem
title_short Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem
title_sort algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3382441/
https://www.ncbi.nlm.nih.gov/pubmed/22759419
http://dx.doi.org/10.1186/1471-2105-13-S10-S14
work_keys_str_mv AT goreckipawel algorithmssimultaneouserrorcorrectionandrootingforgenetreereconciliationandthegeneduplicationproblem
AT eulensteinoliver algorithmssimultaneouserrorcorrectionandrootingforgenetreereconciliationandthegeneduplicationproblem