Cargando…

Genomic duplication problems for unrooted gene trees

BACKGROUND: Discovering the location of gene duplications and multiple gene duplication episodes is a fundamental issue in evolutionary molecular biology. The problem introduced by Guigó et al. in 1996 is to map gene duplication events from a collection of rooted, binary gene family trees onto their...

Descripción completa

Detalles Bibliográficos
Autores principales: Paszek, Jarosław, Górecki, Paweł
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895600/
https://www.ncbi.nlm.nih.gov/pubmed/26818591
http://dx.doi.org/10.1186/s12864-015-2308-4
_version_ 1782435883144708096
author Paszek, Jarosław
Górecki, Paweł
author_facet Paszek, Jarosław
Górecki, Paweł
author_sort Paszek, Jarosław
collection PubMed
description BACKGROUND: Discovering the location of gene duplications and multiple gene duplication episodes is a fundamental issue in evolutionary molecular biology. The problem introduced by Guigó et al. in 1996 is to map gene duplication events from a collection of rooted, binary gene family trees onto theirs corresponding rooted binary species tree in such a way that the total number of multiple gene duplication episodes is minimized. There are several models in the literature that specify how gene duplications from gene families can be interpreted as one duplication episode. However, in all duplication episode problems gene trees are rooted. This restriction limits the applicability, since unrooted gene family trees are frequently inferred by phylogenetic methods. RESULTS: In this article we show the first solution to the open problem of episode clustering where the input gene family trees are unrooted. In particular, by using theoretical properties of unrooted reconciliation, we show an efficient algorithm that reduces this problem into the episode clustering problems defined for rooted trees. We show theoretical properties of the reduction algorithm and evaluation of empirical datasets. CONCLUSIONS: We provided algorithms and tools that were successfully applied to several empirical datasets. In particular, our comparative study shows that we can improve known results on genomic duplication inference from real datasets.
format Online
Article
Text
id pubmed-4895600
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48956002016-06-10 Genomic duplication problems for unrooted gene trees Paszek, Jarosław Górecki, Paweł BMC Genomics Proceedings BACKGROUND: Discovering the location of gene duplications and multiple gene duplication episodes is a fundamental issue in evolutionary molecular biology. The problem introduced by Guigó et al. in 1996 is to map gene duplication events from a collection of rooted, binary gene family trees onto theirs corresponding rooted binary species tree in such a way that the total number of multiple gene duplication episodes is minimized. There are several models in the literature that specify how gene duplications from gene families can be interpreted as one duplication episode. However, in all duplication episode problems gene trees are rooted. This restriction limits the applicability, since unrooted gene family trees are frequently inferred by phylogenetic methods. RESULTS: In this article we show the first solution to the open problem of episode clustering where the input gene family trees are unrooted. In particular, by using theoretical properties of unrooted reconciliation, we show an efficient algorithm that reduces this problem into the episode clustering problems defined for rooted trees. We show theoretical properties of the reduction algorithm and evaluation of empirical datasets. CONCLUSIONS: We provided algorithms and tools that were successfully applied to several empirical datasets. In particular, our comparative study shows that we can improve known results on genomic duplication inference from real datasets. BioMed Central 2016-01-11 /pmc/articles/PMC4895600/ /pubmed/26818591 http://dx.doi.org/10.1186/s12864-015-2308-4 Text en © Paszek and Górecki. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Paszek, Jarosław
Górecki, Paweł
Genomic duplication problems for unrooted gene trees
title Genomic duplication problems for unrooted gene trees
title_full Genomic duplication problems for unrooted gene trees
title_fullStr Genomic duplication problems for unrooted gene trees
title_full_unstemmed Genomic duplication problems for unrooted gene trees
title_short Genomic duplication problems for unrooted gene trees
title_sort genomic duplication problems for unrooted gene trees
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895600/
https://www.ncbi.nlm.nih.gov/pubmed/26818591
http://dx.doi.org/10.1186/s12864-015-2308-4
work_keys_str_mv AT paszekjarosław genomicduplicationproblemsforunrootedgenetrees
AT goreckipaweł genomicduplicationproblemsforunrootedgenetrees