Cargando…
Genomic duplication problems for unrooted gene trees
BACKGROUND: Discovering the location of gene duplications and multiple gene duplication episodes is a fundamental issue in evolutionary molecular biology. The problem introduced by Guigó et al. in 1996 is to map gene duplication events from a collection of rooted, binary gene family trees onto their...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895600/ https://www.ncbi.nlm.nih.gov/pubmed/26818591 http://dx.doi.org/10.1186/s12864-015-2308-4 |
_version_ | 1782435883144708096 |
---|---|
author | Paszek, Jarosław Górecki, Paweł |
author_facet | Paszek, Jarosław Górecki, Paweł |
author_sort | Paszek, Jarosław |
collection | PubMed |
description | BACKGROUND: Discovering the location of gene duplications and multiple gene duplication episodes is a fundamental issue in evolutionary molecular biology. The problem introduced by Guigó et al. in 1996 is to map gene duplication events from a collection of rooted, binary gene family trees onto theirs corresponding rooted binary species tree in such a way that the total number of multiple gene duplication episodes is minimized. There are several models in the literature that specify how gene duplications from gene families can be interpreted as one duplication episode. However, in all duplication episode problems gene trees are rooted. This restriction limits the applicability, since unrooted gene family trees are frequently inferred by phylogenetic methods. RESULTS: In this article we show the first solution to the open problem of episode clustering where the input gene family trees are unrooted. In particular, by using theoretical properties of unrooted reconciliation, we show an efficient algorithm that reduces this problem into the episode clustering problems defined for rooted trees. We show theoretical properties of the reduction algorithm and evaluation of empirical datasets. CONCLUSIONS: We provided algorithms and tools that were successfully applied to several empirical datasets. In particular, our comparative study shows that we can improve known results on genomic duplication inference from real datasets. |
format | Online Article Text |
id | pubmed-4895600 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48956002016-06-10 Genomic duplication problems for unrooted gene trees Paszek, Jarosław Górecki, Paweł BMC Genomics Proceedings BACKGROUND: Discovering the location of gene duplications and multiple gene duplication episodes is a fundamental issue in evolutionary molecular biology. The problem introduced by Guigó et al. in 1996 is to map gene duplication events from a collection of rooted, binary gene family trees onto theirs corresponding rooted binary species tree in such a way that the total number of multiple gene duplication episodes is minimized. There are several models in the literature that specify how gene duplications from gene families can be interpreted as one duplication episode. However, in all duplication episode problems gene trees are rooted. This restriction limits the applicability, since unrooted gene family trees are frequently inferred by phylogenetic methods. RESULTS: In this article we show the first solution to the open problem of episode clustering where the input gene family trees are unrooted. In particular, by using theoretical properties of unrooted reconciliation, we show an efficient algorithm that reduces this problem into the episode clustering problems defined for rooted trees. We show theoretical properties of the reduction algorithm and evaluation of empirical datasets. CONCLUSIONS: We provided algorithms and tools that were successfully applied to several empirical datasets. In particular, our comparative study shows that we can improve known results on genomic duplication inference from real datasets. BioMed Central 2016-01-11 /pmc/articles/PMC4895600/ /pubmed/26818591 http://dx.doi.org/10.1186/s12864-015-2308-4 Text en © Paszek and Górecki. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Paszek, Jarosław Górecki, Paweł Genomic duplication problems for unrooted gene trees |
title | Genomic duplication problems for unrooted gene trees |
title_full | Genomic duplication problems for unrooted gene trees |
title_fullStr | Genomic duplication problems for unrooted gene trees |
title_full_unstemmed | Genomic duplication problems for unrooted gene trees |
title_short | Genomic duplication problems for unrooted gene trees |
title_sort | genomic duplication problems for unrooted gene trees |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895600/ https://www.ncbi.nlm.nih.gov/pubmed/26818591 http://dx.doi.org/10.1186/s12864-015-2308-4 |
work_keys_str_mv | AT paszekjarosław genomicduplicationproblemsforunrootedgenetrees AT goreckipaweł genomicduplicationproblemsforunrootedgenetrees |