Cargando…
Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species
BACKGROUND: In phylogenetics, we often seek to reconcile gene trees with species trees within the framework of an evolutionary model. While the most popular models for eukaryotic species allow for only gene duplication and gene loss or only multispecies coalescence, recent work has combined these ph...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5460407/ https://www.ncbi.nlm.nih.gov/pubmed/28583091 http://dx.doi.org/10.1186/s12859-017-1701-1 |
_version_ | 1783242165675098112 |
---|---|
author | Rogers, Jennifer Fishberg, Andrew Youngs, Nora Wu, Yi-Chieh |
author_facet | Rogers, Jennifer Fishberg, Andrew Youngs, Nora Wu, Yi-Chieh |
author_sort | Rogers, Jennifer |
collection | PubMed |
description | BACKGROUND: In phylogenetics, we often seek to reconcile gene trees with species trees within the framework of an evolutionary model. While the most popular models for eukaryotic species allow for only gene duplication and gene loss or only multispecies coalescence, recent work has combined these phenomena through a reconciliation structure, the labeled coalescent tree (LCT), that simultaneously describes the duplication-loss and coalescent history of a gene family. However, the LCT makes the simplifying assumption that only one individual is sampled per species whereas, with advances in gene sequencing, we now have access to multiple samples per species. RESULTS: We demonstrate that with these additional samples, there exist gene tree topologies that are impossible to reconcile with any species tree. In particular, the multiple samples enforce new constraints on the placement of duplications within a valid reconciliation. To model these constraints, we extend the LCT to a new structure, the partially labeled coalescent tree (PLCT) and demonstrate how to use the PLCT to evaluate the feasibility of a gene tree topology. We apply our algorithm to two clades of apes and flies to characterize possible sources of infeasibility. CONCLUSION: Going forward, we believe that this model represents a first step towards understanding reconciliations in duplication-loss-coalescence models with multiple samples per species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1701-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5460407 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54604072017-06-07 Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species Rogers, Jennifer Fishberg, Andrew Youngs, Nora Wu, Yi-Chieh BMC Bioinformatics Methodology Article BACKGROUND: In phylogenetics, we often seek to reconcile gene trees with species trees within the framework of an evolutionary model. While the most popular models for eukaryotic species allow for only gene duplication and gene loss or only multispecies coalescence, recent work has combined these phenomena through a reconciliation structure, the labeled coalescent tree (LCT), that simultaneously describes the duplication-loss and coalescent history of a gene family. However, the LCT makes the simplifying assumption that only one individual is sampled per species whereas, with advances in gene sequencing, we now have access to multiple samples per species. RESULTS: We demonstrate that with these additional samples, there exist gene tree topologies that are impossible to reconcile with any species tree. In particular, the multiple samples enforce new constraints on the placement of duplications within a valid reconciliation. To model these constraints, we extend the LCT to a new structure, the partially labeled coalescent tree (PLCT) and demonstrate how to use the PLCT to evaluate the feasibility of a gene tree topology. We apply our algorithm to two clades of apes and flies to characterize possible sources of infeasibility. CONCLUSION: Going forward, we believe that this model represents a first step towards understanding reconciliations in duplication-loss-coalescence models with multiple samples per species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1701-1) contains supplementary material, which is available to authorized users. BioMed Central 2017-06-05 /pmc/articles/PMC5460407/ /pubmed/28583091 http://dx.doi.org/10.1186/s12859-017-1701-1 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Rogers, Jennifer Fishberg, Andrew Youngs, Nora Wu, Yi-Chieh Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species |
title | Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species |
title_full | Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species |
title_fullStr | Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species |
title_full_unstemmed | Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species |
title_short | Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species |
title_sort | reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5460407/ https://www.ncbi.nlm.nih.gov/pubmed/28583091 http://dx.doi.org/10.1186/s12859-017-1701-1 |
work_keys_str_mv | AT rogersjennifer reconciliationfeasibilityinthepresenceofgeneduplicationlossandcoalescencewithmultipleindividualsperspecies AT fishbergandrew reconciliationfeasibilityinthepresenceofgeneduplicationlossandcoalescencewithmultipleindividualsperspecies AT youngsnora reconciliationfeasibilityinthepresenceofgeneduplicationlossandcoalescencewithmultipleindividualsperspecies AT wuyichieh reconciliationfeasibilityinthepresenceofgeneduplicationlossandcoalescencewithmultipleindividualsperspecies |