Cargando…

Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species

BACKGROUND: In phylogenetics, we often seek to reconcile gene trees with species trees within the framework of an evolutionary model. While the most popular models for eukaryotic species allow for only gene duplication and gene loss or only multispecies coalescence, recent work has combined these ph...

Descripción completa

Detalles Bibliográficos
Autores principales: Rogers, Jennifer, Fishberg, Andrew, Youngs, Nora, Wu, Yi-Chieh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5460407/
https://www.ncbi.nlm.nih.gov/pubmed/28583091
http://dx.doi.org/10.1186/s12859-017-1701-1
_version_ 1783242165675098112
author Rogers, Jennifer
Fishberg, Andrew
Youngs, Nora
Wu, Yi-Chieh
author_facet Rogers, Jennifer
Fishberg, Andrew
Youngs, Nora
Wu, Yi-Chieh
author_sort Rogers, Jennifer
collection PubMed
description BACKGROUND: In phylogenetics, we often seek to reconcile gene trees with species trees within the framework of an evolutionary model. While the most popular models for eukaryotic species allow for only gene duplication and gene loss or only multispecies coalescence, recent work has combined these phenomena through a reconciliation structure, the labeled coalescent tree (LCT), that simultaneously describes the duplication-loss and coalescent history of a gene family. However, the LCT makes the simplifying assumption that only one individual is sampled per species whereas, with advances in gene sequencing, we now have access to multiple samples per species. RESULTS: We demonstrate that with these additional samples, there exist gene tree topologies that are impossible to reconcile with any species tree. In particular, the multiple samples enforce new constraints on the placement of duplications within a valid reconciliation. To model these constraints, we extend the LCT to a new structure, the partially labeled coalescent tree (PLCT) and demonstrate how to use the PLCT to evaluate the feasibility of a gene tree topology. We apply our algorithm to two clades of apes and flies to characterize possible sources of infeasibility. CONCLUSION: Going forward, we believe that this model represents a first step towards understanding reconciliations in duplication-loss-coalescence models with multiple samples per species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1701-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5460407
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54604072017-06-07 Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species Rogers, Jennifer Fishberg, Andrew Youngs, Nora Wu, Yi-Chieh BMC Bioinformatics Methodology Article BACKGROUND: In phylogenetics, we often seek to reconcile gene trees with species trees within the framework of an evolutionary model. While the most popular models for eukaryotic species allow for only gene duplication and gene loss or only multispecies coalescence, recent work has combined these phenomena through a reconciliation structure, the labeled coalescent tree (LCT), that simultaneously describes the duplication-loss and coalescent history of a gene family. However, the LCT makes the simplifying assumption that only one individual is sampled per species whereas, with advances in gene sequencing, we now have access to multiple samples per species. RESULTS: We demonstrate that with these additional samples, there exist gene tree topologies that are impossible to reconcile with any species tree. In particular, the multiple samples enforce new constraints on the placement of duplications within a valid reconciliation. To model these constraints, we extend the LCT to a new structure, the partially labeled coalescent tree (PLCT) and demonstrate how to use the PLCT to evaluate the feasibility of a gene tree topology. We apply our algorithm to two clades of apes and flies to characterize possible sources of infeasibility. CONCLUSION: Going forward, we believe that this model represents a first step towards understanding reconciliations in duplication-loss-coalescence models with multiple samples per species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1701-1) contains supplementary material, which is available to authorized users. BioMed Central 2017-06-05 /pmc/articles/PMC5460407/ /pubmed/28583091 http://dx.doi.org/10.1186/s12859-017-1701-1 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Rogers, Jennifer
Fishberg, Andrew
Youngs, Nora
Wu, Yi-Chieh
Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species
title Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species
title_full Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species
title_fullStr Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species
title_full_unstemmed Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species
title_short Reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species
title_sort reconciliation feasibility in the presence of gene duplication, loss, and coalescence with multiple individuals per species
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5460407/
https://www.ncbi.nlm.nih.gov/pubmed/28583091
http://dx.doi.org/10.1186/s12859-017-1701-1
work_keys_str_mv AT rogersjennifer reconciliationfeasibilityinthepresenceofgeneduplicationlossandcoalescencewithmultipleindividualsperspecies
AT fishbergandrew reconciliationfeasibilityinthepresenceofgeneduplicationlossandcoalescencewithmultipleindividualsperspecies
AT youngsnora reconciliationfeasibilityinthepresenceofgeneduplicationlossandcoalescencewithmultipleindividualsperspecies
AT wuyichieh reconciliationfeasibilityinthepresenceofgeneduplicationlossandcoalescencewithmultipleindividualsperspecies