Cargando…

Cophylogeny Reconstruction via an Approximate Bayesian Computation

Despite an increasingly vast literature on cophylogenetic reconstructions for studying host–parasite associations, understanding the common evolutionary history of such systems remains a problem that is far from being solved. Most algorithms for host–parasite reconciliation use an event-based model,...

Descripción completa

Detalles Bibliográficos
Autores principales: Baudet, C., Donati, B., Sinaimeri, B., Crescenzi, P., Gautier, C., Matias, C., Sagot, M.-F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4395844/
https://www.ncbi.nlm.nih.gov/pubmed/25540454
http://dx.doi.org/10.1093/sysbio/syu129
_version_ 1782366501994496000
author Baudet, C.
Donati, B.
Sinaimeri, B.
Crescenzi, P.
Gautier, C.
Matias, C.
Sagot, M.-F.
author_facet Baudet, C.
Donati, B.
Sinaimeri, B.
Crescenzi, P.
Gautier, C.
Matias, C.
Sagot, M.-F.
author_sort Baudet, C.
collection PubMed
description Despite an increasingly vast literature on cophylogenetic reconstructions for studying host–parasite associations, understanding the common evolutionary history of such systems remains a problem that is far from being solved. Most algorithms for host–parasite reconciliation use an event-based model, where the events include in general (a subset of) cospeciation, duplication, loss, and host switch. All known parsimonious event-based methods then assign a cost to each type of event in order to find a reconstruction of minimum cost. The main problem with this approach is that the cost of the events strongly influences the reconciliation obtained. Some earlier approaches attempt to avoid this problem by finding a Pareto set of solutions and hence by considering event costs under some minimization constraints. To deal with this problem, we developed an algorithm, called Coala, for estimating the frequency of the events based on an approximate Bayesian computation approach. The benefits of this method are 2-fold: (i) it provides more confidence in the set of costs to be used in a reconciliation, and (ii) it allows estimation of the frequency of the events in cases where the data set consists of trees with a large number of taxa. We evaluate our method on simulated and on biological data sets. We show that in both cases, for the same pair of host and parasite trees, different sets of frequencies for the events lead to equally probable solutions. Moreover, often these solutions differ greatly in terms of the number of inferred events. It appears crucial to take this into account before attempting any further biological interpretation of such reconciliations. More generally, we also show that the set of frequencies can vary widely depending on the input host and parasite trees. Indiscriminately applying a standard vector of costs may thus not be a good strategy.
format Online
Article
Text
id pubmed-4395844
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43958442015-04-15 Cophylogeny Reconstruction via an Approximate Bayesian Computation Baudet, C. Donati, B. Sinaimeri, B. Crescenzi, P. Gautier, C. Matias, C. Sagot, M.-F. Syst Biol Regular Articles Despite an increasingly vast literature on cophylogenetic reconstructions for studying host–parasite associations, understanding the common evolutionary history of such systems remains a problem that is far from being solved. Most algorithms for host–parasite reconciliation use an event-based model, where the events include in general (a subset of) cospeciation, duplication, loss, and host switch. All known parsimonious event-based methods then assign a cost to each type of event in order to find a reconstruction of minimum cost. The main problem with this approach is that the cost of the events strongly influences the reconciliation obtained. Some earlier approaches attempt to avoid this problem by finding a Pareto set of solutions and hence by considering event costs under some minimization constraints. To deal with this problem, we developed an algorithm, called Coala, for estimating the frequency of the events based on an approximate Bayesian computation approach. The benefits of this method are 2-fold: (i) it provides more confidence in the set of costs to be used in a reconciliation, and (ii) it allows estimation of the frequency of the events in cases where the data set consists of trees with a large number of taxa. We evaluate our method on simulated and on biological data sets. We show that in both cases, for the same pair of host and parasite trees, different sets of frequencies for the events lead to equally probable solutions. Moreover, often these solutions differ greatly in terms of the number of inferred events. It appears crucial to take this into account before attempting any further biological interpretation of such reconciliations. More generally, we also show that the set of frequencies can vary widely depending on the input host and parasite trees. Indiscriminately applying a standard vector of costs may thus not be a good strategy. Oxford University Press 2015-05 2014-12-24 /pmc/articles/PMC4395844/ /pubmed/25540454 http://dx.doi.org/10.1093/sysbio/syu129 Text en © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Regular Articles
Baudet, C.
Donati, B.
Sinaimeri, B.
Crescenzi, P.
Gautier, C.
Matias, C.
Sagot, M.-F.
Cophylogeny Reconstruction via an Approximate Bayesian Computation
title Cophylogeny Reconstruction via an Approximate Bayesian Computation
title_full Cophylogeny Reconstruction via an Approximate Bayesian Computation
title_fullStr Cophylogeny Reconstruction via an Approximate Bayesian Computation
title_full_unstemmed Cophylogeny Reconstruction via an Approximate Bayesian Computation
title_short Cophylogeny Reconstruction via an Approximate Bayesian Computation
title_sort cophylogeny reconstruction via an approximate bayesian computation
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4395844/
https://www.ncbi.nlm.nih.gov/pubmed/25540454
http://dx.doi.org/10.1093/sysbio/syu129
work_keys_str_mv AT baudetc cophylogenyreconstructionviaanapproximatebayesiancomputation
AT donatib cophylogenyreconstructionviaanapproximatebayesiancomputation
AT sinaimerib cophylogenyreconstructionviaanapproximatebayesiancomputation
AT crescenzip cophylogenyreconstructionviaanapproximatebayesiancomputation
AT gautierc cophylogenyreconstructionviaanapproximatebayesiancomputation
AT matiasc cophylogenyreconstructionviaanapproximatebayesiancomputation
AT sagotmf cophylogenyreconstructionviaanapproximatebayesiancomputation