Cargando…

Using Likelihood-Free Inference to Compare Evolutionary Dynamics of the Protein Networks of H. pylori and P. falciparum

Gene duplication with subsequent interaction divergence is one of the primary driving forces in the evolution of genetic systems. Yet little is known about the precise mechanisms and the role of duplication divergence in the evolution of protein networks from the prokaryote and eukaryote domains. We...

Descripción completa

Detalles Bibliográficos
Autores principales: Ratmann, Oliver, Jørgensen, Ole, Hinkley, Trevor, Stumpf, Michael, Richardson, Sylvia, Wiuf, Carsten
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2098858/
https://www.ncbi.nlm.nih.gov/pubmed/18052538
http://dx.doi.org/10.1371/journal.pcbi.0030230
_version_ 1782138282261348352
author Ratmann, Oliver
Jørgensen, Ole
Hinkley, Trevor
Stumpf, Michael
Richardson, Sylvia
Wiuf, Carsten
author_facet Ratmann, Oliver
Jørgensen, Ole
Hinkley, Trevor
Stumpf, Michael
Richardson, Sylvia
Wiuf, Carsten
author_sort Ratmann, Oliver
collection PubMed
description Gene duplication with subsequent interaction divergence is one of the primary driving forces in the evolution of genetic systems. Yet little is known about the precise mechanisms and the role of duplication divergence in the evolution of protein networks from the prokaryote and eukaryote domains. We developed a novel, model-based approach for Bayesian inference on biological network data that centres on approximate Bayesian computation, or likelihood-free inference. Instead of computing the intractable likelihood of the protein network topology, our method summarizes key features of the network and, based on these, uses a MCMC algorithm to approximate the posterior distribution of the model parameters. This allowed us to reliably fit a flexible mixture model that captures hallmarks of evolution by gene duplication and subfunctionalization to protein interaction network data of Helicobacter pylori and Plasmodium falciparum. The 80% credible intervals for the duplication–divergence component are [0.64, 0.98] for H. pylori and [0.87, 0.99] for P. falciparum. The remaining parameter estimates are not inconsistent with sequence data. An extensive sensitivity analysis showed that incompleteness of PIN data does not largely affect the analysis of models of protein network evolution, and that the degree sequence alone barely captures the evolutionary footprints of protein networks relative to other statistics. Our likelihood-free inference approach enables a fully Bayesian analysis of a complex and highly stochastic system that is otherwise intractable at present. Modelling the evolutionary history of PIN data, it transpires that only the simultaneous analysis of several global aspects of protein networks enables credible and consistent inference to be made from available datasets. Our results indicate that gene duplication has played a larger part in the network evolution of the eukaryote than in the prokaryote, and suggests that single gene duplications with immediate divergence alone may explain more than 60% of biological network data in both domains.
format Text
id pubmed-2098858
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-20988582007-11-29 Using Likelihood-Free Inference to Compare Evolutionary Dynamics of the Protein Networks of H. pylori and P. falciparum Ratmann, Oliver Jørgensen, Ole Hinkley, Trevor Stumpf, Michael Richardson, Sylvia Wiuf, Carsten PLoS Comput Biol Research Article Gene duplication with subsequent interaction divergence is one of the primary driving forces in the evolution of genetic systems. Yet little is known about the precise mechanisms and the role of duplication divergence in the evolution of protein networks from the prokaryote and eukaryote domains. We developed a novel, model-based approach for Bayesian inference on biological network data that centres on approximate Bayesian computation, or likelihood-free inference. Instead of computing the intractable likelihood of the protein network topology, our method summarizes key features of the network and, based on these, uses a MCMC algorithm to approximate the posterior distribution of the model parameters. This allowed us to reliably fit a flexible mixture model that captures hallmarks of evolution by gene duplication and subfunctionalization to protein interaction network data of Helicobacter pylori and Plasmodium falciparum. The 80% credible intervals for the duplication–divergence component are [0.64, 0.98] for H. pylori and [0.87, 0.99] for P. falciparum. The remaining parameter estimates are not inconsistent with sequence data. An extensive sensitivity analysis showed that incompleteness of PIN data does not largely affect the analysis of models of protein network evolution, and that the degree sequence alone barely captures the evolutionary footprints of protein networks relative to other statistics. Our likelihood-free inference approach enables a fully Bayesian analysis of a complex and highly stochastic system that is otherwise intractable at present. Modelling the evolutionary history of PIN data, it transpires that only the simultaneous analysis of several global aspects of protein networks enables credible and consistent inference to be made from available datasets. Our results indicate that gene duplication has played a larger part in the network evolution of the eukaryote than in the prokaryote, and suggests that single gene duplications with immediate divergence alone may explain more than 60% of biological network data in both domains. Public Library of Science 2007-11 2007-11-30 /pmc/articles/PMC2098858/ /pubmed/18052538 http://dx.doi.org/10.1371/journal.pcbi.0030230 Text en © 2007 Ratmann et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ratmann, Oliver
Jørgensen, Ole
Hinkley, Trevor
Stumpf, Michael
Richardson, Sylvia
Wiuf, Carsten
Using Likelihood-Free Inference to Compare Evolutionary Dynamics of the Protein Networks of H. pylori and P. falciparum
title Using Likelihood-Free Inference to Compare Evolutionary Dynamics of the Protein Networks of H. pylori and P. falciparum
title_full Using Likelihood-Free Inference to Compare Evolutionary Dynamics of the Protein Networks of H. pylori and P. falciparum
title_fullStr Using Likelihood-Free Inference to Compare Evolutionary Dynamics of the Protein Networks of H. pylori and P. falciparum
title_full_unstemmed Using Likelihood-Free Inference to Compare Evolutionary Dynamics of the Protein Networks of H. pylori and P. falciparum
title_short Using Likelihood-Free Inference to Compare Evolutionary Dynamics of the Protein Networks of H. pylori and P. falciparum
title_sort using likelihood-free inference to compare evolutionary dynamics of the protein networks of h. pylori and p. falciparum
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2098858/
https://www.ncbi.nlm.nih.gov/pubmed/18052538
http://dx.doi.org/10.1371/journal.pcbi.0030230
work_keys_str_mv AT ratmannoliver usinglikelihoodfreeinferencetocompareevolutionarydynamicsoftheproteinnetworksofhpyloriandpfalciparum
AT jørgensenole usinglikelihoodfreeinferencetocompareevolutionarydynamicsoftheproteinnetworksofhpyloriandpfalciparum
AT hinkleytrevor usinglikelihoodfreeinferencetocompareevolutionarydynamicsoftheproteinnetworksofhpyloriandpfalciparum
AT stumpfmichael usinglikelihoodfreeinferencetocompareevolutionarydynamicsoftheproteinnetworksofhpyloriandpfalciparum
AT richardsonsylvia usinglikelihoodfreeinferencetocompareevolutionarydynamicsoftheproteinnetworksofhpyloriandpfalciparum
AT wiufcarsten usinglikelihoodfreeinferencetocompareevolutionarydynamicsoftheproteinnetworksofhpyloriandpfalciparum