Cargando…

A multi-scale coevolutionary approach to predict interactions between protein domains

Interacting proteins and protein domains coevolve on multiple scales, from their correlated presence across species, to correlations in amino-acid usage. Genomic databases provide rapidly growing data for variability in genomic protein content and in protein sequences, calling for computational pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Croce, Giancarlo, Gueudré, Thomas, Ruiz Cuevas, Maria Virginia, Keidel, Victoria, Figliuzzi, Matteo, Szurmant, Hendrik, Weigt, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6822775/
https://www.ncbi.nlm.nih.gov/pubmed/31634362
http://dx.doi.org/10.1371/journal.pcbi.1006891
_version_ 1783464403286360064
author Croce, Giancarlo
Gueudré, Thomas
Ruiz Cuevas, Maria Virginia
Keidel, Victoria
Figliuzzi, Matteo
Szurmant, Hendrik
Weigt, Martin
author_facet Croce, Giancarlo
Gueudré, Thomas
Ruiz Cuevas, Maria Virginia
Keidel, Victoria
Figliuzzi, Matteo
Szurmant, Hendrik
Weigt, Martin
author_sort Croce, Giancarlo
collection PubMed
description Interacting proteins and protein domains coevolve on multiple scales, from their correlated presence across species, to correlations in amino-acid usage. Genomic databases provide rapidly growing data for variability in genomic protein content and in protein sequences, calling for computational predictions of unknown interactions. We first introduce the concept of direct phyletic couplings, based on global statistical models of phylogenetic profiles. They strongly increase the accuracy of predicting pairs of related protein domains beyond simpler correlation-based approaches like phylogenetic profiling (80% vs. 30–50% positives out of the 1000 highest-scoring pairs). Combined with the direct coupling analysis of inter-protein residue-residue coevolution, we provide multi-scale evidence for direct but unknown interaction between protein families. An in-depth discussion shows these to be biologically sensible and directly experimentally testable. Negative phyletic couplings highlight alternative solutions for the same functionality, including documented cases of convergent evolution. Thereby our work proves the strong potential of global statistical modeling approaches to genome-wide coevolutionary analysis, far beyond the established use for individual protein complexes and domain-domain interactions.
format Online
Article
Text
id pubmed-6822775
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-68227752019-11-12 A multi-scale coevolutionary approach to predict interactions between protein domains Croce, Giancarlo Gueudré, Thomas Ruiz Cuevas, Maria Virginia Keidel, Victoria Figliuzzi, Matteo Szurmant, Hendrik Weigt, Martin PLoS Comput Biol Research Article Interacting proteins and protein domains coevolve on multiple scales, from their correlated presence across species, to correlations in amino-acid usage. Genomic databases provide rapidly growing data for variability in genomic protein content and in protein sequences, calling for computational predictions of unknown interactions. We first introduce the concept of direct phyletic couplings, based on global statistical models of phylogenetic profiles. They strongly increase the accuracy of predicting pairs of related protein domains beyond simpler correlation-based approaches like phylogenetic profiling (80% vs. 30–50% positives out of the 1000 highest-scoring pairs). Combined with the direct coupling analysis of inter-protein residue-residue coevolution, we provide multi-scale evidence for direct but unknown interaction between protein families. An in-depth discussion shows these to be biologically sensible and directly experimentally testable. Negative phyletic couplings highlight alternative solutions for the same functionality, including documented cases of convergent evolution. Thereby our work proves the strong potential of global statistical modeling approaches to genome-wide coevolutionary analysis, far beyond the established use for individual protein complexes and domain-domain interactions. Public Library of Science 2019-10-21 /pmc/articles/PMC6822775/ /pubmed/31634362 http://dx.doi.org/10.1371/journal.pcbi.1006891 Text en © 2019 Croce et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Croce, Giancarlo
Gueudré, Thomas
Ruiz Cuevas, Maria Virginia
Keidel, Victoria
Figliuzzi, Matteo
Szurmant, Hendrik
Weigt, Martin
A multi-scale coevolutionary approach to predict interactions between protein domains
title A multi-scale coevolutionary approach to predict interactions between protein domains
title_full A multi-scale coevolutionary approach to predict interactions between protein domains
title_fullStr A multi-scale coevolutionary approach to predict interactions between protein domains
title_full_unstemmed A multi-scale coevolutionary approach to predict interactions between protein domains
title_short A multi-scale coevolutionary approach to predict interactions between protein domains
title_sort multi-scale coevolutionary approach to predict interactions between protein domains
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6822775/
https://www.ncbi.nlm.nih.gov/pubmed/31634362
http://dx.doi.org/10.1371/journal.pcbi.1006891
work_keys_str_mv AT crocegiancarlo amultiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT gueudrethomas amultiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT ruizcuevasmariavirginia amultiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT keidelvictoria amultiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT figliuzzimatteo amultiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT szurmanthendrik amultiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT weigtmartin amultiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT crocegiancarlo multiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT gueudrethomas multiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT ruizcuevasmariavirginia multiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT keidelvictoria multiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT figliuzzimatteo multiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT szurmanthendrik multiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains
AT weigtmartin multiscalecoevolutionaryapproachtopredictinteractionsbetweenproteindomains