Cargando…

Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins

Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are spec...

Descripción completa

Detalles Bibliográficos
Autores principales: Gandarilla-Pérez, Carlos A., Pinilla, Sergio, Bitbol, Anne-Florence, Weigt, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10089317/
https://www.ncbi.nlm.nih.gov/pubmed/36996234
http://dx.doi.org/10.1371/journal.pcbi.1011010
_version_ 1785022740119420928
author Gandarilla-Pérez, Carlos A.
Pinilla, Sergio
Bitbol, Anne-Florence
Weigt, Martin
author_facet Gandarilla-Pérez, Carlos A.
Pinilla, Sergio
Bitbol, Anne-Florence
Weigt, Martin
author_sort Gandarilla-Pérez, Carlos A.
collection PubMed
description Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to improve the performance of the inference of interaction partners among paralogs. For this, we first align the sequence-similarity graphs of the two families through simulated annealing, yielding a robust partial pairing. We next use this partial pairing to seed a coevolution-based iterative pairing algorithm. This combined method improves performance over either separate method. The improvement obtained is striking in the difficult cases where the average number of paralogs per species is large or where the total number of sequences is modest.
format Online
Article
Text
id pubmed-10089317
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-100893172023-04-12 Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins Gandarilla-Pérez, Carlos A. Pinilla, Sergio Bitbol, Anne-Florence Weigt, Martin PLoS Comput Biol Research Article Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to improve the performance of the inference of interaction partners among paralogs. For this, we first align the sequence-similarity graphs of the two families through simulated annealing, yielding a robust partial pairing. We next use this partial pairing to seed a coevolution-based iterative pairing algorithm. This combined method improves performance over either separate method. The improvement obtained is striking in the difficult cases where the average number of paralogs per species is large or where the total number of sequences is modest. Public Library of Science 2023-03-30 /pmc/articles/PMC10089317/ /pubmed/36996234 http://dx.doi.org/10.1371/journal.pcbi.1011010 Text en © 2023 Gandarilla-Pérez et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gandarilla-Pérez, Carlos A.
Pinilla, Sergio
Bitbol, Anne-Florence
Weigt, Martin
Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins
title Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins
title_full Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins
title_fullStr Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins
title_full_unstemmed Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins
title_short Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins
title_sort combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10089317/
https://www.ncbi.nlm.nih.gov/pubmed/36996234
http://dx.doi.org/10.1371/journal.pcbi.1011010
work_keys_str_mv AT gandarillaperezcarlosa combiningphylogenyandcoevolutionimprovestheinferenceofinteractionpartnersamongparalogousproteins
AT pinillasergio combiningphylogenyandcoevolutionimprovestheinferenceofinteractionpartnersamongparalogousproteins
AT bitbolanneflorence combiningphylogenyandcoevolutionimprovestheinferenceofinteractionpartnersamongparalogousproteins
AT weigtmartin combiningphylogenyandcoevolutionimprovestheinferenceofinteractionpartnersamongparalogousproteins