Cargando…
Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins
Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are spec...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10089317/ https://www.ncbi.nlm.nih.gov/pubmed/36996234 http://dx.doi.org/10.1371/journal.pcbi.1011010 |
_version_ | 1785022740119420928 |
---|---|
author | Gandarilla-Pérez, Carlos A. Pinilla, Sergio Bitbol, Anne-Florence Weigt, Martin |
author_facet | Gandarilla-Pérez, Carlos A. Pinilla, Sergio Bitbol, Anne-Florence Weigt, Martin |
author_sort | Gandarilla-Pérez, Carlos A. |
collection | PubMed |
description | Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to improve the performance of the inference of interaction partners among paralogs. For this, we first align the sequence-similarity graphs of the two families through simulated annealing, yielding a robust partial pairing. We next use this partial pairing to seed a coevolution-based iterative pairing algorithm. This combined method improves performance over either separate method. The improvement obtained is striking in the difficult cases where the average number of paralogs per species is large or where the total number of sequences is modest. |
format | Online Article Text |
id | pubmed-10089317 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-100893172023-04-12 Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins Gandarilla-Pérez, Carlos A. Pinilla, Sergio Bitbol, Anne-Florence Weigt, Martin PLoS Comput Biol Research Article Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to improve the performance of the inference of interaction partners among paralogs. For this, we first align the sequence-similarity graphs of the two families through simulated annealing, yielding a robust partial pairing. We next use this partial pairing to seed a coevolution-based iterative pairing algorithm. This combined method improves performance over either separate method. The improvement obtained is striking in the difficult cases where the average number of paralogs per species is large or where the total number of sequences is modest. Public Library of Science 2023-03-30 /pmc/articles/PMC10089317/ /pubmed/36996234 http://dx.doi.org/10.1371/journal.pcbi.1011010 Text en © 2023 Gandarilla-Pérez et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Gandarilla-Pérez, Carlos A. Pinilla, Sergio Bitbol, Anne-Florence Weigt, Martin Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins |
title | Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins |
title_full | Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins |
title_fullStr | Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins |
title_full_unstemmed | Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins |
title_short | Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins |
title_sort | combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10089317/ https://www.ncbi.nlm.nih.gov/pubmed/36996234 http://dx.doi.org/10.1371/journal.pcbi.1011010 |
work_keys_str_mv | AT gandarillaperezcarlosa combiningphylogenyandcoevolutionimprovestheinferenceofinteractionpartnersamongparalogousproteins AT pinillasergio combiningphylogenyandcoevolutionimprovestheinferenceofinteractionpartnersamongparalogousproteins AT bitbolanneflorence combiningphylogenyandcoevolutionimprovestheinferenceofinteractionpartnersamongparalogousproteins AT weigtmartin combiningphylogenyandcoevolutionimprovestheinferenceofinteractionpartnersamongparalogousproteins |