Cargando…

Topsy-Turvy: integrating a global view into sequence-based PPI prediction

SUMMARY: Computational methods to predict protein–protein interaction (PPI) typically segregate into sequence-based ‘bottom-up’ methods that infer properties from the characteristics of the individual protein sequences, or global ‘top-down’ methods that infer properties from the pattern of already k...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Rohit, Devkota, Kapil, Sledzieski, Samuel, Berger, Bonnie, Cowen, Lenore
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235477/
https://www.ncbi.nlm.nih.gov/pubmed/35758793
http://dx.doi.org/10.1093/bioinformatics/btac258
_version_ 1784736318319755264
author Singh, Rohit
Devkota, Kapil
Sledzieski, Samuel
Berger, Bonnie
Cowen, Lenore
author_facet Singh, Rohit
Devkota, Kapil
Sledzieski, Samuel
Berger, Bonnie
Cowen, Lenore
author_sort Singh, Rohit
collection PubMed
description SUMMARY: Computational methods to predict protein–protein interaction (PPI) typically segregate into sequence-based ‘bottom-up’ methods that infer properties from the characteristics of the individual protein sequences, or global ‘top-down’ methods that infer properties from the pattern of already known PPIs in the species of interest. However, a way to incorporate top-down insights into sequence-based bottom-up PPI prediction methods has been elusive. We thus introduce Topsy-Turvy, a method that newly synthesizes both views in a sequence-based, multi-scale, deep-learning model for PPI prediction. While Topsy-Turvy makes predictions using only sequence data, during the training phase it takes a transfer-learning approach by incorporating patterns from both global and molecular-level views of protein interaction. In a cross-species context, we show it achieves state-of-the-art performance, offering the ability to perform genome-scale, interpretable PPI prediction for non-model organisms with no existing experimental PPI data. In species with available experimental PPI data, we further present a Topsy-Turvy hybrid (TT-Hybrid) model which integrates Topsy-Turvy with a purely network-based model for link prediction that provides information about species-specific network rewiring. TT-Hybrid makes accurate predictions for both well- and sparsely-characterized proteins, outperforming both its constituent components as well as other state-of-the-art PPI prediction methods. Furthermore, running Topsy-Turvy and TT-Hybrid screens is feasible for whole genomes, and thus these methods scale to settings where other methods (e.g. AlphaFold-Multimer) might be infeasible. The generalizability, accuracy and genome-level scalability of Topsy-Turvy and TT-Hybrid unlocks a more comprehensive map of protein interaction and organization in both model and non-model organisms. AVAILABILITY AND IMPLEMENTATION: https://topsyturvy.csail.mit.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9235477
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92354772022-06-29 Topsy-Turvy: integrating a global view into sequence-based PPI prediction Singh, Rohit Devkota, Kapil Sledzieski, Samuel Berger, Bonnie Cowen, Lenore Bioinformatics ISCB/Ismb 2022 SUMMARY: Computational methods to predict protein–protein interaction (PPI) typically segregate into sequence-based ‘bottom-up’ methods that infer properties from the characteristics of the individual protein sequences, or global ‘top-down’ methods that infer properties from the pattern of already known PPIs in the species of interest. However, a way to incorporate top-down insights into sequence-based bottom-up PPI prediction methods has been elusive. We thus introduce Topsy-Turvy, a method that newly synthesizes both views in a sequence-based, multi-scale, deep-learning model for PPI prediction. While Topsy-Turvy makes predictions using only sequence data, during the training phase it takes a transfer-learning approach by incorporating patterns from both global and molecular-level views of protein interaction. In a cross-species context, we show it achieves state-of-the-art performance, offering the ability to perform genome-scale, interpretable PPI prediction for non-model organisms with no existing experimental PPI data. In species with available experimental PPI data, we further present a Topsy-Turvy hybrid (TT-Hybrid) model which integrates Topsy-Turvy with a purely network-based model for link prediction that provides information about species-specific network rewiring. TT-Hybrid makes accurate predictions for both well- and sparsely-characterized proteins, outperforming both its constituent components as well as other state-of-the-art PPI prediction methods. Furthermore, running Topsy-Turvy and TT-Hybrid screens is feasible for whole genomes, and thus these methods scale to settings where other methods (e.g. AlphaFold-Multimer) might be infeasible. The generalizability, accuracy and genome-level scalability of Topsy-Turvy and TT-Hybrid unlocks a more comprehensive map of protein interaction and organization in both model and non-model organisms. AVAILABILITY AND IMPLEMENTATION: https://topsyturvy.csail.mit.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-06-27 /pmc/articles/PMC9235477/ /pubmed/35758793 http://dx.doi.org/10.1093/bioinformatics/btac258 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle ISCB/Ismb 2022
Singh, Rohit
Devkota, Kapil
Sledzieski, Samuel
Berger, Bonnie
Cowen, Lenore
Topsy-Turvy: integrating a global view into sequence-based PPI prediction
title Topsy-Turvy: integrating a global view into sequence-based PPI prediction
title_full Topsy-Turvy: integrating a global view into sequence-based PPI prediction
title_fullStr Topsy-Turvy: integrating a global view into sequence-based PPI prediction
title_full_unstemmed Topsy-Turvy: integrating a global view into sequence-based PPI prediction
title_short Topsy-Turvy: integrating a global view into sequence-based PPI prediction
title_sort topsy-turvy: integrating a global view into sequence-based ppi prediction
topic ISCB/Ismb 2022
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235477/
https://www.ncbi.nlm.nih.gov/pubmed/35758793
http://dx.doi.org/10.1093/bioinformatics/btac258
work_keys_str_mv AT singhrohit topsyturvyintegratingaglobalviewintosequencebasedppiprediction
AT devkotakapil topsyturvyintegratingaglobalviewintosequencebasedppiprediction
AT sledzieskisamuel topsyturvyintegratingaglobalviewintosequencebasedppiprediction
AT bergerbonnie topsyturvyintegratingaglobalviewintosequencebasedppiprediction
AT cowenlenore topsyturvyintegratingaglobalviewintosequencebasedppiprediction