Cargando…
Topsy-Turvy: integrating a global view into sequence-based PPI prediction
SUMMARY: Computational methods to predict protein–protein interaction (PPI) typically segregate into sequence-based ‘bottom-up’ methods that infer properties from the characteristics of the individual protein sequences, or global ‘top-down’ methods that infer properties from the pattern of already k...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235477/ https://www.ncbi.nlm.nih.gov/pubmed/35758793 http://dx.doi.org/10.1093/bioinformatics/btac258 |
_version_ | 1784736318319755264 |
---|---|
author | Singh, Rohit Devkota, Kapil Sledzieski, Samuel Berger, Bonnie Cowen, Lenore |
author_facet | Singh, Rohit Devkota, Kapil Sledzieski, Samuel Berger, Bonnie Cowen, Lenore |
author_sort | Singh, Rohit |
collection | PubMed |
description | SUMMARY: Computational methods to predict protein–protein interaction (PPI) typically segregate into sequence-based ‘bottom-up’ methods that infer properties from the characteristics of the individual protein sequences, or global ‘top-down’ methods that infer properties from the pattern of already known PPIs in the species of interest. However, a way to incorporate top-down insights into sequence-based bottom-up PPI prediction methods has been elusive. We thus introduce Topsy-Turvy, a method that newly synthesizes both views in a sequence-based, multi-scale, deep-learning model for PPI prediction. While Topsy-Turvy makes predictions using only sequence data, during the training phase it takes a transfer-learning approach by incorporating patterns from both global and molecular-level views of protein interaction. In a cross-species context, we show it achieves state-of-the-art performance, offering the ability to perform genome-scale, interpretable PPI prediction for non-model organisms with no existing experimental PPI data. In species with available experimental PPI data, we further present a Topsy-Turvy hybrid (TT-Hybrid) model which integrates Topsy-Turvy with a purely network-based model for link prediction that provides information about species-specific network rewiring. TT-Hybrid makes accurate predictions for both well- and sparsely-characterized proteins, outperforming both its constituent components as well as other state-of-the-art PPI prediction methods. Furthermore, running Topsy-Turvy and TT-Hybrid screens is feasible for whole genomes, and thus these methods scale to settings where other methods (e.g. AlphaFold-Multimer) might be infeasible. The generalizability, accuracy and genome-level scalability of Topsy-Turvy and TT-Hybrid unlocks a more comprehensive map of protein interaction and organization in both model and non-model organisms. AVAILABILITY AND IMPLEMENTATION: https://topsyturvy.csail.mit.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9235477 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-92354772022-06-29 Topsy-Turvy: integrating a global view into sequence-based PPI prediction Singh, Rohit Devkota, Kapil Sledzieski, Samuel Berger, Bonnie Cowen, Lenore Bioinformatics ISCB/Ismb 2022 SUMMARY: Computational methods to predict protein–protein interaction (PPI) typically segregate into sequence-based ‘bottom-up’ methods that infer properties from the characteristics of the individual protein sequences, or global ‘top-down’ methods that infer properties from the pattern of already known PPIs in the species of interest. However, a way to incorporate top-down insights into sequence-based bottom-up PPI prediction methods has been elusive. We thus introduce Topsy-Turvy, a method that newly synthesizes both views in a sequence-based, multi-scale, deep-learning model for PPI prediction. While Topsy-Turvy makes predictions using only sequence data, during the training phase it takes a transfer-learning approach by incorporating patterns from both global and molecular-level views of protein interaction. In a cross-species context, we show it achieves state-of-the-art performance, offering the ability to perform genome-scale, interpretable PPI prediction for non-model organisms with no existing experimental PPI data. In species with available experimental PPI data, we further present a Topsy-Turvy hybrid (TT-Hybrid) model which integrates Topsy-Turvy with a purely network-based model for link prediction that provides information about species-specific network rewiring. TT-Hybrid makes accurate predictions for both well- and sparsely-characterized proteins, outperforming both its constituent components as well as other state-of-the-art PPI prediction methods. Furthermore, running Topsy-Turvy and TT-Hybrid screens is feasible for whole genomes, and thus these methods scale to settings where other methods (e.g. AlphaFold-Multimer) might be infeasible. The generalizability, accuracy and genome-level scalability of Topsy-Turvy and TT-Hybrid unlocks a more comprehensive map of protein interaction and organization in both model and non-model organisms. AVAILABILITY AND IMPLEMENTATION: https://topsyturvy.csail.mit.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-06-27 /pmc/articles/PMC9235477/ /pubmed/35758793 http://dx.doi.org/10.1093/bioinformatics/btac258 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | ISCB/Ismb 2022 Singh, Rohit Devkota, Kapil Sledzieski, Samuel Berger, Bonnie Cowen, Lenore Topsy-Turvy: integrating a global view into sequence-based PPI prediction |
title | Topsy-Turvy: integrating a global view into sequence-based PPI prediction |
title_full | Topsy-Turvy: integrating a global view into sequence-based PPI prediction |
title_fullStr | Topsy-Turvy: integrating a global view into sequence-based PPI prediction |
title_full_unstemmed | Topsy-Turvy: integrating a global view into sequence-based PPI prediction |
title_short | Topsy-Turvy: integrating a global view into sequence-based PPI prediction |
title_sort | topsy-turvy: integrating a global view into sequence-based ppi prediction |
topic | ISCB/Ismb 2022 |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235477/ https://www.ncbi.nlm.nih.gov/pubmed/35758793 http://dx.doi.org/10.1093/bioinformatics/btac258 |
work_keys_str_mv | AT singhrohit topsyturvyintegratingaglobalviewintosequencebasedppiprediction AT devkotakapil topsyturvyintegratingaglobalviewintosequencebasedppiprediction AT sledzieskisamuel topsyturvyintegratingaglobalviewintosequencebasedppiprediction AT bergerbonnie topsyturvyintegratingaglobalviewintosequencebasedppiprediction AT cowenlenore topsyturvyintegratingaglobalviewintosequencebasedppiprediction |