Cargando…
PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning
Proteome-wide identification of protein-protein interactions is a formidable task which has yet to be sufficiently addressed by experimental methodologies. Many computational methods have been developed to predict proteome-wide interaction networks, but few leverage both the sensitivity of structura...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8897833/ https://www.ncbi.nlm.nih.gov/pubmed/35662463 http://dx.doi.org/10.1016/j.jmb.2022.167530 |
_version_ | 1784663512319000576 |
---|---|
author | Bell, Eric W. Schwartz, Jacob H. Freddolino, Peter L. Zhang, Yang |
author_facet | Bell, Eric W. Schwartz, Jacob H. Freddolino, Peter L. Zhang, Yang |
author_sort | Bell, Eric W. |
collection | PubMed |
description | Proteome-wide identification of protein-protein interactions is a formidable task which has yet to be sufficiently addressed by experimental methodologies. Many computational methods have been developed to predict proteome-wide interaction networks, but few leverage both the sensitivity of structural information and the wide availability of sequence data. We present PEPPI, a pipeline which integrates structural similarity, sequence similarity, functional association data, and machine learning-based classification through a naïve Bayesian classifier model to accurately predict protein-protein interactions at a proteomic scale. Through benchmarking against a set of 798 ground truth interactions and an equal number of non-interactions, we have found that PEPPI attains 4.5% higher AUROC than the best of other state-of-the-art methods. As a proteomic-scale application, PEPPI was applied to model the interactions which occur between SARS-CoV-2 and human host cells during coronavirus infection, where 403 high-confidence interactions were identified with predictions covering 73% of a gold standard dataset from PSICQUIC and demonstrating significant complementarity with the most recent high-throughput experiments. PEPPI is available both as a webserver and in a standalone version and should be a powerful and generally applicable tool for computational screening of protein-protein interactions. |
format | Online Article Text |
id | pubmed-8897833 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88978332022-03-07 PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning Bell, Eric W. Schwartz, Jacob H. Freddolino, Peter L. Zhang, Yang J Mol Biol Web Server Proteome-wide identification of protein-protein interactions is a formidable task which has yet to be sufficiently addressed by experimental methodologies. Many computational methods have been developed to predict proteome-wide interaction networks, but few leverage both the sensitivity of structural information and the wide availability of sequence data. We present PEPPI, a pipeline which integrates structural similarity, sequence similarity, functional association data, and machine learning-based classification through a naïve Bayesian classifier model to accurately predict protein-protein interactions at a proteomic scale. Through benchmarking against a set of 798 ground truth interactions and an equal number of non-interactions, we have found that PEPPI attains 4.5% higher AUROC than the best of other state-of-the-art methods. As a proteomic-scale application, PEPPI was applied to model the interactions which occur between SARS-CoV-2 and human host cells during coronavirus infection, where 403 high-confidence interactions were identified with predictions covering 73% of a gold standard dataset from PSICQUIC and demonstrating significant complementarity with the most recent high-throughput experiments. PEPPI is available both as a webserver and in a standalone version and should be a powerful and generally applicable tool for computational screening of protein-protein interactions. Elsevier Ltd. 2022-06-15 2022-03-05 /pmc/articles/PMC8897833/ /pubmed/35662463 http://dx.doi.org/10.1016/j.jmb.2022.167530 Text en © 2022 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Web Server Bell, Eric W. Schwartz, Jacob H. Freddolino, Peter L. Zhang, Yang PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning |
title | PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning |
title_full | PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning |
title_fullStr | PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning |
title_full_unstemmed | PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning |
title_short | PEPPI: Whole-proteome Protein-protein Interaction Prediction through Structure and Sequence Similarity, Functional Association, and Machine Learning |
title_sort | peppi: whole-proteome protein-protein interaction prediction through structure and sequence similarity, functional association, and machine learning |
topic | Web Server |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8897833/ https://www.ncbi.nlm.nih.gov/pubmed/35662463 http://dx.doi.org/10.1016/j.jmb.2022.167530 |
work_keys_str_mv | AT bellericw peppiwholeproteomeproteinproteininteractionpredictionthroughstructureandsequencesimilarityfunctionalassociationandmachinelearning AT schwartzjacobh peppiwholeproteomeproteinproteininteractionpredictionthroughstructureandsequencesimilarityfunctionalassociationandmachinelearning AT freddolinopeterl peppiwholeproteomeproteinproteininteractionpredictionthroughstructureandsequencesimilarityfunctionalassociationandmachinelearning AT zhangyang peppiwholeproteomeproteinproteininteractionpredictionthroughstructureandsequencesimilarityfunctionalassociationandmachinelearning |