
ProteinPrompt: a webserver for predicting protein–protein interactions

MOTIVATION: Protein–protein interactions (PPIs) play an essential role in a great variety of cellular processes and are therefore of significant interest for the design of new therapeutic compounds as well as the identification of side effects due to unexpected binding. Here, we present ProteinPromp...


Bibliographic Details
Main Authors:	Canzler, Sebastian; Fischer, Markus; Ulbricht, David; Ristic, Nikola; Hildebrand, Peter W; Staritzbichler, René
Format:	Online Article Text
Language:	English
Published:	Oxford University Press 2022
Subjects:	Original Paper
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710678/ https://www.ncbi.nlm.nih.gov/pubmed/36699419 http://dx.doi.org/10.1093/bioadv/vbac059

author	Canzler, Sebastian; Fischer, Markus; Ulbricht, David; Ristic, Nikola; Hildebrand, Peter W; Staritzbichler, René
author_facet	Canzler, Sebastian; Fischer, Markus; Ulbricht, David; Ristic, Nikola; Hildebrand, Peter W; Staritzbichler, René
author_sort	Canzler, Sebastian
collection	PubMed
description	MOTIVATION: Protein–protein interactions (PPIs) play an essential role in a great variety of cellular processes and are therefore of significant interest for the design of new therapeutic compounds as well as the identification of side effects due to unexpected binding. Here, we present ProteinPrompt, a webserver that uses machine learning algorithms to calculate specific, currently unknown PPIs. Our tool is designed to quickly and reliably predict contact propensities based on an input sequence in order to scan large sequence libraries for potential binding partners, with the goal of accelerating the laborious process of drug target identification and assuring its quality. RESULTS: We collected and thoroughly filtered a comprehensive database of known binders from several sources, which is available for download. ProteinPrompt provides two complementary search methods of similar accuracy for comparison and consensus building. The default method is a random forest (RF) algorithm that uses the auto-correlations of seven amino acid scales. Alternatively, a graph neural network (GNN) implementation can be selected. Additionally, a consensus prediction is available. For each query sequence, potential binding partners are identified from a protein sequence database. The proteomes of several organisms are available and can be searched for binders. To evaluate the predictive power of the algorithms, we prepared a test dataset that was rigorously filtered for redundancy. No sequence pairs similar to the ones used for training were included in this dataset. With this challenging dataset, the RF method achieved an accuracy rate of 0.88 and an area under the curve of 0.95. The GNN achieved an accuracy rate of 0.86 using the same dataset. Since the underlying learning approaches are unrelated, comparing the results of the RF and the GNN reduces the likelihood of errors. The consensus reached an accuracy of 0.89.
AVAILABILITY AND IMPLEMENTATION: ProteinPrompt is available online at: http://proteinformatics.org/ProteinPrompt, where training and test data used to optimize the methods are also available. The server makes it possible to scan the human proteome for potential binding partners of an input sequence within minutes. For local offline usage, we furthermore created a ProteinPrompt Docker image which allows for batch submission: https://gitlab.hzdr.de/proteinprompt/ProteinPrompt. In conclusion, we offer a fast, accurate, easy-to-use online service for predicting binding partners from an input sequence.
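The description above says the random forest works on auto-correlations of seven amino acid scales. As a rough illustration of that kind of feature, the sketch below computes a commonly used auto-covariance descriptor for one scale. The abstract does not name the seven scales, the lag range, or the exact formula, so everything here is an assumption: the Kyte-Doolittle hydropathy scale stands in for one of the scales, and `auto_covariance`, `pair_features`, and `max_lag` are hypothetical names, not part of ProteinPrompt.

```python
from statistics import fmean

# Kyte-Doolittle hydropathy values, used here only as an example scale.
# Assumption: the source does not say which seven scales ProteinPrompt uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def auto_covariance(sequence, scale, max_lag=3):
    """Return one auto-covariance value per lag in 1..max_lag.

    Each residue is mapped to its scale value, the sequence mean is
    subtracted, and products of residues `lag` positions apart are averaged.
    """
    values = [scale[residue] for residue in sequence]
    n = len(values)
    if n <= max_lag:
        raise ValueError("sequence must be longer than max_lag")
    mean = fmean(values)
    centered = [v - mean for v in values]
    return [
        sum(centered[i] * centered[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]


def pair_features(seq_a, seq_b, scales, max_lag=3):
    """Concatenate the descriptors of both sequences into one feature vector.

    Hypothetical layout: for each sequence, for each scale, max_lag values.
    """
    features = []
    for seq in (seq_a, seq_b):
        for scale in scales:
            features.extend(auto_covariance(seq, scale, max_lag))
    return features
```

With seven scales and a chosen `max_lag`, `pair_features` would yield a fixed-length vector of `2 * 7 * max_lag` numbers regardless of sequence length, which is the property that makes this kind of descriptor convenient as input to a classifier such as a random forest.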
format	Online Article Text
id	pubmed-9710678
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-9710678 2023-01-24 ProteinPrompt: a webserver for predicting protein–protein interactions Canzler, Sebastian; Fischer, Markus; Ulbricht, David; Ristic, Nikola; Hildebrand, Peter W; Staritzbichler, René Bioinform Adv Original Paper Oxford University Press 2022-08-17 /pmc/articles/PMC9710678/ /pubmed/36699419 http://dx.doi.org/10.1093/bioadv/vbac059 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title	ProteinPrompt: a webserver for predicting protein–protein interactions
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710678/ https://www.ncbi.nlm.nih.gov/pubmed/36699419 http://dx.doi.org/10.1093/bioadv/vbac059
