Cargando…

Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites

BACKGROUND: Protein-protein interactions (PPI) play a key role in an investigation of various biochemical processes, and their identification is thus of great importance. Although computational prediction of which amino acids take part in a PPI has been an active field of research for some time, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Jelínek, Jan, Škoda, Petr, Hoksza, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731498/
https://www.ncbi.nlm.nih.gov/pubmed/29244012
http://dx.doi.org/10.1186/s12859-017-1921-4
_version_ 1783286521829261312
author Jelínek, Jan
Škoda, Petr
Hoksza, David
author_facet Jelínek, Jan
Škoda, Petr
Hoksza, David
author_sort Jelínek, Jan
collection PubMed
description BACKGROUND: Protein-protein interactions (PPI) play a key role in an investigation of various biochemical processes, and their identification is thus of great importance. Although computational prediction of which amino acids take part in a PPI has been an active field of research for some time, the quality of in-silico methods is still far from perfect. RESULTS: We have developed a novel prediction method called INSPiRE which benefits from a knowledge base built from data available in Protein Data Bank. All proteins involved in PPIs were converted into labeled graphs with nodes corresponding to amino acids and edges to pairs of neighboring amino acids. A structural neighborhood of each node was then encoded into a bit string and stored in the knowledge base. When predicting PPIs, INSPiRE labels amino acids of unknown proteins as interface or non-interface based on how often their structural neighborhood appears as interface or non-interface in the knowledge base. We evaluated INSPiRE’s behavior with respect to different types and sizes of the structural neighborhood. Furthermore, we examined the suitability of several different features for labeling the nodes. Our evaluations showed that INSPiRE clearly outperforms existing methods with respect to Matthews correlation coefficient. CONCLUSION: In this paper we introduce a new knowledge-based method for identification of protein-protein interaction sites called INSPiRE. Its knowledge base utilizes structural patterns of known interaction sites in the Protein Data Bank which are then used for PPI prediction. Extensive experiments on several well-established datasets show that INSPiRE significantly surpasses existing PPI approaches.
format Online
Article
Text
id pubmed-5731498
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57314982017-12-19 Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites Jelínek, Jan Škoda, Petr Hoksza, David BMC Bioinformatics Research BACKGROUND: Protein-protein interactions (PPI) play a key role in an investigation of various biochemical processes, and their identification is thus of great importance. Although computational prediction of which amino acids take part in a PPI has been an active field of research for some time, the quality of in-silico methods is still far from perfect. RESULTS: We have developed a novel prediction method called INSPiRE which benefits from a knowledge base built from data available in Protein Data Bank. All proteins involved in PPIs were converted into labeled graphs with nodes corresponding to amino acids and edges to pairs of neighboring amino acids. A structural neighborhood of each node was then encoded into a bit string and stored in the knowledge base. When predicting PPIs, INSPiRE labels amino acids of unknown proteins as interface or non-interface based on how often their structural neighborhood appears as interface or non-interface in the knowledge base. We evaluated INSPiRE’s behavior with respect to different types and sizes of the structural neighborhood. Furthermore, we examined the suitability of several different features for labeling the nodes. Our evaluations showed that INSPiRE clearly outperforms existing methods with respect to Matthews correlation coefficient. CONCLUSION: In this paper we introduce a new knowledge-based method for identification of protein-protein interaction sites called INSPiRE. Its knowledge base utilizes structural patterns of known interaction sites in the Protein Data Bank which are then used for PPI prediction. Extensive experiments on several well-established datasets show that INSPiRE significantly surpasses existing PPI approaches. BioMed Central 2017-12-06 /pmc/articles/PMC5731498/ /pubmed/29244012 http://dx.doi.org/10.1186/s12859-017-1921-4 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Jelínek, Jan
Škoda, Petr
Hoksza, David
Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites
title Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites
title_full Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites
title_fullStr Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites
title_full_unstemmed Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites
title_short Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites
title_sort utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731498/
https://www.ncbi.nlm.nih.gov/pubmed/29244012
http://dx.doi.org/10.1186/s12859-017-1921-4
work_keys_str_mv AT jelinekjan utilizingknowledgebaseofaminoacidsstructuralneighborhoodstopredictproteinproteininteractionsites
AT skodapetr utilizingknowledgebaseofaminoacidsstructuralneighborhoodstopredictproteinproteininteractionsites
AT hokszadavid utilizingknowledgebaseofaminoacidsstructuralneighborhoodstopredictproteinproteininteractionsites