Cargando…

Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages

Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidatio...

Descripción completa

Detalles Bibliográficos
Autores principales: de Moraes, Fábio R., Neshich, Izabella A. P., Mazoni, Ivan, Yano, Inácio H., Pereira, José G. C., Salim, José A., Jardine, José G., Neshich, Goran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3904977/
https://www.ncbi.nlm.nih.gov/pubmed/24489849
http://dx.doi.org/10.1371/journal.pone.0087107
_version_ 1782301270851190784
author de Moraes, Fábio R.
Neshich, Izabella A. P.
Mazoni, Ivan
Yano, Inácio H.
Pereira, José G. C.
Salim, José A.
Jardine, José G.
Neshich, Goran
author_facet de Moraes, Fábio R.
Neshich, Izabella A. P.
Mazoni, Ivan
Yano, Inácio H.
Pereira, José G. C.
Salim, José A.
Jardine, José G.
Neshich, Goran
author_sort de Moraes, Fábio R.
collection PubMed
description Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now integrated into the BlueStar STING suite of programs. Consequently, the prediction of protein-protein interfaces for all proteins available in the PDB is possible through STING_interfaces module, accessible at the following website: (http://www.cbi.cnptia.embrapa.br/SMS/predictions/index.html).
format Online
Article
Text
id pubmed-3904977
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39049772014-01-31 Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages de Moraes, Fábio R. Neshich, Izabella A. P. Mazoni, Ivan Yano, Inácio H. Pereira, José G. C. Salim, José A. Jardine, José G. Neshich, Goran PLoS One Research Article Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now integrated into the BlueStar STING suite of programs. Consequently, the prediction of protein-protein interfaces for all proteins available in the PDB is possible through STING_interfaces module, accessible at the following website: (http://www.cbi.cnptia.embrapa.br/SMS/predictions/index.html). Public Library of Science 2014-01-28 /pmc/articles/PMC3904977/ /pubmed/24489849 http://dx.doi.org/10.1371/journal.pone.0087107 Text en © 2014 de Moraes et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
de Moraes, Fábio R.
Neshich, Izabella A. P.
Mazoni, Ivan
Yano, Inácio H.
Pereira, José G. C.
Salim, José A.
Jardine, José G.
Neshich, Goran
Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages
title Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages
title_full Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages
title_fullStr Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages
title_full_unstemmed Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages
title_short Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages
title_sort improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3904977/
https://www.ncbi.nlm.nih.gov/pubmed/24489849
http://dx.doi.org/10.1371/journal.pone.0087107
work_keys_str_mv AT demoraesfabior improvingpredictionsofproteinproteininterfacesbycombiningaminoacidspecificclassifiersbasedonstructuralandphysicochemicaldescriptorswiththeirweightedneighboraverages
AT neshichizabellaap improvingpredictionsofproteinproteininterfacesbycombiningaminoacidspecificclassifiersbasedonstructuralandphysicochemicaldescriptorswiththeirweightedneighboraverages
AT mazoniivan improvingpredictionsofproteinproteininterfacesbycombiningaminoacidspecificclassifiersbasedonstructuralandphysicochemicaldescriptorswiththeirweightedneighboraverages
AT yanoinacioh improvingpredictionsofproteinproteininterfacesbycombiningaminoacidspecificclassifiersbasedonstructuralandphysicochemicaldescriptorswiththeirweightedneighboraverages
AT pereirajosegc improvingpredictionsofproteinproteininterfacesbycombiningaminoacidspecificclassifiersbasedonstructuralandphysicochemicaldescriptorswiththeirweightedneighboraverages
AT salimjosea improvingpredictionsofproteinproteininterfacesbycombiningaminoacidspecificclassifiersbasedonstructuralandphysicochemicaldescriptorswiththeirweightedneighboraverages
AT jardinejoseg improvingpredictionsofproteinproteininterfacesbycombiningaminoacidspecificclassifiersbasedonstructuralandphysicochemicaldescriptorswiththeirweightedneighboraverages
AT neshichgoran improvingpredictionsofproteinproteininterfacesbycombiningaminoacidspecificclassifiersbasedonstructuralandphysicochemicaldescriptorswiththeirweightedneighboraverages