Cargando…

Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes

Predicting protein–protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing...

Descripción completa

Detalles Bibliográficos
Autores principales: Poot Velez, Albros Hermes, Fontove, Fernando, Del Rio, Gabriel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7370293/
https://www.ncbi.nlm.nih.gov/pubmed/32640745
http://dx.doi.org/10.3390/ijms21134787
_version_ 1783560961134690304
author Poot Velez, Albros Hermes
Fontove, Fernando
Del Rio, Gabriel
author_facet Poot Velez, Albros Hermes
Fontove, Fernando
Del Rio, Gabriel
author_sort Poot Velez, Albros Hermes
collection PubMed
description Predicting protein–protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing the proper descriptors of proteins to represent these interactions, among others. In the current work, we provide a representative protein structure that is amenable to PPI classification using machine learning approaches, referred to as residue cluster classes. Through sampling and optimization, we identified the best algorithm–parameter pair to classify PPI from more than 360 different training sets. We tested these classifiers against PPI datasets that were not included in the training set but shared sequence similarity with proteins in the training set to reproduce the situation of most proteins sharing sequence similarity with others. We identified a model with almost no PPI error (96–99% of correctly classified instances) and showed that residue cluster classes of protein pairs displayed a distinct pattern between positive and negative protein interactions. Our results indicated that residue cluster classes are structural features relevant to model PPI and provide a novel tool to mathematically model the protein structure/function relationship.
format Online
Article
Text
id pubmed-7370293
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73702932020-08-07 Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes Poot Velez, Albros Hermes Fontove, Fernando Del Rio, Gabriel Int J Mol Sci Article Predicting protein–protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing the proper descriptors of proteins to represent these interactions, among others. In the current work, we provide a representative protein structure that is amenable to PPI classification using machine learning approaches, referred to as residue cluster classes. Through sampling and optimization, we identified the best algorithm–parameter pair to classify PPI from more than 360 different training sets. We tested these classifiers against PPI datasets that were not included in the training set but shared sequence similarity with proteins in the training set to reproduce the situation of most proteins sharing sequence similarity with others. We identified a model with almost no PPI error (96–99% of correctly classified instances) and showed that residue cluster classes of protein pairs displayed a distinct pattern between positive and negative protein interactions. Our results indicated that residue cluster classes are structural features relevant to model PPI and provide a novel tool to mathematically model the protein structure/function relationship. MDPI 2020-07-06 /pmc/articles/PMC7370293/ /pubmed/32640745 http://dx.doi.org/10.3390/ijms21134787 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Poot Velez, Albros Hermes
Fontove, Fernando
Del Rio, Gabriel
Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes
title Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes
title_full Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes
title_fullStr Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes
title_full_unstemmed Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes
title_short Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes
title_sort protein–protein interactions efficiently modeled by residue cluster classes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7370293/
https://www.ncbi.nlm.nih.gov/pubmed/32640745
http://dx.doi.org/10.3390/ijms21134787
work_keys_str_mv AT pootvelezalbroshermes proteinproteininteractionsefficientlymodeledbyresidueclusterclasses
AT fontovefernando proteinproteininteractionsefficientlymodeledbyresidueclusterclasses
AT delriogabriel proteinproteininteractionsefficientlymodeledbyresidueclusterclasses