Cargando…
Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes
Predicting protein–protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7370293/ https://www.ncbi.nlm.nih.gov/pubmed/32640745 http://dx.doi.org/10.3390/ijms21134787 |
_version_ | 1783560961134690304 |
---|---|
author | Poot Velez, Albros Hermes Fontove, Fernando Del Rio, Gabriel |
author_facet | Poot Velez, Albros Hermes Fontove, Fernando Del Rio, Gabriel |
author_sort | Poot Velez, Albros Hermes |
collection | PubMed |
description | Predicting protein–protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing the proper descriptors of proteins to represent these interactions, among others. In the current work, we provide a representative protein structure that is amenable to PPI classification using machine learning approaches, referred to as residue cluster classes. Through sampling and optimization, we identified the best algorithm–parameter pair to classify PPI from more than 360 different training sets. We tested these classifiers against PPI datasets that were not included in the training set but shared sequence similarity with proteins in the training set to reproduce the situation of most proteins sharing sequence similarity with others. We identified a model with almost no PPI error (96–99% of correctly classified instances) and showed that residue cluster classes of protein pairs displayed a distinct pattern between positive and negative protein interactions. Our results indicated that residue cluster classes are structural features relevant to model PPI and provide a novel tool to mathematically model the protein structure/function relationship. |
format | Online Article Text |
id | pubmed-7370293 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-73702932020-08-07 Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes Poot Velez, Albros Hermes Fontove, Fernando Del Rio, Gabriel Int J Mol Sci Article Predicting protein–protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing the proper descriptors of proteins to represent these interactions, among others. In the current work, we provide a representative protein structure that is amenable to PPI classification using machine learning approaches, referred to as residue cluster classes. Through sampling and optimization, we identified the best algorithm–parameter pair to classify PPI from more than 360 different training sets. We tested these classifiers against PPI datasets that were not included in the training set but shared sequence similarity with proteins in the training set to reproduce the situation of most proteins sharing sequence similarity with others. We identified a model with almost no PPI error (96–99% of correctly classified instances) and showed that residue cluster classes of protein pairs displayed a distinct pattern between positive and negative protein interactions. Our results indicated that residue cluster classes are structural features relevant to model PPI and provide a novel tool to mathematically model the protein structure/function relationship. MDPI 2020-07-06 /pmc/articles/PMC7370293/ /pubmed/32640745 http://dx.doi.org/10.3390/ijms21134787 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Poot Velez, Albros Hermes Fontove, Fernando Del Rio, Gabriel Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes |
title | Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes |
title_full | Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes |
title_fullStr | Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes |
title_full_unstemmed | Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes |
title_short | Protein–Protein Interactions Efficiently Modeled by Residue Cluster Classes |
title_sort | protein–protein interactions efficiently modeled by residue cluster classes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7370293/ https://www.ncbi.nlm.nih.gov/pubmed/32640745 http://dx.doi.org/10.3390/ijms21134787 |
work_keys_str_mv | AT pootvelezalbroshermes proteinproteininteractionsefficientlymodeledbyresidueclusterclasses AT fontovefernando proteinproteininteractionsefficientlymodeledbyresidueclusterclasses AT delriogabriel proteinproteininteractionsefficientlymodeledbyresidueclusterclasses |