Cargando…

Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization

BACKGROUND: Protein–protein interactions (PPIs) are the core of protein function, which provide an effective means to understand the function at cell level. Identification of PPIs is the crucial foundation of predicting drug-target interactions. Although traditional biological experiments of identif...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Xiaoli, Zhang, Xiaolong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8094484/
https://www.ncbi.nlm.nih.gov/pubmed/33941163
http://dx.doi.org/10.1186/s12911-020-01350-4
_version_ 1783687980568805376
author Lin, Xiaoli
Zhang, Xiaolong
author_facet Lin, Xiaoli
Zhang, Xiaolong
author_sort Lin, Xiaoli
collection PubMed
description BACKGROUND: Protein–protein interactions (PPIs) are the core of protein function, which provide an effective means to understand the function at cell level. Identification of PPIs is the crucial foundation of predicting drug-target interactions. Although traditional biological experiments of identifying PPIs are becoming available, these experiments remain to be extremely time-consuming and expensive. Therefore, various computational models have been introduced to identify PPIs. In protein-protein interaction network (PPIN), Hub protein, as a highly connected node, can coordinate PPIs and play biological functions. Detecting hot regions on Hub protein interaction interfaces is an issue worthy of discussing. METHODS: Two clustering methods, LCSD and RCNOIK are used to detect the hot regions on Hub protein interaction interfaces in this paper. In order to improve the efficiency of K-means clustering algorithm, the best k value is selected by calculating the distance square sum and the average silhouette coefficients. Then, the optimization of residue coordination number strategy is used to calculate the average coordination number. In addition, the pair potentials and relative ASA (PPRA) strategy is also used to optimize the predicted results. RESULTS: DataHub dataset and PartyHub dataset were used to train two clustering models respectively. Experiments show that LCSD and RCNOIK have the same coverage with Hub protein datasets, and RCNOIK is slightly higher than LCSD in Precision. The predicted hot regions are closer to the standard hot regions. CONCLUSIONS: This paper optimizes two clustering methods based on PPRA strategy. Compared our methods for hot regions prediction against the well-known approaches, our improved methods have the higher reliability and are effective for predicting hot regions on Hub protein interaction interfaces.
format Online
Article
Text
id pubmed-8094484
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-80944842021-05-04 Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization Lin, Xiaoli Zhang, Xiaolong BMC Med Inform Decis Mak Research BACKGROUND: Protein–protein interactions (PPIs) are the core of protein function, which provide an effective means to understand the function at cell level. Identification of PPIs is the crucial foundation of predicting drug-target interactions. Although traditional biological experiments of identifying PPIs are becoming available, these experiments remain to be extremely time-consuming and expensive. Therefore, various computational models have been introduced to identify PPIs. In protein-protein interaction network (PPIN), Hub protein, as a highly connected node, can coordinate PPIs and play biological functions. Detecting hot regions on Hub protein interaction interfaces is an issue worthy of discussing. METHODS: Two clustering methods, LCSD and RCNOIK are used to detect the hot regions on Hub protein interaction interfaces in this paper. In order to improve the efficiency of K-means clustering algorithm, the best k value is selected by calculating the distance square sum and the average silhouette coefficients. Then, the optimization of residue coordination number strategy is used to calculate the average coordination number. In addition, the pair potentials and relative ASA (PPRA) strategy is also used to optimize the predicted results. RESULTS: DataHub dataset and PartyHub dataset were used to train two clustering models respectively. Experiments show that LCSD and RCNOIK have the same coverage with Hub protein datasets, and RCNOIK is slightly higher than LCSD in Precision. The predicted hot regions are closer to the standard hot regions. CONCLUSIONS: This paper optimizes two clustering methods based on PPRA strategy. Compared our methods for hot regions prediction against the well-known approaches, our improved methods have the higher reliability and are effective for predicting hot regions on Hub protein interaction interfaces. BioMed Central 2021-05-03 /pmc/articles/PMC8094484/ /pubmed/33941163 http://dx.doi.org/10.1186/s12911-020-01350-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Lin, Xiaoli
Zhang, Xiaolong
Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_full Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_fullStr Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_full_unstemmed Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_short Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_sort identification of hot regions in hub protein–protein interactions by clustering and ppra optimization
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8094484/
https://www.ncbi.nlm.nih.gov/pubmed/33941163
http://dx.doi.org/10.1186/s12911-020-01350-4
work_keys_str_mv AT linxiaoli identificationofhotregionsinhubproteinproteininteractionsbyclusteringandppraoptimization
AT zhangxiaolong identificationofhotregionsinhubproteinproteininteractionsbyclusteringandppraoptimization