Cargando…
Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
BACKGROUND: Protein–protein interactions (PPIs) are the core of protein function, which provide an effective means to understand the function at cell level. Identification of PPIs is the crucial foundation of predicting drug-target interactions. Although traditional biological experiments of identif...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8094484/ https://www.ncbi.nlm.nih.gov/pubmed/33941163 http://dx.doi.org/10.1186/s12911-020-01350-4 |
_version_ | 1783687980568805376 |
---|---|
author | Lin, Xiaoli Zhang, Xiaolong |
author_facet | Lin, Xiaoli Zhang, Xiaolong |
author_sort | Lin, Xiaoli |
collection | PubMed |
description | BACKGROUND: Protein–protein interactions (PPIs) are the core of protein function, which provide an effective means to understand the function at cell level. Identification of PPIs is the crucial foundation of predicting drug-target interactions. Although traditional biological experiments of identifying PPIs are becoming available, these experiments remain to be extremely time-consuming and expensive. Therefore, various computational models have been introduced to identify PPIs. In protein-protein interaction network (PPIN), Hub protein, as a highly connected node, can coordinate PPIs and play biological functions. Detecting hot regions on Hub protein interaction interfaces is an issue worthy of discussing. METHODS: Two clustering methods, LCSD and RCNOIK are used to detect the hot regions on Hub protein interaction interfaces in this paper. In order to improve the efficiency of K-means clustering algorithm, the best k value is selected by calculating the distance square sum and the average silhouette coefficients. Then, the optimization of residue coordination number strategy is used to calculate the average coordination number. In addition, the pair potentials and relative ASA (PPRA) strategy is also used to optimize the predicted results. RESULTS: DataHub dataset and PartyHub dataset were used to train two clustering models respectively. Experiments show that LCSD and RCNOIK have the same coverage with Hub protein datasets, and RCNOIK is slightly higher than LCSD in Precision. The predicted hot regions are closer to the standard hot regions. CONCLUSIONS: This paper optimizes two clustering methods based on PPRA strategy. Compared our methods for hot regions prediction against the well-known approaches, our improved methods have the higher reliability and are effective for predicting hot regions on Hub protein interaction interfaces. |
format | Online Article Text |
id | pubmed-8094484 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-80944842021-05-04 Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization Lin, Xiaoli Zhang, Xiaolong BMC Med Inform Decis Mak Research BACKGROUND: Protein–protein interactions (PPIs) are the core of protein function, which provide an effective means to understand the function at cell level. Identification of PPIs is the crucial foundation of predicting drug-target interactions. Although traditional biological experiments of identifying PPIs are becoming available, these experiments remain to be extremely time-consuming and expensive. Therefore, various computational models have been introduced to identify PPIs. In protein-protein interaction network (PPIN), Hub protein, as a highly connected node, can coordinate PPIs and play biological functions. Detecting hot regions on Hub protein interaction interfaces is an issue worthy of discussing. METHODS: Two clustering methods, LCSD and RCNOIK are used to detect the hot regions on Hub protein interaction interfaces in this paper. In order to improve the efficiency of K-means clustering algorithm, the best k value is selected by calculating the distance square sum and the average silhouette coefficients. Then, the optimization of residue coordination number strategy is used to calculate the average coordination number. In addition, the pair potentials and relative ASA (PPRA) strategy is also used to optimize the predicted results. RESULTS: DataHub dataset and PartyHub dataset were used to train two clustering models respectively. Experiments show that LCSD and RCNOIK have the same coverage with Hub protein datasets, and RCNOIK is slightly higher than LCSD in Precision. The predicted hot regions are closer to the standard hot regions. CONCLUSIONS: This paper optimizes two clustering methods based on PPRA strategy. Compared our methods for hot regions prediction against the well-known approaches, our improved methods have the higher reliability and are effective for predicting hot regions on Hub protein interaction interfaces. BioMed Central 2021-05-03 /pmc/articles/PMC8094484/ /pubmed/33941163 http://dx.doi.org/10.1186/s12911-020-01350-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Lin, Xiaoli Zhang, Xiaolong Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization |
title | Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization |
title_full | Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization |
title_fullStr | Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization |
title_full_unstemmed | Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization |
title_short | Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization |
title_sort | identification of hot regions in hub protein–protein interactions by clustering and ppra optimization |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8094484/ https://www.ncbi.nlm.nih.gov/pubmed/33941163 http://dx.doi.org/10.1186/s12911-020-01350-4 |
work_keys_str_mv | AT linxiaoli identificationofhotregionsinhubproteinproteininteractionsbyclusteringandppraoptimization AT zhangxiaolong identificationofhotregionsinhubproteinproteininteractionsbyclusteringandppraoptimization |