Cargando…

Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization

BACKGROUND: Protein–protein interactions (PPIs) are the core of protein function, which provide an effective means to understand the function at cell level. Identification of PPIs is the crucial foundation of predicting drug-target interactions. Although traditional biological experiments of identif...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lin, Xiaoli, Zhang, Xiaolong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2021
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8094484/ https://www.ncbi.nlm.nih.gov/pubmed/33941163 http://dx.doi.org/10.1186/s12911-020-01350-4

_version_	1783687980568805376
author	Lin, Xiaoli Zhang, Xiaolong
author_facet	Lin, Xiaoli Zhang, Xiaolong
author_sort	Lin, Xiaoli
collection	PubMed
description	BACKGROUND: Protein–protein interactions (PPIs) are the core of protein function, which provide an effective means to understand the function at cell level. Identification of PPIs is the crucial foundation of predicting drug-target interactions. Although traditional biological experiments of identifying PPIs are becoming available, these experiments remain to be extremely time-consuming and expensive. Therefore, various computational models have been introduced to identify PPIs. In protein-protein interaction network (PPIN), Hub protein, as a highly connected node, can coordinate PPIs and play biological functions. Detecting hot regions on Hub protein interaction interfaces is an issue worthy of discussing. METHODS: Two clustering methods, LCSD and RCNOIK are used to detect the hot regions on Hub protein interaction interfaces in this paper. In order to improve the efficiency of K-means clustering algorithm, the best k value is selected by calculating the distance square sum and the average silhouette coefficients. Then, the optimization of residue coordination number strategy is used to calculate the average coordination number. In addition, the pair potentials and relative ASA (PPRA) strategy is also used to optimize the predicted results. RESULTS: DataHub dataset and PartyHub dataset were used to train two clustering models respectively. Experiments show that LCSD and RCNOIK have the same coverage with Hub protein datasets, and RCNOIK is slightly higher than LCSD in Precision. The predicted hot regions are closer to the standard hot regions. CONCLUSIONS: This paper optimizes two clustering methods based on PPRA strategy. Compared our methods for hot regions prediction against the well-known approaches, our improved methods have the higher reliability and are effective for predicting hot regions on Hub protein interaction interfaces.
format	Online Article Text
id	pubmed-8094484
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-80944842021-05-04 Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization Lin, Xiaoli Zhang, Xiaolong BMC Med Inform Decis Mak Research BACKGROUND: Protein–protein interactions (PPIs) are the core of protein function, which provide an effective means to understand the function at cell level. Identification of PPIs is the crucial foundation of predicting drug-target interactions. Although traditional biological experiments of identifying PPIs are becoming available, these experiments remain to be extremely time-consuming and expensive. Therefore, various computational models have been introduced to identify PPIs. In protein-protein interaction network (PPIN), Hub protein, as a highly connected node, can coordinate PPIs and play biological functions. Detecting hot regions on Hub protein interaction interfaces is an issue worthy of discussing. METHODS: Two clustering methods, LCSD and RCNOIK are used to detect the hot regions on Hub protein interaction interfaces in this paper. In order to improve the efficiency of K-means clustering algorithm, the best k value is selected by calculating the distance square sum and the average silhouette coefficients. Then, the optimization of residue coordination number strategy is used to calculate the average coordination number. In addition, the pair potentials and relative ASA (PPRA) strategy is also used to optimize the predicted results. RESULTS: DataHub dataset and PartyHub dataset were used to train two clustering models respectively. Experiments show that LCSD and RCNOIK have the same coverage with Hub protein datasets, and RCNOIK is slightly higher than LCSD in Precision. The predicted hot regions are closer to the standard hot regions. CONCLUSIONS: This paper optimizes two clustering methods based on PPRA strategy. Compared our methods for hot regions prediction against the well-known approaches, our improved methods have the higher reliability and are effective for predicting hot regions on Hub protein interaction interfaces. BioMed Central 2021-05-03 /pmc/articles/PMC8094484/ /pubmed/33941163 http://dx.doi.org/10.1186/s12911-020-01350-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Lin, Xiaoli Zhang, Xiaolong Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title	Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_full	Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_fullStr	Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_full_unstemmed	Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_short	Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization
title_sort	identification of hot regions in hub protein–protein interactions by clustering and ppra optimization
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8094484/ https://www.ncbi.nlm.nih.gov/pubmed/33941163 http://dx.doi.org/10.1186/s12911-020-01350-4
work_keys_str_mv	AT linxiaoli identificationofhotregionsinhubproteinproteininteractionsbyclusteringandppraoptimization AT zhangxiaolong identificationofhotregionsinhubproteinproteininteractionsbyclusteringandppraoptimization

Identification of hot regions in hub protein–protein interactions by clustering and PPRA optimization

Ejemplares similares