Cargando…

Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient

With the rapid development of bioinformatics, researchers have applied community detection algorithms to detect functional modules in protein-protein interaction (PPI) networks that can predict the function of unknown proteins at the molecular level and further reveal the regularity of cell activity...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yan, Chen, Qiong, Yang, Lili, Yang, Sen, He, Kai, Xie, Xuping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8261288/
https://www.ncbi.nlm.nih.gov/pubmed/34249104
http://dx.doi.org/10.3389/fgene.2021.689515
_version_ 1783718984021966848
author Wang, Yan
Chen, Qiong
Yang, Lili
Yang, Sen
He, Kai
Xie, Xuping
author_facet Wang, Yan
Chen, Qiong
Yang, Lili
Yang, Sen
He, Kai
Xie, Xuping
author_sort Wang, Yan
collection PubMed
description With the rapid development of bioinformatics, researchers have applied community detection algorithms to detect functional modules in protein-protein interaction (PPI) networks that can predict the function of unknown proteins at the molecular level and further reveal the regularity of cell activity. Clusters in a PPI network may overlap where a protein is involved in multiple functional modules. To identify overlapping structures in protein functional modules, this paper proposes a novel overlapping community detection algorithm based on the neighboring local clustering coefficient (NLC). The contributions of the NLC algorithm are threefold: (i) Combine the edge-based community detection method with local expansion in seed selection and the local clustering coefficient of neighboring nodes to improve the accuracy of seed selection; (ii) A method of measuring the distance between edges is improved to make the result of community division more accurate; (iii) A community optimization strategy for the excessive overlapping nodes makes the overlapping structure more reasonable. The experimental results on standard networks, Lancichinetti-Fortunato-Radicchi (LFR) benchmark networks and PPI networks show that the NLC algorithm can improve the Extended modularity (EQ) value and Normalized Mutual Information (NMI) value of the community division, which verifies that the algorithm can not only detect reasonable communities but also identify overlapping structures in networks.
format Online
Article
Text
id pubmed-8261288
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82612882021-07-08 Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient Wang, Yan Chen, Qiong Yang, Lili Yang, Sen He, Kai Xie, Xuping Front Genet Genetics With the rapid development of bioinformatics, researchers have applied community detection algorithms to detect functional modules in protein-protein interaction (PPI) networks that can predict the function of unknown proteins at the molecular level and further reveal the regularity of cell activity. Clusters in a PPI network may overlap where a protein is involved in multiple functional modules. To identify overlapping structures in protein functional modules, this paper proposes a novel overlapping community detection algorithm based on the neighboring local clustering coefficient (NLC). The contributions of the NLC algorithm are threefold: (i) Combine the edge-based community detection method with local expansion in seed selection and the local clustering coefficient of neighboring nodes to improve the accuracy of seed selection; (ii) A method of measuring the distance between edges is improved to make the result of community division more accurate; (iii) A community optimization strategy for the excessive overlapping nodes makes the overlapping structure more reasonable. The experimental results on standard networks, Lancichinetti-Fortunato-Radicchi (LFR) benchmark networks and PPI networks show that the NLC algorithm can improve the Extended modularity (EQ) value and Normalized Mutual Information (NMI) value of the community division, which verifies that the algorithm can not only detect reasonable communities but also identify overlapping structures in networks. Frontiers Media S.A. 2021-06-23 /pmc/articles/PMC8261288/ /pubmed/34249104 http://dx.doi.org/10.3389/fgene.2021.689515 Text en Copyright © 2021 Wang, Chen, Yang, Yang, He and Xie. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wang, Yan
Chen, Qiong
Yang, Lili
Yang, Sen
He, Kai
Xie, Xuping
Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient
title Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient
title_full Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient
title_fullStr Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient
title_full_unstemmed Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient
title_short Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient
title_sort overlapping structures detection in protein-protein interaction networks using community detection algorithm based on neighbor clustering coefficient
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8261288/
https://www.ncbi.nlm.nih.gov/pubmed/34249104
http://dx.doi.org/10.3389/fgene.2021.689515
work_keys_str_mv AT wangyan overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient
AT chenqiong overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient
AT yanglili overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient
AT yangsen overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient
AT hekai overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient
AT xiexuping overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient