Cargando…
Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient
With the rapid development of bioinformatics, researchers have applied community detection algorithms to detect functional modules in protein-protein interaction (PPI) networks that can predict the function of unknown proteins at the molecular level and further reveal the regularity of cell activity...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8261288/ https://www.ncbi.nlm.nih.gov/pubmed/34249104 http://dx.doi.org/10.3389/fgene.2021.689515 |
_version_ | 1783718984021966848 |
---|---|
author | Wang, Yan Chen, Qiong Yang, Lili Yang, Sen He, Kai Xie, Xuping |
author_facet | Wang, Yan Chen, Qiong Yang, Lili Yang, Sen He, Kai Xie, Xuping |
author_sort | Wang, Yan |
collection | PubMed |
description | With the rapid development of bioinformatics, researchers have applied community detection algorithms to detect functional modules in protein-protein interaction (PPI) networks that can predict the function of unknown proteins at the molecular level and further reveal the regularity of cell activity. Clusters in a PPI network may overlap where a protein is involved in multiple functional modules. To identify overlapping structures in protein functional modules, this paper proposes a novel overlapping community detection algorithm based on the neighboring local clustering coefficient (NLC). The contributions of the NLC algorithm are threefold: (i) Combine the edge-based community detection method with local expansion in seed selection and the local clustering coefficient of neighboring nodes to improve the accuracy of seed selection; (ii) A method of measuring the distance between edges is improved to make the result of community division more accurate; (iii) A community optimization strategy for the excessive overlapping nodes makes the overlapping structure more reasonable. The experimental results on standard networks, Lancichinetti-Fortunato-Radicchi (LFR) benchmark networks and PPI networks show that the NLC algorithm can improve the Extended modularity (EQ) value and Normalized Mutual Information (NMI) value of the community division, which verifies that the algorithm can not only detect reasonable communities but also identify overlapping structures in networks. |
format | Online Article Text |
id | pubmed-8261288 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82612882021-07-08 Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient Wang, Yan Chen, Qiong Yang, Lili Yang, Sen He, Kai Xie, Xuping Front Genet Genetics With the rapid development of bioinformatics, researchers have applied community detection algorithms to detect functional modules in protein-protein interaction (PPI) networks that can predict the function of unknown proteins at the molecular level and further reveal the regularity of cell activity. Clusters in a PPI network may overlap where a protein is involved in multiple functional modules. To identify overlapping structures in protein functional modules, this paper proposes a novel overlapping community detection algorithm based on the neighboring local clustering coefficient (NLC). The contributions of the NLC algorithm are threefold: (i) Combine the edge-based community detection method with local expansion in seed selection and the local clustering coefficient of neighboring nodes to improve the accuracy of seed selection; (ii) A method of measuring the distance between edges is improved to make the result of community division more accurate; (iii) A community optimization strategy for the excessive overlapping nodes makes the overlapping structure more reasonable. The experimental results on standard networks, Lancichinetti-Fortunato-Radicchi (LFR) benchmark networks and PPI networks show that the NLC algorithm can improve the Extended modularity (EQ) value and Normalized Mutual Information (NMI) value of the community division, which verifies that the algorithm can not only detect reasonable communities but also identify overlapping structures in networks. Frontiers Media S.A. 2021-06-23 /pmc/articles/PMC8261288/ /pubmed/34249104 http://dx.doi.org/10.3389/fgene.2021.689515 Text en Copyright © 2021 Wang, Chen, Yang, Yang, He and Xie. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Wang, Yan Chen, Qiong Yang, Lili Yang, Sen He, Kai Xie, Xuping Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient |
title | Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient |
title_full | Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient |
title_fullStr | Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient |
title_full_unstemmed | Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient |
title_short | Overlapping Structures Detection in Protein-Protein Interaction Networks Using Community Detection Algorithm Based on Neighbor Clustering Coefficient |
title_sort | overlapping structures detection in protein-protein interaction networks using community detection algorithm based on neighbor clustering coefficient |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8261288/ https://www.ncbi.nlm.nih.gov/pubmed/34249104 http://dx.doi.org/10.3389/fgene.2021.689515 |
work_keys_str_mv | AT wangyan overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient AT chenqiong overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient AT yanglili overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient AT yangsen overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient AT hekai overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient AT xiexuping overlappingstructuresdetectioninproteinproteininteractionnetworksusingcommunitydetectionalgorithmbasedonneighborclusteringcoefficient |