Cargando…
Co-Membership-based Generic Anomalous Communities Detection
Nowadays, detecting anomalous communities in networks is an essential task in research, as it helps discover insights into community-structured networks. Most of the existing methods leverage either information regarding attributes of vertices or the topological structure of communities. In this stu...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9812749/ https://www.ncbi.nlm.nih.gov/pubmed/36624805 http://dx.doi.org/10.1007/s11063-022-11103-1 |
_version_ | 1784863798238117888 |
---|---|
author | Lapid, Shay Kagan, Dima Fire, Michael |
author_facet | Lapid, Shay Kagan, Dima Fire, Michael |
author_sort | Lapid, Shay |
collection | PubMed |
description | Nowadays, detecting anomalous communities in networks is an essential task in research, as it helps discover insights into community-structured networks. Most of the existing methods leverage either information regarding attributes of vertices or the topological structure of communities. In this study, we introduce the Co-Membership-based Generic Anomalous Communities Detection Algorithm (referred as to CMMAC), a novel and generic method that utilizes the information of vertices co-membership in multiple communities. CMMAC is domain-free and almost unaffected by communities’ sizes and densities. Specifically, we train a classifier to predict the probability of each vertex in a community being a member of the community. We then rank the communities by the aggregated membership probabilities of each community’s vertices. The lowest-ranked communities are considered to be anomalous. Furthermore, we present an algorithm for generating a community-structured random network enabling the infusion of anomalous communities to facilitate research in the field. We utilized it to generate two datasets, composed of thousands of labeled anomaly-infused networks, and published them. We experimented extensively on thousands of simulated, and real-world networks, infused with artificial anomalies. CMMAC outperformed other existing methods in a range of settings. Additionally, we demonstrated that CMMAC can identify abnormal communities in real-world unlabeled networks in different domains, such as Reddit and Wikipedia. |
format | Online Article Text |
id | pubmed-9812749 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-98127492023-01-05 Co-Membership-based Generic Anomalous Communities Detection Lapid, Shay Kagan, Dima Fire, Michael Neural Process Lett Article Nowadays, detecting anomalous communities in networks is an essential task in research, as it helps discover insights into community-structured networks. Most of the existing methods leverage either information regarding attributes of vertices or the topological structure of communities. In this study, we introduce the Co-Membership-based Generic Anomalous Communities Detection Algorithm (referred as to CMMAC), a novel and generic method that utilizes the information of vertices co-membership in multiple communities. CMMAC is domain-free and almost unaffected by communities’ sizes and densities. Specifically, we train a classifier to predict the probability of each vertex in a community being a member of the community. We then rank the communities by the aggregated membership probabilities of each community’s vertices. The lowest-ranked communities are considered to be anomalous. Furthermore, we present an algorithm for generating a community-structured random network enabling the infusion of anomalous communities to facilitate research in the field. We utilized it to generate two datasets, composed of thousands of labeled anomaly-infused networks, and published them. We experimented extensively on thousands of simulated, and real-world networks, infused with artificial anomalies. CMMAC outperformed other existing methods in a range of settings. Additionally, we demonstrated that CMMAC can identify abnormal communities in real-world unlabeled networks in different domains, such as Reddit and Wikipedia. Springer US 2023-01-05 /pmc/articles/PMC9812749/ /pubmed/36624805 http://dx.doi.org/10.1007/s11063-022-11103-1 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Lapid, Shay Kagan, Dima Fire, Michael Co-Membership-based Generic Anomalous Communities Detection |
title | Co-Membership-based Generic Anomalous Communities Detection |
title_full | Co-Membership-based Generic Anomalous Communities Detection |
title_fullStr | Co-Membership-based Generic Anomalous Communities Detection |
title_full_unstemmed | Co-Membership-based Generic Anomalous Communities Detection |
title_short | Co-Membership-based Generic Anomalous Communities Detection |
title_sort | co-membership-based generic anomalous communities detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9812749/ https://www.ncbi.nlm.nih.gov/pubmed/36624805 http://dx.doi.org/10.1007/s11063-022-11103-1 |
work_keys_str_mv | AT lapidshay comembershipbasedgenericanomalouscommunitiesdetection AT kagandima comembershipbasedgenericanomalouscommunitiesdetection AT firemichael comembershipbasedgenericanomalouscommunitiesdetection |