Cargando…

Co-Membership-based Generic Anomalous Communities Detection

Nowadays, detecting anomalous communities in networks is an essential task in research, as it helps discover insights into community-structured networks. Most of the existing methods leverage either information regarding attributes of vertices or the topological structure of communities. In this stu...

Descripción completa

Detalles Bibliográficos
Autores principales: Lapid, Shay, Kagan, Dima, Fire, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9812749/
https://www.ncbi.nlm.nih.gov/pubmed/36624805
http://dx.doi.org/10.1007/s11063-022-11103-1
_version_ 1784863798238117888
author Lapid, Shay
Kagan, Dima
Fire, Michael
author_facet Lapid, Shay
Kagan, Dima
Fire, Michael
author_sort Lapid, Shay
collection PubMed
description Nowadays, detecting anomalous communities in networks is an essential task in research, as it helps discover insights into community-structured networks. Most of the existing methods leverage either information regarding attributes of vertices or the topological structure of communities. In this study, we introduce the Co-Membership-based Generic Anomalous Communities Detection Algorithm (referred as to CMMAC), a novel and generic method that utilizes the information of vertices co-membership in multiple communities. CMMAC is domain-free and almost unaffected by communities’ sizes and densities. Specifically, we train a classifier to predict the probability of each vertex in a community being a member of the community. We then rank the communities by the aggregated membership probabilities of each community’s vertices. The lowest-ranked communities are considered to be anomalous. Furthermore, we present an algorithm for generating a community-structured random network enabling the infusion of anomalous communities to facilitate research in the field. We utilized it to generate two datasets, composed of thousands of labeled anomaly-infused networks, and published them. We experimented extensively on thousands of simulated, and real-world networks, infused with artificial anomalies. CMMAC outperformed other existing methods in a range of settings. Additionally, we demonstrated that CMMAC can identify abnormal communities in real-world unlabeled networks in different domains, such as Reddit and Wikipedia.
format Online
Article
Text
id pubmed-9812749
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-98127492023-01-05 Co-Membership-based Generic Anomalous Communities Detection Lapid, Shay Kagan, Dima Fire, Michael Neural Process Lett Article Nowadays, detecting anomalous communities in networks is an essential task in research, as it helps discover insights into community-structured networks. Most of the existing methods leverage either information regarding attributes of vertices or the topological structure of communities. In this study, we introduce the Co-Membership-based Generic Anomalous Communities Detection Algorithm (referred as to CMMAC), a novel and generic method that utilizes the information of vertices co-membership in multiple communities. CMMAC is domain-free and almost unaffected by communities’ sizes and densities. Specifically, we train a classifier to predict the probability of each vertex in a community being a member of the community. We then rank the communities by the aggregated membership probabilities of each community’s vertices. The lowest-ranked communities are considered to be anomalous. Furthermore, we present an algorithm for generating a community-structured random network enabling the infusion of anomalous communities to facilitate research in the field. We utilized it to generate two datasets, composed of thousands of labeled anomaly-infused networks, and published them. We experimented extensively on thousands of simulated, and real-world networks, infused with artificial anomalies. CMMAC outperformed other existing methods in a range of settings. Additionally, we demonstrated that CMMAC can identify abnormal communities in real-world unlabeled networks in different domains, such as Reddit and Wikipedia. Springer US 2023-01-05 /pmc/articles/PMC9812749/ /pubmed/36624805 http://dx.doi.org/10.1007/s11063-022-11103-1 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Lapid, Shay
Kagan, Dima
Fire, Michael
Co-Membership-based Generic Anomalous Communities Detection
title Co-Membership-based Generic Anomalous Communities Detection
title_full Co-Membership-based Generic Anomalous Communities Detection
title_fullStr Co-Membership-based Generic Anomalous Communities Detection
title_full_unstemmed Co-Membership-based Generic Anomalous Communities Detection
title_short Co-Membership-based Generic Anomalous Communities Detection
title_sort co-membership-based generic anomalous communities detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9812749/
https://www.ncbi.nlm.nih.gov/pubmed/36624805
http://dx.doi.org/10.1007/s11063-022-11103-1
work_keys_str_mv AT lapidshay comembershipbasedgenericanomalouscommunitiesdetection
AT kagandima comembershipbasedgenericanomalouscommunitiesdetection
AT firemichael comembershipbasedgenericanomalouscommunitiesdetection