Cargando…

Local hypergraph clustering using capacity releasing diffusion

Local graph clustering is an important machine learning task that aims to find a well-connected cluster near a set of seed nodes. Recent results have revealed that incorporating higher order information significantly enhances the results of graph clustering techniques. The majority of existing resea...

Descripción completa

Detalles Bibliográficos
Autores principales: Ibrahim, Rania, Gleich, David F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7757905/
https://www.ncbi.nlm.nih.gov/pubmed/33362247
http://dx.doi.org/10.1371/journal.pone.0243485
_version_ 1783626824167718912
author Ibrahim, Rania
Gleich, David F.
author_facet Ibrahim, Rania
Gleich, David F.
author_sort Ibrahim, Rania
collection PubMed
description Local graph clustering is an important machine learning task that aims to find a well-connected cluster near a set of seed nodes. Recent results have revealed that incorporating higher order information significantly enhances the results of graph clustering techniques. The majority of existing research in this area focuses on spectral graph theory-based techniques. However, an alternative perspective on local graph clustering arises from using max-flow and min-cut on the objectives, which offer distinctly different guarantees. For instance, a new method called capacity releasing diffusion (CRD) was recently proposed and shown to preserve local structure around the seeds better than spectral methods. The method was also the first local clustering technique that is not subject to the quadratic Cheeger inequality by assuming a good cluster near the seed nodes. In this paper, we propose a local hypergraph clustering technique called hypergraph CRD (HG-CRD) by extending the CRD process to cluster based on higher order patterns, encoded as hyperedges of a hypergraph. Moreover, we theoretically show that HG-CRD gives results about a quantity called motif conductance, rather than a biased version used in previous experiments. Experimental results on synthetic datasets and real world graphs show that HG-CRD enhances the clustering quality.
format Online
Article
Text
id pubmed-7757905
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-77579052021-01-07 Local hypergraph clustering using capacity releasing diffusion Ibrahim, Rania Gleich, David F. PLoS One Research Article Local graph clustering is an important machine learning task that aims to find a well-connected cluster near a set of seed nodes. Recent results have revealed that incorporating higher order information significantly enhances the results of graph clustering techniques. The majority of existing research in this area focuses on spectral graph theory-based techniques. However, an alternative perspective on local graph clustering arises from using max-flow and min-cut on the objectives, which offer distinctly different guarantees. For instance, a new method called capacity releasing diffusion (CRD) was recently proposed and shown to preserve local structure around the seeds better than spectral methods. The method was also the first local clustering technique that is not subject to the quadratic Cheeger inequality by assuming a good cluster near the seed nodes. In this paper, we propose a local hypergraph clustering technique called hypergraph CRD (HG-CRD) by extending the CRD process to cluster based on higher order patterns, encoded as hyperedges of a hypergraph. Moreover, we theoretically show that HG-CRD gives results about a quantity called motif conductance, rather than a biased version used in previous experiments. Experimental results on synthetic datasets and real world graphs show that HG-CRD enhances the clustering quality. Public Library of Science 2020-12-23 /pmc/articles/PMC7757905/ /pubmed/33362247 http://dx.doi.org/10.1371/journal.pone.0243485 Text en © 2020 Ibrahim, Gleich http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ibrahim, Rania
Gleich, David F.
Local hypergraph clustering using capacity releasing diffusion
title Local hypergraph clustering using capacity releasing diffusion
title_full Local hypergraph clustering using capacity releasing diffusion
title_fullStr Local hypergraph clustering using capacity releasing diffusion
title_full_unstemmed Local hypergraph clustering using capacity releasing diffusion
title_short Local hypergraph clustering using capacity releasing diffusion
title_sort local hypergraph clustering using capacity releasing diffusion
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7757905/
https://www.ncbi.nlm.nih.gov/pubmed/33362247
http://dx.doi.org/10.1371/journal.pone.0243485
work_keys_str_mv AT ibrahimrania localhypergraphclusteringusingcapacityreleasingdiffusion
AT gleichdavidf localhypergraphclusteringusingcapacityreleasingdiffusion