Cargando…

I/O Efficient Early Bursting Cohesive Subgraph Discovery in Massive Temporal Networks

Temporal networks are an effective way to encode temporal information into graph data losslessly. Finding the bursting cohesive subgraph (BCS), which accumulates its cohesiveness at the fastest rate, is an important problem in temporal networks. The BCS has a large number of applications, such as re...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yuan, Dai, Jie, Fan, Xiao-Lin, Zhao, Yu-Hai, Wang, Guo-Ren
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9797886/
https://www.ncbi.nlm.nih.gov/pubmed/36594008
http://dx.doi.org/10.1007/s11390-022-2367-3
_version_ 1784860783120744448
author Li, Yuan
Dai, Jie
Fan, Xiao-Lin
Zhao, Yu-Hai
Wang, Guo-Ren
author_facet Li, Yuan
Dai, Jie
Fan, Xiao-Lin
Zhao, Yu-Hai
Wang, Guo-Ren
author_sort Li, Yuan
collection PubMed
description Temporal networks are an effective way to encode temporal information into graph data losslessly. Finding the bursting cohesive subgraph (BCS), which accumulates its cohesiveness at the fastest rate, is an important problem in temporal networks. The BCS has a large number of applications, such as representing emergency events in social media, traffic congestion in road networks and epidemic outbreak in communities. Nevertheless, existing methods demand the BCS lasting for a time interval, which neglects the timeliness of the BCS. In this paper, we design an early bursting cohesive subgraph (EBCS) model based on the k-core to enable identifying the burstiness as soon as possible. To find the EBCS, we first construct a time weight graph (TWG) to measure the bursting level by integrating the topological and temporal information. Then, we propose a global search algorithm, called GS-EBCS, which can find the exact EBCS by iteratively removing nodes from the TWG. Further, we propose a local search algorithm, named LS-EBCS, to find the EBCS by first expanding from a seed node until obtaining a candidate k-core and then refining the k-core to the result subgraph in an optimal time complexity. Subsequently, considering the situation that the massive temporal networks cannot be completely put into the memory, we first design an I/O method to build the TWG and then develop I/O efficient global search and local search algorithms, namely I/O-GS and I/O-LS respectively, to find the EBCS under the semi-external model. Extensive experiments, conducted on four real temporal networks, demonstrate the efficiency and effectiveness of our proposed algorithms. For example, on the DBLP dataset, I/O-LS and LS-EBCS have comparable running time, while the maximum memory usage of I/O-LS is only 6.5 MB, which is much smaller than that of LS-EBCS taking 308.7 MB. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11390-022-2367-3.
format Online
Article
Text
id pubmed-9797886
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-97978862022-12-29 I/O Efficient Early Bursting Cohesive Subgraph Discovery in Massive Temporal Networks Li, Yuan Dai, Jie Fan, Xiao-Lin Zhao, Yu-Hai Wang, Guo-Ren J Comput Sci Technol Regular Paper Temporal networks are an effective way to encode temporal information into graph data losslessly. Finding the bursting cohesive subgraph (BCS), which accumulates its cohesiveness at the fastest rate, is an important problem in temporal networks. The BCS has a large number of applications, such as representing emergency events in social media, traffic congestion in road networks and epidemic outbreak in communities. Nevertheless, existing methods demand the BCS lasting for a time interval, which neglects the timeliness of the BCS. In this paper, we design an early bursting cohesive subgraph (EBCS) model based on the k-core to enable identifying the burstiness as soon as possible. To find the EBCS, we first construct a time weight graph (TWG) to measure the bursting level by integrating the topological and temporal information. Then, we propose a global search algorithm, called GS-EBCS, which can find the exact EBCS by iteratively removing nodes from the TWG. Further, we propose a local search algorithm, named LS-EBCS, to find the EBCS by first expanding from a seed node until obtaining a candidate k-core and then refining the k-core to the result subgraph in an optimal time complexity. Subsequently, considering the situation that the massive temporal networks cannot be completely put into the memory, we first design an I/O method to build the TWG and then develop I/O efficient global search and local search algorithms, namely I/O-GS and I/O-LS respectively, to find the EBCS under the semi-external model. Extensive experiments, conducted on four real temporal networks, demonstrate the efficiency and effectiveness of our proposed algorithms. For example, on the DBLP dataset, I/O-LS and LS-EBCS have comparable running time, while the maximum memory usage of I/O-LS is only 6.5 MB, which is much smaller than that of LS-EBCS taking 308.7 MB. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11390-022-2367-3. Springer Nature Singapore 2022-11-30 2022 /pmc/articles/PMC9797886/ /pubmed/36594008 http://dx.doi.org/10.1007/s11390-022-2367-3 Text en © Institute of Computing Technology, Chinese Academy of Sciences 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Regular Paper
Li, Yuan
Dai, Jie
Fan, Xiao-Lin
Zhao, Yu-Hai
Wang, Guo-Ren
I/O Efficient Early Bursting Cohesive Subgraph Discovery in Massive Temporal Networks
title I/O Efficient Early Bursting Cohesive Subgraph Discovery in Massive Temporal Networks
title_full I/O Efficient Early Bursting Cohesive Subgraph Discovery in Massive Temporal Networks
title_fullStr I/O Efficient Early Bursting Cohesive Subgraph Discovery in Massive Temporal Networks
title_full_unstemmed I/O Efficient Early Bursting Cohesive Subgraph Discovery in Massive Temporal Networks
title_short I/O Efficient Early Bursting Cohesive Subgraph Discovery in Massive Temporal Networks
title_sort i/o efficient early bursting cohesive subgraph discovery in massive temporal networks
topic Regular Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9797886/
https://www.ncbi.nlm.nih.gov/pubmed/36594008
http://dx.doi.org/10.1007/s11390-022-2367-3
work_keys_str_mv AT liyuan ioefficientearlyburstingcohesivesubgraphdiscoveryinmassivetemporalnetworks
AT daijie ioefficientearlyburstingcohesivesubgraphdiscoveryinmassivetemporalnetworks
AT fanxiaolin ioefficientearlyburstingcohesivesubgraphdiscoveryinmassivetemporalnetworks
AT zhaoyuhai ioefficientearlyburstingcohesivesubgraphdiscoveryinmassivetemporalnetworks
AT wangguoren ioefficientearlyburstingcohesivesubgraphdiscoveryinmassivetemporalnetworks