Cargando…

On the effectiveness of a contrastive cascade graph learning framework: The power of synthetic cascade data

Analyzing the dynamics of information diffusion cascades and accurately predicting their behavior holds significant importance in various applications. In this paper, we concentrate specifically on a recently introduced contrastive cascade graph learning framework, for the task of predicting cascade...

Descripción completa

Detalles Bibliográficos
Autores principales: Suzuki, Daiki, Tsugawa, Sho, Tsukamoto, Keiichiro, Igari, Shintaro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10578604/
https://www.ncbi.nlm.nih.gov/pubmed/37844089
http://dx.doi.org/10.1371/journal.pone.0293032
_version_ 1785121552841310208
author Suzuki, Daiki
Tsugawa, Sho
Tsukamoto, Keiichiro
Igari, Shintaro
author_facet Suzuki, Daiki
Tsugawa, Sho
Tsukamoto, Keiichiro
Igari, Shintaro
author_sort Suzuki, Daiki
collection PubMed
description Analyzing the dynamics of information diffusion cascades and accurately predicting their behavior holds significant importance in various applications. In this paper, we concentrate specifically on a recently introduced contrastive cascade graph learning framework, for the task of predicting cascade popularity. This framework follows a pre-training and fine-tuning paradigm to address cascade prediction tasks. In a previous study, the transferability of pre-trained models within the contrastive cascade graph learning framework was examined solely between two social media datasets. However, in our present study, we comprehensively evaluate the transferability of pre-trained models across 13 real datasets and six synthetic datasets. We construct several pre-trained models using real cascades and synthetic cascades generated by the independent cascade model and the Profile model. Then, we fine-tune these pre-trained models on real cascade datasets and evaluate their prediction accuracy based on the mean squared logarithmic error. The main findings derived from our results are as follows. (1) The pre-trained models exhibit transferability across diverse types of real datasets in different domains, encompassing different languages, social media platforms, and diffusion time scales. (2) Synthetic cascade data prove effective for pre-training purposes. The pre-trained models constructed with synthetic cascade data demonstrate comparable effectiveness to those constructed using real data. (3) Synthetic cascade data prove beneficial for fine-tuning the contrastive cascade graph learning models and training other state-of-the-art popularity prediction models. Models trained using a combination of real and synthetic cascades yield significantly lower mean squared logarithmic error compared to those trained solely on real cascades. Our findings affirm the effectiveness of synthetic cascade data in enhancing the accuracy of cascade popularity prediction.
format Online
Article
Text
id pubmed-10578604
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105786042023-10-17 On the effectiveness of a contrastive cascade graph learning framework: The power of synthetic cascade data Suzuki, Daiki Tsugawa, Sho Tsukamoto, Keiichiro Igari, Shintaro PLoS One Research Article Analyzing the dynamics of information diffusion cascades and accurately predicting their behavior holds significant importance in various applications. In this paper, we concentrate specifically on a recently introduced contrastive cascade graph learning framework, for the task of predicting cascade popularity. This framework follows a pre-training and fine-tuning paradigm to address cascade prediction tasks. In a previous study, the transferability of pre-trained models within the contrastive cascade graph learning framework was examined solely between two social media datasets. However, in our present study, we comprehensively evaluate the transferability of pre-trained models across 13 real datasets and six synthetic datasets. We construct several pre-trained models using real cascades and synthetic cascades generated by the independent cascade model and the Profile model. Then, we fine-tune these pre-trained models on real cascade datasets and evaluate their prediction accuracy based on the mean squared logarithmic error. The main findings derived from our results are as follows. (1) The pre-trained models exhibit transferability across diverse types of real datasets in different domains, encompassing different languages, social media platforms, and diffusion time scales. (2) Synthetic cascade data prove effective for pre-training purposes. The pre-trained models constructed with synthetic cascade data demonstrate comparable effectiveness to those constructed using real data. (3) Synthetic cascade data prove beneficial for fine-tuning the contrastive cascade graph learning models and training other state-of-the-art popularity prediction models. Models trained using a combination of real and synthetic cascades yield significantly lower mean squared logarithmic error compared to those trained solely on real cascades. Our findings affirm the effectiveness of synthetic cascade data in enhancing the accuracy of cascade popularity prediction. Public Library of Science 2023-10-16 /pmc/articles/PMC10578604/ /pubmed/37844089 http://dx.doi.org/10.1371/journal.pone.0293032 Text en © 2023 Suzuki et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Suzuki, Daiki
Tsugawa, Sho
Tsukamoto, Keiichiro
Igari, Shintaro
On the effectiveness of a contrastive cascade graph learning framework: The power of synthetic cascade data
title On the effectiveness of a contrastive cascade graph learning framework: The power of synthetic cascade data
title_full On the effectiveness of a contrastive cascade graph learning framework: The power of synthetic cascade data
title_fullStr On the effectiveness of a contrastive cascade graph learning framework: The power of synthetic cascade data
title_full_unstemmed On the effectiveness of a contrastive cascade graph learning framework: The power of synthetic cascade data
title_short On the effectiveness of a contrastive cascade graph learning framework: The power of synthetic cascade data
title_sort on the effectiveness of a contrastive cascade graph learning framework: the power of synthetic cascade data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10578604/
https://www.ncbi.nlm.nih.gov/pubmed/37844089
http://dx.doi.org/10.1371/journal.pone.0293032
work_keys_str_mv AT suzukidaiki ontheeffectivenessofacontrastivecascadegraphlearningframeworkthepowerofsyntheticcascadedata
AT tsugawasho ontheeffectivenessofacontrastivecascadegraphlearningframeworkthepowerofsyntheticcascadedata
AT tsukamotokeiichiro ontheeffectivenessofacontrastivecascadegraphlearningframeworkthepowerofsyntheticcascadedata
AT igarishintaro ontheeffectivenessofacontrastivecascadegraphlearningframeworkthepowerofsyntheticcascadedata