Cargando…

Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis

Social data has shown important role in tracking, monitoring and risk management of disasters. Indeed, several works focused on the benefits of social data analysis for the healthcare practices and curing domain. Similarly, these data are exploited now for tracking the COVID-19 pandemic but the majo...

Descripción completa

Detalles Bibliográficos
Autores principales: Amara, Amina, Hadj Taieb, Mohamed Ali, Ben Aouicha, Mohamed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7881346/
https://www.ncbi.nlm.nih.gov/pubmed/34764585
http://dx.doi.org/10.1007/s10489-020-02033-3
_version_ 1783650859259789312
author Amara, Amina
Hadj Taieb, Mohamed Ali
Ben Aouicha, Mohamed
author_facet Amara, Amina
Hadj Taieb, Mohamed Ali
Ben Aouicha, Mohamed
author_sort Amara, Amina
collection PubMed
description Social data has shown important role in tracking, monitoring and risk management of disasters. Indeed, several works focused on the benefits of social data analysis for the healthcare practices and curing domain. Similarly, these data are exploited now for tracking the COVID-19 pandemic but the majority of works exploited Twitter as source. In this paper, we choose to exploit Facebook, rarely used, for tracking the evolution of COVID-19 related trends. In fact, a multilingual dataset covering 7 languages (English (EN), Arabic (AR), Spanish (ES), Italian (IT), German (DE), French (FR) and Japanese (JP)) is extracted from Facebook public posts. The proposal is an analytics process including a data gathering step, pre-processing, LDA-based topic modeling and presentation module using graph structure. Data analysing covers the duration spanned from January 1st, 2020 to May 15, 2020 divided on three periods in cumulative way: first period January-February, second period March-April and the last one to 15 May. The results showed that the extracted topics correspond to the chronological development of what has been circulated around the pandemic and the measures that have been taken according to the various languages under discussion representing several countries.
format Online
Article
Text
id pubmed-7881346
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-78813462021-02-16 Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis Amara, Amina Hadj Taieb, Mohamed Ali Ben Aouicha, Mohamed Appl Intell (Dordr) Article Social data has shown important role in tracking, monitoring and risk management of disasters. Indeed, several works focused on the benefits of social data analysis for the healthcare practices and curing domain. Similarly, these data are exploited now for tracking the COVID-19 pandemic but the majority of works exploited Twitter as source. In this paper, we choose to exploit Facebook, rarely used, for tracking the evolution of COVID-19 related trends. In fact, a multilingual dataset covering 7 languages (English (EN), Arabic (AR), Spanish (ES), Italian (IT), German (DE), French (FR) and Japanese (JP)) is extracted from Facebook public posts. The proposal is an analytics process including a data gathering step, pre-processing, LDA-based topic modeling and presentation module using graph structure. Data analysing covers the duration spanned from January 1st, 2020 to May 15, 2020 divided on three periods in cumulative way: first period January-February, second period March-April and the last one to 15 May. The results showed that the extracted topics correspond to the chronological development of what has been circulated around the pandemic and the measures that have been taken according to the various languages under discussion representing several countries. Springer US 2021-02-13 2021 /pmc/articles/PMC7881346/ /pubmed/34764585 http://dx.doi.org/10.1007/s10489-020-02033-3 Text en © Springer Science+Business Media, LLC, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Amara, Amina
Hadj Taieb, Mohamed Ali
Ben Aouicha, Mohamed
Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis
title Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis
title_full Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis
title_fullStr Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis
title_full_unstemmed Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis
title_short Multilingual topic modeling for tracking COVID-19 trends based on Facebook data analysis
title_sort multilingual topic modeling for tracking covid-19 trends based on facebook data analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7881346/
https://www.ncbi.nlm.nih.gov/pubmed/34764585
http://dx.doi.org/10.1007/s10489-020-02033-3
work_keys_str_mv AT amaraamina multilingualtopicmodelingfortrackingcovid19trendsbasedonfacebookdataanalysis
AT hadjtaiebmohamedali multilingualtopicmodelingfortrackingcovid19trendsbasedonfacebookdataanalysis
AT benaouichamohamed multilingualtopicmodelingfortrackingcovid19trendsbasedonfacebookdataanalysis