Cargando…

Social Media Multidimensional Analysis for Intelligent Health Surveillance

Background: Recent work in social network analysis has shown the usefulness of analysing and predicting outcomes from user-generated data in the context of Public Health Surveillance (PHS). Most of the proposals have focused on dealing with static datasets gathered from social networks, which are pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Aramburu, María José, Berlanga, Rafael, Lanza, Indira
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7177443/
https://www.ncbi.nlm.nih.gov/pubmed/32231152
http://dx.doi.org/10.3390/ijerph17072289
_version_ 1783525221430460416
author Aramburu, María José
Berlanga, Rafael
Lanza, Indira
author_facet Aramburu, María José
Berlanga, Rafael
Lanza, Indira
author_sort Aramburu, María José
collection PubMed
description Background: Recent work in social network analysis has shown the usefulness of analysing and predicting outcomes from user-generated data in the context of Public Health Surveillance (PHS). Most of the proposals have focused on dealing with static datasets gathered from social networks, which are processed and mined off-line. However, little work has been done on providing a general framework to analyse the highly dynamic data of social networks from a multidimensional perspective. In this paper, we claim that such a framework is crucial for including social data in PHS systems. Methods: We propose a dynamic multidimensional approach to deal with social data streams. In this approach, dynamic dimensions are continuously updated by applying unsupervised text mining methods. More specifically, we analyse the semantics and temporal patterns in posts for identifying relevant events, topics and users. We also define quality metrics to detect relevant user profiles. In this way, the incoming data can be further filtered to cope with the goals of PHS systems. Results: We have evaluated our approach over a long-term stream of Twitter. We show how the proposed quality metrics allow us to filter out the users that are out-of-domain as well as those with low quality in their messages. We also explain how specific user profiles can be identified through their descriptions. Finally, we illustrate how the proposed multidimensional model can be used to identify main events and topics, as well as to analyse their audience and impact. Conclusions: The results show that the proposed dynamic multidimensional model is able to identify relevant events and topics and analyse them from different perspectives, which is especially useful for PHS systems.
format Online
Article
Text
id pubmed-7177443
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71774432020-04-28 Social Media Multidimensional Analysis for Intelligent Health Surveillance Aramburu, María José Berlanga, Rafael Lanza, Indira Int J Environ Res Public Health Article Background: Recent work in social network analysis has shown the usefulness of analysing and predicting outcomes from user-generated data in the context of Public Health Surveillance (PHS). Most of the proposals have focused on dealing with static datasets gathered from social networks, which are processed and mined off-line. However, little work has been done on providing a general framework to analyse the highly dynamic data of social networks from a multidimensional perspective. In this paper, we claim that such a framework is crucial for including social data in PHS systems. Methods: We propose a dynamic multidimensional approach to deal with social data streams. In this approach, dynamic dimensions are continuously updated by applying unsupervised text mining methods. More specifically, we analyse the semantics and temporal patterns in posts for identifying relevant events, topics and users. We also define quality metrics to detect relevant user profiles. In this way, the incoming data can be further filtered to cope with the goals of PHS systems. Results: We have evaluated our approach over a long-term stream of Twitter. We show how the proposed quality metrics allow us to filter out the users that are out-of-domain as well as those with low quality in their messages. We also explain how specific user profiles can be identified through their descriptions. Finally, we illustrate how the proposed multidimensional model can be used to identify main events and topics, as well as to analyse their audience and impact. Conclusions: The results show that the proposed dynamic multidimensional model is able to identify relevant events and topics and analyse them from different perspectives, which is especially useful for PHS systems. MDPI 2020-03-28 2020-04 /pmc/articles/PMC7177443/ /pubmed/32231152 http://dx.doi.org/10.3390/ijerph17072289 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Aramburu, María José
Berlanga, Rafael
Lanza, Indira
Social Media Multidimensional Analysis for Intelligent Health Surveillance
title Social Media Multidimensional Analysis for Intelligent Health Surveillance
title_full Social Media Multidimensional Analysis for Intelligent Health Surveillance
title_fullStr Social Media Multidimensional Analysis for Intelligent Health Surveillance
title_full_unstemmed Social Media Multidimensional Analysis for Intelligent Health Surveillance
title_short Social Media Multidimensional Analysis for Intelligent Health Surveillance
title_sort social media multidimensional analysis for intelligent health surveillance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7177443/
https://www.ncbi.nlm.nih.gov/pubmed/32231152
http://dx.doi.org/10.3390/ijerph17072289
work_keys_str_mv AT aramburumariajose socialmediamultidimensionalanalysisforintelligenthealthsurveillance
AT berlangarafael socialmediamultidimensionalanalysisforintelligenthealthsurveillance
AT lanzaindira socialmediamultidimensionalanalysisforintelligenthealthsurveillance