Cargando…
Social Media Multidimensional Analysis for Intelligent Health Surveillance
Background: Recent work in social network analysis has shown the usefulness of analysing and predicting outcomes from user-generated data in the context of Public Health Surveillance (PHS). Most of the proposals have focused on dealing with static datasets gathered from social networks, which are pr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7177443/ https://www.ncbi.nlm.nih.gov/pubmed/32231152 http://dx.doi.org/10.3390/ijerph17072289 |
_version_ | 1783525221430460416 |
---|---|
author | Aramburu, María José Berlanga, Rafael Lanza, Indira |
author_facet | Aramburu, María José Berlanga, Rafael Lanza, Indira |
author_sort | Aramburu, María José |
collection | PubMed |
description | Background: Recent work in social network analysis has shown the usefulness of analysing and predicting outcomes from user-generated data in the context of Public Health Surveillance (PHS). Most of the proposals have focused on dealing with static datasets gathered from social networks, which are processed and mined off-line. However, little work has been done on providing a general framework to analyse the highly dynamic data of social networks from a multidimensional perspective. In this paper, we claim that such a framework is crucial for including social data in PHS systems. Methods: We propose a dynamic multidimensional approach to deal with social data streams. In this approach, dynamic dimensions are continuously updated by applying unsupervised text mining methods. More specifically, we analyse the semantics and temporal patterns in posts for identifying relevant events, topics and users. We also define quality metrics to detect relevant user profiles. In this way, the incoming data can be further filtered to cope with the goals of PHS systems. Results: We have evaluated our approach over a long-term stream of Twitter. We show how the proposed quality metrics allow us to filter out the users that are out-of-domain as well as those with low quality in their messages. We also explain how specific user profiles can be identified through their descriptions. Finally, we illustrate how the proposed multidimensional model can be used to identify main events and topics, as well as to analyse their audience and impact. Conclusions: The results show that the proposed dynamic multidimensional model is able to identify relevant events and topics and analyse them from different perspectives, which is especially useful for PHS systems. |
format | Online Article Text |
id | pubmed-7177443 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-71774432020-04-28 Social Media Multidimensional Analysis for Intelligent Health Surveillance Aramburu, María José Berlanga, Rafael Lanza, Indira Int J Environ Res Public Health Article Background: Recent work in social network analysis has shown the usefulness of analysing and predicting outcomes from user-generated data in the context of Public Health Surveillance (PHS). Most of the proposals have focused on dealing with static datasets gathered from social networks, which are processed and mined off-line. However, little work has been done on providing a general framework to analyse the highly dynamic data of social networks from a multidimensional perspective. In this paper, we claim that such a framework is crucial for including social data in PHS systems. Methods: We propose a dynamic multidimensional approach to deal with social data streams. In this approach, dynamic dimensions are continuously updated by applying unsupervised text mining methods. More specifically, we analyse the semantics and temporal patterns in posts for identifying relevant events, topics and users. We also define quality metrics to detect relevant user profiles. In this way, the incoming data can be further filtered to cope with the goals of PHS systems. Results: We have evaluated our approach over a long-term stream of Twitter. We show how the proposed quality metrics allow us to filter out the users that are out-of-domain as well as those with low quality in their messages. We also explain how specific user profiles can be identified through their descriptions. Finally, we illustrate how the proposed multidimensional model can be used to identify main events and topics, as well as to analyse their audience and impact. Conclusions: The results show that the proposed dynamic multidimensional model is able to identify relevant events and topics and analyse them from different perspectives, which is especially useful for PHS systems. MDPI 2020-03-28 2020-04 /pmc/articles/PMC7177443/ /pubmed/32231152 http://dx.doi.org/10.3390/ijerph17072289 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Aramburu, María José Berlanga, Rafael Lanza, Indira Social Media Multidimensional Analysis for Intelligent Health Surveillance |
title | Social Media Multidimensional Analysis for Intelligent Health Surveillance |
title_full | Social Media Multidimensional Analysis for Intelligent Health Surveillance |
title_fullStr | Social Media Multidimensional Analysis for Intelligent Health Surveillance |
title_full_unstemmed | Social Media Multidimensional Analysis for Intelligent Health Surveillance |
title_short | Social Media Multidimensional Analysis for Intelligent Health Surveillance |
title_sort | social media multidimensional analysis for intelligent health surveillance |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7177443/ https://www.ncbi.nlm.nih.gov/pubmed/32231152 http://dx.doi.org/10.3390/ijerph17072289 |
work_keys_str_mv | AT aramburumariajose socialmediamultidimensionalanalysisforintelligenthealthsurveillance AT berlangarafael socialmediamultidimensionalanalysisforintelligenthealthsurveillance AT lanzaindira socialmediamultidimensionalanalysisforintelligenthealthsurveillance |