Cargando…

Twitter: A Good Place to Detect Health Conditions

With the proliferation of social networks and blogs, the Internet is increasingly being used to disseminate personal health information rather than just as a source of information. In this paper we exploit the wealth of user-generated data, available through the micro-blogging service Twitter, to es...

Descripción completa

Detalles Bibliográficos
Autores principales: Prieto, Víctor M., Matos, Sérgio, Álvarez, Manuel, Cacheda, Fidel, Oliveira, José Luís
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906034/
https://www.ncbi.nlm.nih.gov/pubmed/24489699
http://dx.doi.org/10.1371/journal.pone.0086191
_version_ 1782301424613326848
author Prieto, Víctor M.
Matos, Sérgio
Álvarez, Manuel
Cacheda, Fidel
Oliveira, José Luís
author_facet Prieto, Víctor M.
Matos, Sérgio
Álvarez, Manuel
Cacheda, Fidel
Oliveira, José Luís
author_sort Prieto, Víctor M.
collection PubMed
description With the proliferation of social networks and blogs, the Internet is increasingly being used to disseminate personal health information rather than just as a source of information. In this paper we exploit the wealth of user-generated data, available through the micro-blogging service Twitter, to estimate and track the incidence of health conditions in society. The method is based on two stages: we start by extracting possibly relevant tweets using a set of specially crafted regular expressions, and then classify these initial messages using machine learning methods. Furthermore, we selected relevant features to improve the results and the execution times. To test the method, we considered four health states or conditions, namely flu, depression, pregnancy and eating disorders, and two locations, Portugal and Spain. We present the results obtained and demonstrate that the detection results and the performance of the method are improved after feature selection. The results are promising, with areas under the receiver operating characteristic curve between 0.7 and 0.9, and f-measure values around 0.8 and 0.9. This fact indicates that such approach provides a feasible solution for measuring and tracking the evolution of health states within the society.
format Online
Article
Text
id pubmed-3906034
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39060342014-01-31 Twitter: A Good Place to Detect Health Conditions Prieto, Víctor M. Matos, Sérgio Álvarez, Manuel Cacheda, Fidel Oliveira, José Luís PLoS One Research Article With the proliferation of social networks and blogs, the Internet is increasingly being used to disseminate personal health information rather than just as a source of information. In this paper we exploit the wealth of user-generated data, available through the micro-blogging service Twitter, to estimate and track the incidence of health conditions in society. The method is based on two stages: we start by extracting possibly relevant tweets using a set of specially crafted regular expressions, and then classify these initial messages using machine learning methods. Furthermore, we selected relevant features to improve the results and the execution times. To test the method, we considered four health states or conditions, namely flu, depression, pregnancy and eating disorders, and two locations, Portugal and Spain. We present the results obtained and demonstrate that the detection results and the performance of the method are improved after feature selection. The results are promising, with areas under the receiver operating characteristic curve between 0.7 and 0.9, and f-measure values around 0.8 and 0.9. This fact indicates that such approach provides a feasible solution for measuring and tracking the evolution of health states within the society. Public Library of Science 2014-01-29 /pmc/articles/PMC3906034/ /pubmed/24489699 http://dx.doi.org/10.1371/journal.pone.0086191 Text en © 2014 Prieto et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Prieto, Víctor M.
Matos, Sérgio
Álvarez, Manuel
Cacheda, Fidel
Oliveira, José Luís
Twitter: A Good Place to Detect Health Conditions
title Twitter: A Good Place to Detect Health Conditions
title_full Twitter: A Good Place to Detect Health Conditions
title_fullStr Twitter: A Good Place to Detect Health Conditions
title_full_unstemmed Twitter: A Good Place to Detect Health Conditions
title_short Twitter: A Good Place to Detect Health Conditions
title_sort twitter: a good place to detect health conditions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906034/
https://www.ncbi.nlm.nih.gov/pubmed/24489699
http://dx.doi.org/10.1371/journal.pone.0086191
work_keys_str_mv AT prietovictorm twitteragoodplacetodetecthealthconditions
AT matossergio twitteragoodplacetodetecthealthconditions
AT alvarezmanuel twitteragoodplacetodetecthealthconditions
AT cachedafidel twitteragoodplacetodetecthealthconditions
AT oliveirajoseluis twitteragoodplacetodetecthealthconditions