Cargando…
Twitter: A Good Place to Detect Health Conditions
With the proliferation of social networks and blogs, the Internet is increasingly being used to disseminate personal health information rather than just as a source of information. In this paper we exploit the wealth of user-generated data, available through the micro-blogging service Twitter, to es...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906034/ https://www.ncbi.nlm.nih.gov/pubmed/24489699 http://dx.doi.org/10.1371/journal.pone.0086191 |
_version_ | 1782301424613326848 |
---|---|
author | Prieto, Víctor M. Matos, Sérgio Álvarez, Manuel Cacheda, Fidel Oliveira, José Luís |
author_facet | Prieto, Víctor M. Matos, Sérgio Álvarez, Manuel Cacheda, Fidel Oliveira, José Luís |
author_sort | Prieto, Víctor M. |
collection | PubMed |
description | With the proliferation of social networks and blogs, the Internet is increasingly being used to disseminate personal health information rather than just as a source of information. In this paper we exploit the wealth of user-generated data, available through the micro-blogging service Twitter, to estimate and track the incidence of health conditions in society. The method is based on two stages: we start by extracting possibly relevant tweets using a set of specially crafted regular expressions, and then classify these initial messages using machine learning methods. Furthermore, we selected relevant features to improve the results and the execution times. To test the method, we considered four health states or conditions, namely flu, depression, pregnancy and eating disorders, and two locations, Portugal and Spain. We present the results obtained and demonstrate that the detection results and the performance of the method are improved after feature selection. The results are promising, with areas under the receiver operating characteristic curve between 0.7 and 0.9, and f-measure values around 0.8 and 0.9. This fact indicates that such approach provides a feasible solution for measuring and tracking the evolution of health states within the society. |
format | Online Article Text |
id | pubmed-3906034 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-39060342014-01-31 Twitter: A Good Place to Detect Health Conditions Prieto, Víctor M. Matos, Sérgio Álvarez, Manuel Cacheda, Fidel Oliveira, José Luís PLoS One Research Article With the proliferation of social networks and blogs, the Internet is increasingly being used to disseminate personal health information rather than just as a source of information. In this paper we exploit the wealth of user-generated data, available through the micro-blogging service Twitter, to estimate and track the incidence of health conditions in society. The method is based on two stages: we start by extracting possibly relevant tweets using a set of specially crafted regular expressions, and then classify these initial messages using machine learning methods. Furthermore, we selected relevant features to improve the results and the execution times. To test the method, we considered four health states or conditions, namely flu, depression, pregnancy and eating disorders, and two locations, Portugal and Spain. We present the results obtained and demonstrate that the detection results and the performance of the method are improved after feature selection. The results are promising, with areas under the receiver operating characteristic curve between 0.7 and 0.9, and f-measure values around 0.8 and 0.9. This fact indicates that such approach provides a feasible solution for measuring and tracking the evolution of health states within the society. Public Library of Science 2014-01-29 /pmc/articles/PMC3906034/ /pubmed/24489699 http://dx.doi.org/10.1371/journal.pone.0086191 Text en © 2014 Prieto et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Prieto, Víctor M. Matos, Sérgio Álvarez, Manuel Cacheda, Fidel Oliveira, José Luís Twitter: A Good Place to Detect Health Conditions |
title | Twitter: A Good Place to Detect Health Conditions |
title_full | Twitter: A Good Place to Detect Health Conditions |
title_fullStr | Twitter: A Good Place to Detect Health Conditions |
title_full_unstemmed | Twitter: A Good Place to Detect Health Conditions |
title_short | Twitter: A Good Place to Detect Health Conditions |
title_sort | twitter: a good place to detect health conditions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906034/ https://www.ncbi.nlm.nih.gov/pubmed/24489699 http://dx.doi.org/10.1371/journal.pone.0086191 |
work_keys_str_mv | AT prietovictorm twitteragoodplacetodetecthealthconditions AT matossergio twitteragoodplacetodetecthealthconditions AT alvarezmanuel twitteragoodplacetodetecthealthconditions AT cachedafidel twitteragoodplacetodetecthealthconditions AT oliveirajoseluis twitteragoodplacetodetecthealthconditions |