Cargando…
Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language
Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algor...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853203/ https://www.ncbi.nlm.nih.gov/pubmed/24324799 http://dx.doi.org/10.1371/journal.pone.0082489 |
_version_ | 1782478796872482816 |
---|---|
author | Gesualdo, Francesco Stilo, Giovanni Agricola, Eleonora Gonfiantini, Michaela V. Pandolfi, Elisabetta Velardi, Paola Tozzi, Alberto E. |
author_facet | Gesualdo, Francesco Stilo, Giovanni Agricola, Eleonora Gonfiantini, Michaela V. Pandolfi, Elisabetta Velardi, Paola Tozzi, Alberto E. |
author_sort | Gesualdo, Francesco |
collection | PubMed |
description | Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algorithm that exploits the abundance of health-related web pages to identify all jargon expressions related to a specific technical term. We then translated an influenza case definition into a Boolean query, each symptom being described by a technical term and all related jargon expressions, as identified by the algorithm. Subsequently, we monitored all tweets that reported a combination of symptoms satisfying the case definition query. In order to geolocalize messages, we defined 3 localization strategies based on codes associated with each tweet. We found a high correlation coefficient between the trend of our influenza-positive tweets and ILI trends identified by US traditional surveillance systems. |
format | Online Article Text |
id | pubmed-3853203 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38532032013-12-09 Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language Gesualdo, Francesco Stilo, Giovanni Agricola, Eleonora Gonfiantini, Michaela V. Pandolfi, Elisabetta Velardi, Paola Tozzi, Alberto E. PLoS One Research Article Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algorithm that exploits the abundance of health-related web pages to identify all jargon expressions related to a specific technical term. We then translated an influenza case definition into a Boolean query, each symptom being described by a technical term and all related jargon expressions, as identified by the algorithm. Subsequently, we monitored all tweets that reported a combination of symptoms satisfying the case definition query. In order to geolocalize messages, we defined 3 localization strategies based on codes associated with each tweet. We found a high correlation coefficient between the trend of our influenza-positive tweets and ILI trends identified by US traditional surveillance systems. Public Library of Science 2013-12-04 /pmc/articles/PMC3853203/ /pubmed/24324799 http://dx.doi.org/10.1371/journal.pone.0082489 Text en © 2013 Gesualdo et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Gesualdo, Francesco Stilo, Giovanni Agricola, Eleonora Gonfiantini, Michaela V. Pandolfi, Elisabetta Velardi, Paola Tozzi, Alberto E. Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language |
title | Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language |
title_full | Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language |
title_fullStr | Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language |
title_full_unstemmed | Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language |
title_short | Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language |
title_sort | influenza-like illness surveillance on twitter through automated learning of naïve language |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853203/ https://www.ncbi.nlm.nih.gov/pubmed/24324799 http://dx.doi.org/10.1371/journal.pone.0082489 |
work_keys_str_mv | AT gesualdofrancesco influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage AT stilogiovanni influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage AT agricolaeleonora influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage AT gonfiantinimichaelav influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage AT pandolfielisabetta influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage AT velardipaola influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage AT tozzialbertoe influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage |