Cargando…

Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language

Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algor...

Descripción completa

Detalles Bibliográficos
Autores principales: Gesualdo, Francesco, Stilo, Giovanni, Agricola, Eleonora, Gonfiantini, Michaela V., Pandolfi, Elisabetta, Velardi, Paola, Tozzi, Alberto E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853203/
https://www.ncbi.nlm.nih.gov/pubmed/24324799
http://dx.doi.org/10.1371/journal.pone.0082489
_version_ 1782478796872482816
author Gesualdo, Francesco
Stilo, Giovanni
Agricola, Eleonora
Gonfiantini, Michaela V.
Pandolfi, Elisabetta
Velardi, Paola
Tozzi, Alberto E.
author_facet Gesualdo, Francesco
Stilo, Giovanni
Agricola, Eleonora
Gonfiantini, Michaela V.
Pandolfi, Elisabetta
Velardi, Paola
Tozzi, Alberto E.
author_sort Gesualdo, Francesco
collection PubMed
description Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algorithm that exploits the abundance of health-related web pages to identify all jargon expressions related to a specific technical term. We then translated an influenza case definition into a Boolean query, each symptom being described by a technical term and all related jargon expressions, as identified by the algorithm. Subsequently, we monitored all tweets that reported a combination of symptoms satisfying the case definition query. In order to geolocalize messages, we defined 3 localization strategies based on codes associated with each tweet. We found a high correlation coefficient between the trend of our influenza-positive tweets and ILI trends identified by US traditional surveillance systems.
format Online
Article
Text
id pubmed-3853203
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38532032013-12-09 Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language Gesualdo, Francesco Stilo, Giovanni Agricola, Eleonora Gonfiantini, Michaela V. Pandolfi, Elisabetta Velardi, Paola Tozzi, Alberto E. PLoS One Research Article Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algorithm that exploits the abundance of health-related web pages to identify all jargon expressions related to a specific technical term. We then translated an influenza case definition into a Boolean query, each symptom being described by a technical term and all related jargon expressions, as identified by the algorithm. Subsequently, we monitored all tweets that reported a combination of symptoms satisfying the case definition query. In order to geolocalize messages, we defined 3 localization strategies based on codes associated with each tweet. We found a high correlation coefficient between the trend of our influenza-positive tweets and ILI trends identified by US traditional surveillance systems. Public Library of Science 2013-12-04 /pmc/articles/PMC3853203/ /pubmed/24324799 http://dx.doi.org/10.1371/journal.pone.0082489 Text en © 2013 Gesualdo et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Gesualdo, Francesco
Stilo, Giovanni
Agricola, Eleonora
Gonfiantini, Michaela V.
Pandolfi, Elisabetta
Velardi, Paola
Tozzi, Alberto E.
Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language
title Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language
title_full Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language
title_fullStr Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language
title_full_unstemmed Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language
title_short Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language
title_sort influenza-like illness surveillance on twitter through automated learning of naïve language
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853203/
https://www.ncbi.nlm.nih.gov/pubmed/24324799
http://dx.doi.org/10.1371/journal.pone.0082489
work_keys_str_mv AT gesualdofrancesco influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT stilogiovanni influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT agricolaeleonora influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT gonfiantinimichaelav influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT pandolfielisabetta influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT velardipaola influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT tozzialbertoe influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage