Cargando…
Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe
BACKGROUND: In recent years new forms of syndromic surveillance that use data from the Internet have been proposed. These have been developed to assist the early prediction of epidemics in various cases and diseases. It has been found that these systems are accurate in monitoring and predicting outb...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7819209/ https://www.ncbi.nlm.nih.gov/pubmed/33472589 http://dx.doi.org/10.1186/s12889-020-10106-8 |
_version_ | 1783638967842766848 |
---|---|
author | Samaras, Loukas Sicilia, Miguel-Angel García-Barriocanal, Elena |
author_facet | Samaras, Loukas Sicilia, Miguel-Angel García-Barriocanal, Elena |
author_sort | Samaras, Loukas |
collection | PubMed |
description | BACKGROUND: In recent years new forms of syndromic surveillance that use data from the Internet have been proposed. These have been developed to assist the early prediction of epidemics in various cases and diseases. It has been found that these systems are accurate in monitoring and predicting outbreaks before these are observed in population and, therefore, they can be used as a complement to other methods. In this research, our aim is to examine a highly infectious disease, measles, as there is no extensive literature on forecasting measles using Internet data, METHODS: This research has been conducted with official data on measles for 5 years (2013–2018) from the competent authority of the European Union (European Center of Disease and Prevention - ECDC) and data obtained from Google Trends by using scripts coded in Python. We compared regression models forecasting the development of measles in the five countries. RESULTS: Results show that measles can be estimated and predicted through Google Trends in terms of time, volume and the overall spread. The combined results reveal a strong relationship of measles cases with the predicted cases (correlation coefficient R= 0.779 in two-tailed significance p< 0.01). The mean standard error was relatively low 45.2 (12.19%) for the combined results. However, major differences and deviations were observed for countries with a relatively low impact of measles, such as the United Kingdom and Spain. For these countries, alternative models were tested in an attempt to improve the results. CONCLUSIONS: The estimation of measles cases from Google Trends produces acceptable results and can help predict outbreaks in a robust and sound manner, at least 2 months in advance. Python scripts can be used individually or within the framework of an integrated Internet surveillance system for tracking epidemics as the one addressed here. |
format | Online Article Text |
id | pubmed-7819209 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-78192092021-01-22 Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe Samaras, Loukas Sicilia, Miguel-Angel García-Barriocanal, Elena BMC Public Health Research Article BACKGROUND: In recent years new forms of syndromic surveillance that use data from the Internet have been proposed. These have been developed to assist the early prediction of epidemics in various cases and diseases. It has been found that these systems are accurate in monitoring and predicting outbreaks before these are observed in population and, therefore, they can be used as a complement to other methods. In this research, our aim is to examine a highly infectious disease, measles, as there is no extensive literature on forecasting measles using Internet data, METHODS: This research has been conducted with official data on measles for 5 years (2013–2018) from the competent authority of the European Union (European Center of Disease and Prevention - ECDC) and data obtained from Google Trends by using scripts coded in Python. We compared regression models forecasting the development of measles in the five countries. RESULTS: Results show that measles can be estimated and predicted through Google Trends in terms of time, volume and the overall spread. The combined results reveal a strong relationship of measles cases with the predicted cases (correlation coefficient R= 0.779 in two-tailed significance p< 0.01). The mean standard error was relatively low 45.2 (12.19%) for the combined results. However, major differences and deviations were observed for countries with a relatively low impact of measles, such as the United Kingdom and Spain. For these countries, alternative models were tested in an attempt to improve the results. CONCLUSIONS: The estimation of measles cases from Google Trends produces acceptable results and can help predict outbreaks in a robust and sound manner, at least 2 months in advance. Python scripts can be used individually or within the framework of an integrated Internet surveillance system for tracking epidemics as the one addressed here. BioMed Central 2021-01-21 /pmc/articles/PMC7819209/ /pubmed/33472589 http://dx.doi.org/10.1186/s12889-020-10106-8 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Samaras, Loukas Sicilia, Miguel-Angel García-Barriocanal, Elena Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe |
title | Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe |
title_full | Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe |
title_fullStr | Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe |
title_full_unstemmed | Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe |
title_short | Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe |
title_sort | predicting epidemics using search engine data: a comparative study on measles in the largest countries of europe |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7819209/ https://www.ncbi.nlm.nih.gov/pubmed/33472589 http://dx.doi.org/10.1186/s12889-020-10106-8 |
work_keys_str_mv | AT samarasloukas predictingepidemicsusingsearchenginedataacomparativestudyonmeaslesinthelargestcountriesofeurope AT siciliamiguelangel predictingepidemicsusingsearchenginedataacomparativestudyonmeaslesinthelargestcountriesofeurope AT garciabarriocanalelena predictingepidemicsusingsearchenginedataacomparativestudyonmeaslesinthelargestcountriesofeurope |