Cargando…

Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe

BACKGROUND: In recent years new forms of syndromic surveillance that use data from the Internet have been proposed. These have been developed to assist the early prediction of epidemics in various cases and diseases. It has been found that these systems are accurate in monitoring and predicting outb...

Descripción completa

Detalles Bibliográficos
Autores principales: Samaras, Loukas, Sicilia, Miguel-Angel, García-Barriocanal, Elena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7819209/
https://www.ncbi.nlm.nih.gov/pubmed/33472589
http://dx.doi.org/10.1186/s12889-020-10106-8
_version_ 1783638967842766848
author Samaras, Loukas
Sicilia, Miguel-Angel
García-Barriocanal, Elena
author_facet Samaras, Loukas
Sicilia, Miguel-Angel
García-Barriocanal, Elena
author_sort Samaras, Loukas
collection PubMed
description BACKGROUND: In recent years new forms of syndromic surveillance that use data from the Internet have been proposed. These have been developed to assist the early prediction of epidemics in various cases and diseases. It has been found that these systems are accurate in monitoring and predicting outbreaks before these are observed in population and, therefore, they can be used as a complement to other methods. In this research, our aim is to examine a highly infectious disease, measles, as there is no extensive literature on forecasting measles using Internet data, METHODS: This research has been conducted with official data on measles for 5 years (2013–2018) from the competent authority of the European Union (European Center of Disease and Prevention - ECDC) and data obtained from Google Trends by using scripts coded in Python. We compared regression models forecasting the development of measles in the five countries. RESULTS: Results show that measles can be estimated and predicted through Google Trends in terms of time, volume and the overall spread. The combined results reveal a strong relationship of measles cases with the predicted cases (correlation coefficient R= 0.779 in two-tailed significance p< 0.01). The mean standard error was relatively low 45.2 (12.19%) for the combined results. However, major differences and deviations were observed for countries with a relatively low impact of measles, such as the United Kingdom and Spain. For these countries, alternative models were tested in an attempt to improve the results. CONCLUSIONS: The estimation of measles cases from Google Trends produces acceptable results and can help predict outbreaks in a robust and sound manner, at least 2 months in advance. Python scripts can be used individually or within the framework of an integrated Internet surveillance system for tracking epidemics as the one addressed here.
format Online
Article
Text
id pubmed-7819209
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-78192092021-01-22 Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe Samaras, Loukas Sicilia, Miguel-Angel García-Barriocanal, Elena BMC Public Health Research Article BACKGROUND: In recent years new forms of syndromic surveillance that use data from the Internet have been proposed. These have been developed to assist the early prediction of epidemics in various cases and diseases. It has been found that these systems are accurate in monitoring and predicting outbreaks before these are observed in population and, therefore, they can be used as a complement to other methods. In this research, our aim is to examine a highly infectious disease, measles, as there is no extensive literature on forecasting measles using Internet data, METHODS: This research has been conducted with official data on measles for 5 years (2013–2018) from the competent authority of the European Union (European Center of Disease and Prevention - ECDC) and data obtained from Google Trends by using scripts coded in Python. We compared regression models forecasting the development of measles in the five countries. RESULTS: Results show that measles can be estimated and predicted through Google Trends in terms of time, volume and the overall spread. The combined results reveal a strong relationship of measles cases with the predicted cases (correlation coefficient R= 0.779 in two-tailed significance p< 0.01). The mean standard error was relatively low 45.2 (12.19%) for the combined results. However, major differences and deviations were observed for countries with a relatively low impact of measles, such as the United Kingdom and Spain. For these countries, alternative models were tested in an attempt to improve the results. CONCLUSIONS: The estimation of measles cases from Google Trends produces acceptable results and can help predict outbreaks in a robust and sound manner, at least 2 months in advance. Python scripts can be used individually or within the framework of an integrated Internet surveillance system for tracking epidemics as the one addressed here. BioMed Central 2021-01-21 /pmc/articles/PMC7819209/ /pubmed/33472589 http://dx.doi.org/10.1186/s12889-020-10106-8 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Samaras, Loukas
Sicilia, Miguel-Angel
García-Barriocanal, Elena
Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe
title Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe
title_full Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe
title_fullStr Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe
title_full_unstemmed Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe
title_short Predicting epidemics using search engine data: a comparative study on measles in the largest countries of Europe
title_sort predicting epidemics using search engine data: a comparative study on measles in the largest countries of europe
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7819209/
https://www.ncbi.nlm.nih.gov/pubmed/33472589
http://dx.doi.org/10.1186/s12889-020-10106-8
work_keys_str_mv AT samarasloukas predictingepidemicsusingsearchenginedataacomparativestudyonmeaslesinthelargestcountriesofeurope
AT siciliamiguelangel predictingepidemicsusingsearchenginedataacomparativestudyonmeaslesinthelargestcountriesofeurope
AT garciabarriocanalelena predictingepidemicsusingsearchenginedataacomparativestudyonmeaslesinthelargestcountriesofeurope