Cargando…

Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review

BACKGROUND: Traditionally, dengue surveillance is based on case reporting to a central health agency. However, the delay between a case and its notification can limit the system responsiveness. Machine learning methods have been developed to reduce the reporting delays and to predict outbreaks, base...

Descripción completa

Detalles Bibliográficos
Autores principales: Sylvestre, Emmanuelle, Joachim, Clarisse, Cécilia-Joseph, Elsa, Bouzillé, Guillaume, Campillo-Gimenez, Boris, Cuggia, Marc, Cabié, André
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8740963/
https://www.ncbi.nlm.nih.gov/pubmed/34995281
http://dx.doi.org/10.1371/journal.pntd.0010056
_version_ 1784629404406644736
author Sylvestre, Emmanuelle
Joachim, Clarisse
Cécilia-Joseph, Elsa
Bouzillé, Guillaume
Campillo-Gimenez, Boris
Cuggia, Marc
Cabié, André
author_facet Sylvestre, Emmanuelle
Joachim, Clarisse
Cécilia-Joseph, Elsa
Bouzillé, Guillaume
Campillo-Gimenez, Boris
Cuggia, Marc
Cabié, André
author_sort Sylvestre, Emmanuelle
collection PubMed
description BACKGROUND: Traditionally, dengue surveillance is based on case reporting to a central health agency. However, the delay between a case and its notification can limit the system responsiveness. Machine learning methods have been developed to reduce the reporting delays and to predict outbreaks, based on non-traditional and non-clinical data sources. The aim of this systematic review was to identify studies that used real-world data, Big Data and/or machine learning methods to monitor and predict dengue-related outcomes. METHODOLOGY/PRINCIPAL FINDINGS: We performed a search in PubMed, Scopus, Web of Science and grey literature between January 1, 2000 and August 31, 2020. The review (ID: CRD42020172472) focused on data-driven studies. Reviews, randomized control trials and descriptive studies were not included. Among the 119 studies included, 67% were published between 2016 and 2020, and 39% used at least one novel data stream. The aim of the included studies was to predict a dengue-related outcome (55%), assess the validity of data sources for dengue surveillance (23%), or both (22%). Most studies (60%) used a machine learning approach. Studies on dengue prediction compared different prediction models, or identified significant predictors among several covariates in a model. The most significant predictors were rainfall (43%), temperature (41%), and humidity (25%). The two models with the highest performances were Neural Networks and Decision Trees (52%), followed by Support Vector Machine (17%). We cannot rule out a selection bias in our study because of our two main limitations: we did not include preprints and could not obtain the opinion of other international experts. CONCLUSIONS/SIGNIFICANCE: Combining real-world data and Big Data with machine learning methods is a promising approach to improve dengue prediction and monitoring. Future studies should focus on how to better integrate all available data sources and methods to improve the response and dengue management by stakeholders.
format Online
Article
Text
id pubmed-8740963
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87409632022-01-08 Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review Sylvestre, Emmanuelle Joachim, Clarisse Cécilia-Joseph, Elsa Bouzillé, Guillaume Campillo-Gimenez, Boris Cuggia, Marc Cabié, André PLoS Negl Trop Dis Research Article BACKGROUND: Traditionally, dengue surveillance is based on case reporting to a central health agency. However, the delay between a case and its notification can limit the system responsiveness. Machine learning methods have been developed to reduce the reporting delays and to predict outbreaks, based on non-traditional and non-clinical data sources. The aim of this systematic review was to identify studies that used real-world data, Big Data and/or machine learning methods to monitor and predict dengue-related outcomes. METHODOLOGY/PRINCIPAL FINDINGS: We performed a search in PubMed, Scopus, Web of Science and grey literature between January 1, 2000 and August 31, 2020. The review (ID: CRD42020172472) focused on data-driven studies. Reviews, randomized control trials and descriptive studies were not included. Among the 119 studies included, 67% were published between 2016 and 2020, and 39% used at least one novel data stream. The aim of the included studies was to predict a dengue-related outcome (55%), assess the validity of data sources for dengue surveillance (23%), or both (22%). Most studies (60%) used a machine learning approach. Studies on dengue prediction compared different prediction models, or identified significant predictors among several covariates in a model. The most significant predictors were rainfall (43%), temperature (41%), and humidity (25%). The two models with the highest performances were Neural Networks and Decision Trees (52%), followed by Support Vector Machine (17%). We cannot rule out a selection bias in our study because of our two main limitations: we did not include preprints and could not obtain the opinion of other international experts. CONCLUSIONS/SIGNIFICANCE: Combining real-world data and Big Data with machine learning methods is a promising approach to improve dengue prediction and monitoring. Future studies should focus on how to better integrate all available data sources and methods to improve the response and dengue management by stakeholders. Public Library of Science 2022-01-07 /pmc/articles/PMC8740963/ /pubmed/34995281 http://dx.doi.org/10.1371/journal.pntd.0010056 Text en © 2022 Sylvestre et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sylvestre, Emmanuelle
Joachim, Clarisse
Cécilia-Joseph, Elsa
Bouzillé, Guillaume
Campillo-Gimenez, Boris
Cuggia, Marc
Cabié, André
Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review
title Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review
title_full Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review
title_fullStr Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review
title_full_unstemmed Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review
title_short Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review
title_sort data-driven methods for dengue prediction and surveillance using real-world and big data: a systematic review
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8740963/
https://www.ncbi.nlm.nih.gov/pubmed/34995281
http://dx.doi.org/10.1371/journal.pntd.0010056
work_keys_str_mv AT sylvestreemmanuelle datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview
AT joachimclarisse datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview
AT ceciliajosephelsa datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview
AT bouzilleguillaume datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview
AT campillogimenezboris datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview
AT cuggiamarc datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview
AT cabieandre datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview