Cargando…
Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review
BACKGROUND: Traditionally, dengue surveillance is based on case reporting to a central health agency. However, the delay between a case and its notification can limit the system responsiveness. Machine learning methods have been developed to reduce the reporting delays and to predict outbreaks, base...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8740963/ https://www.ncbi.nlm.nih.gov/pubmed/34995281 http://dx.doi.org/10.1371/journal.pntd.0010056 |
_version_ | 1784629404406644736 |
---|---|
author | Sylvestre, Emmanuelle Joachim, Clarisse Cécilia-Joseph, Elsa Bouzillé, Guillaume Campillo-Gimenez, Boris Cuggia, Marc Cabié, André |
author_facet | Sylvestre, Emmanuelle Joachim, Clarisse Cécilia-Joseph, Elsa Bouzillé, Guillaume Campillo-Gimenez, Boris Cuggia, Marc Cabié, André |
author_sort | Sylvestre, Emmanuelle |
collection | PubMed |
description | BACKGROUND: Traditionally, dengue surveillance is based on case reporting to a central health agency. However, the delay between a case and its notification can limit the system responsiveness. Machine learning methods have been developed to reduce the reporting delays and to predict outbreaks, based on non-traditional and non-clinical data sources. The aim of this systematic review was to identify studies that used real-world data, Big Data and/or machine learning methods to monitor and predict dengue-related outcomes. METHODOLOGY/PRINCIPAL FINDINGS: We performed a search in PubMed, Scopus, Web of Science and grey literature between January 1, 2000 and August 31, 2020. The review (ID: CRD42020172472) focused on data-driven studies. Reviews, randomized control trials and descriptive studies were not included. Among the 119 studies included, 67% were published between 2016 and 2020, and 39% used at least one novel data stream. The aim of the included studies was to predict a dengue-related outcome (55%), assess the validity of data sources for dengue surveillance (23%), or both (22%). Most studies (60%) used a machine learning approach. Studies on dengue prediction compared different prediction models, or identified significant predictors among several covariates in a model. The most significant predictors were rainfall (43%), temperature (41%), and humidity (25%). The two models with the highest performances were Neural Networks and Decision Trees (52%), followed by Support Vector Machine (17%). We cannot rule out a selection bias in our study because of our two main limitations: we did not include preprints and could not obtain the opinion of other international experts. CONCLUSIONS/SIGNIFICANCE: Combining real-world data and Big Data with machine learning methods is a promising approach to improve dengue prediction and monitoring. Future studies should focus on how to better integrate all available data sources and methods to improve the response and dengue management by stakeholders. |
format | Online Article Text |
id | pubmed-8740963 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-87409632022-01-08 Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review Sylvestre, Emmanuelle Joachim, Clarisse Cécilia-Joseph, Elsa Bouzillé, Guillaume Campillo-Gimenez, Boris Cuggia, Marc Cabié, André PLoS Negl Trop Dis Research Article BACKGROUND: Traditionally, dengue surveillance is based on case reporting to a central health agency. However, the delay between a case and its notification can limit the system responsiveness. Machine learning methods have been developed to reduce the reporting delays and to predict outbreaks, based on non-traditional and non-clinical data sources. The aim of this systematic review was to identify studies that used real-world data, Big Data and/or machine learning methods to monitor and predict dengue-related outcomes. METHODOLOGY/PRINCIPAL FINDINGS: We performed a search in PubMed, Scopus, Web of Science and grey literature between January 1, 2000 and August 31, 2020. The review (ID: CRD42020172472) focused on data-driven studies. Reviews, randomized control trials and descriptive studies were not included. Among the 119 studies included, 67% were published between 2016 and 2020, and 39% used at least one novel data stream. The aim of the included studies was to predict a dengue-related outcome (55%), assess the validity of data sources for dengue surveillance (23%), or both (22%). Most studies (60%) used a machine learning approach. Studies on dengue prediction compared different prediction models, or identified significant predictors among several covariates in a model. The most significant predictors were rainfall (43%), temperature (41%), and humidity (25%). The two models with the highest performances were Neural Networks and Decision Trees (52%), followed by Support Vector Machine (17%). We cannot rule out a selection bias in our study because of our two main limitations: we did not include preprints and could not obtain the opinion of other international experts. CONCLUSIONS/SIGNIFICANCE: Combining real-world data and Big Data with machine learning methods is a promising approach to improve dengue prediction and monitoring. Future studies should focus on how to better integrate all available data sources and methods to improve the response and dengue management by stakeholders. Public Library of Science 2022-01-07 /pmc/articles/PMC8740963/ /pubmed/34995281 http://dx.doi.org/10.1371/journal.pntd.0010056 Text en © 2022 Sylvestre et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Sylvestre, Emmanuelle Joachim, Clarisse Cécilia-Joseph, Elsa Bouzillé, Guillaume Campillo-Gimenez, Boris Cuggia, Marc Cabié, André Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review |
title | Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review |
title_full | Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review |
title_fullStr | Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review |
title_full_unstemmed | Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review |
title_short | Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review |
title_sort | data-driven methods for dengue prediction and surveillance using real-world and big data: a systematic review |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8740963/ https://www.ncbi.nlm.nih.gov/pubmed/34995281 http://dx.doi.org/10.1371/journal.pntd.0010056 |
work_keys_str_mv | AT sylvestreemmanuelle datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview AT joachimclarisse datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview AT ceciliajosephelsa datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview AT bouzilleguillaume datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview AT campillogimenezboris datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview AT cuggiamarc datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview AT cabieandre datadrivenmethodsfordenguepredictionandsurveillanceusingrealworldandbigdataasystematicreview |