Cargando…

Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data

Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorith...

Descripción completa

Detalles Bibliográficos
Autores principales: Ong, Song Quan, Isawasan, Pradeep, Ngesom, Ahmad Mohiddin Mohd, Shahar, Hanipah, Lasim, As’malia Md, Nair, Gomesh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625978/
https://www.ncbi.nlm.nih.gov/pubmed/37926755
http://dx.doi.org/10.1038/s41598-023-46342-2
_version_ 1785131246581448704
author Ong, Song Quan
Isawasan, Pradeep
Ngesom, Ahmad Mohiddin Mohd
Shahar, Hanipah
Lasim, As’malia Md
Nair, Gomesh
author_facet Ong, Song Quan
Isawasan, Pradeep
Ngesom, Ahmad Mohiddin Mohd
Shahar, Hanipah
Lasim, As’malia Md
Nair, Gomesh
author_sort Ong, Song Quan
collection PubMed
description Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorithms that have higher performance. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including an ensemble ML method, and compared their performance using the receiver operating characteristic (ROC) with the area under the curve (AUC), accuracy and F1 score. Our results show that an ensemble ML such as XG Boost, AdaBoost and Random Forest perform better than the logistics regression, Naïve Bayens, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our result provides a framework for future studies on the use of predictive models in the development of an early warning system.
format Online
Article
Text
id pubmed-10625978
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106259782023-11-07 Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data Ong, Song Quan Isawasan, Pradeep Ngesom, Ahmad Mohiddin Mohd Shahar, Hanipah Lasim, As’malia Md Nair, Gomesh Sci Rep Article Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorithms that have higher performance. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including an ensemble ML method, and compared their performance using the receiver operating characteristic (ROC) with the area under the curve (AUC), accuracy and F1 score. Our results show that an ensemble ML such as XG Boost, AdaBoost and Random Forest perform better than the logistics regression, Naïve Bayens, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our result provides a framework for future studies on the use of predictive models in the development of an early warning system. Nature Publishing Group UK 2023-11-05 /pmc/articles/PMC10625978/ /pubmed/37926755 http://dx.doi.org/10.1038/s41598-023-46342-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Ong, Song Quan
Isawasan, Pradeep
Ngesom, Ahmad Mohiddin Mohd
Shahar, Hanipah
Lasim, As’malia Md
Nair, Gomesh
Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_full Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_fullStr Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_full_unstemmed Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_short Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
title_sort predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625978/
https://www.ncbi.nlm.nih.gov/pubmed/37926755
http://dx.doi.org/10.1038/s41598-023-46342-2
work_keys_str_mv AT ongsongquan predictingdenguetransmissionratesbycomparingdifferentmachinelearningmodelswithvectorindicesandmeteorologicaldata
AT isawasanpradeep predictingdenguetransmissionratesbycomparingdifferentmachinelearningmodelswithvectorindicesandmeteorologicaldata
AT ngesomahmadmohiddinmohd predictingdenguetransmissionratesbycomparingdifferentmachinelearningmodelswithvectorindicesandmeteorologicaldata
AT shaharhanipah predictingdenguetransmissionratesbycomparingdifferentmachinelearningmodelswithvectorindicesandmeteorologicaldata
AT lasimasmaliamd predictingdenguetransmissionratesbycomparingdifferentmachinelearningmodelswithvectorindicesandmeteorologicaldata
AT nairgomesh predictingdenguetransmissionratesbycomparingdifferentmachinelearningmodelswithvectorindicesandmeteorologicaldata