Cargando…
Machine learning algorithms for dengue risk assessment: a case study for São Luís do Maranhão
This study aims to assess dengue fever risk using Machine Learning techniques, such as logistic regressions, linear discriminant analyses, Naive Bayes, decision tree, and random forest classifiers. This kind of approach to epidemiological problems has been developed to detect risks for diseases occu...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9664747/ http://dx.doi.org/10.1007/s40314-022-02101-z |
_version_ | 1784831164767272960 |
---|---|
author | Rocha, Fernanda Paula Giesbrecht, Mateus |
author_facet | Rocha, Fernanda Paula Giesbrecht, Mateus |
author_sort | Rocha, Fernanda Paula |
collection | PubMed |
description | This study aims to assess dengue fever risk using Machine Learning techniques, such as logistic regressions, linear discriminant analyses, Naive Bayes, decision tree, and random forest classifiers. This kind of approach to epidemiological problems has been developed to detect risks for diseases occurrence and allows to create public policies based on mathematical models to prevent public health problems. In this study, the models were trained with data from the municipality of São Luís do Maranhão, state of Maranhão, Brazil. The majority of related works analyze states, countries, or continental levels, with greater availability of data. To apply the approach to such a small region, some oversampling techniques were used. The number of cases per neighborhood from 2014 to and 2020 and climatic, territorial, and environmental data was used as input variables to estimate the probability of dengue occurrence in the municipality. Due to the unbalanced database, we used the SMOTE, ADASYN, and DBSMOTE oversampling techniques. The DBSMOTE-trained Random Forest classifier achieved the best results with a 75.1% AUC, 75.43% sensitivity and a 60.53% specificity. |
format | Online Article Text |
id | pubmed-9664747 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-96647472022-11-14 Machine learning algorithms for dengue risk assessment: a case study for São Luís do Maranhão Rocha, Fernanda Paula Giesbrecht, Mateus Comp. Appl. Math. Article This study aims to assess dengue fever risk using Machine Learning techniques, such as logistic regressions, linear discriminant analyses, Naive Bayes, decision tree, and random forest classifiers. This kind of approach to epidemiological problems has been developed to detect risks for diseases occurrence and allows to create public policies based on mathematical models to prevent public health problems. In this study, the models were trained with data from the municipality of São Luís do Maranhão, state of Maranhão, Brazil. The majority of related works analyze states, countries, or continental levels, with greater availability of data. To apply the approach to such a small region, some oversampling techniques were used. The number of cases per neighborhood from 2014 to and 2020 and climatic, territorial, and environmental data was used as input variables to estimate the probability of dengue occurrence in the municipality. Due to the unbalanced database, we used the SMOTE, ADASYN, and DBSMOTE oversampling techniques. The DBSMOTE-trained Random Forest classifier achieved the best results with a 75.1% AUC, 75.43% sensitivity and a 60.53% specificity. Springer International Publishing 2022-11-15 2022 /pmc/articles/PMC9664747/ http://dx.doi.org/10.1007/s40314-022-02101-z Text en © The Author(s) under exclusive licence to Sociedade Brasileira de Matemática Aplicada e Computacional 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Rocha, Fernanda Paula Giesbrecht, Mateus Machine learning algorithms for dengue risk assessment: a case study for São Luís do Maranhão |
title | Machine learning algorithms for dengue risk assessment: a case study for São Luís do Maranhão |
title_full | Machine learning algorithms for dengue risk assessment: a case study for São Luís do Maranhão |
title_fullStr | Machine learning algorithms for dengue risk assessment: a case study for São Luís do Maranhão |
title_full_unstemmed | Machine learning algorithms for dengue risk assessment: a case study for São Luís do Maranhão |
title_short | Machine learning algorithms for dengue risk assessment: a case study for São Luís do Maranhão |
title_sort | machine learning algorithms for dengue risk assessment: a case study for são luís do maranhão |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9664747/ http://dx.doi.org/10.1007/s40314-022-02101-z |
work_keys_str_mv | AT rochafernandapaula machinelearningalgorithmsfordengueriskassessmentacasestudyforsaoluisdomaranhao AT giesbrechtmateus machinelearningalgorithmsfordengueriskassessmentacasestudyforsaoluisdomaranhao |