Cargando…

Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach

BACKGROUND: Controlling the COVID-19 outbreak in Brazil is a challenge due to the population’s size and urban density, inefficient maintenance of social distancing and testing strategies, and limited availability of testing resources. OBJECTIVE: The purpose of this study is to effectively prioritize...

Descripción completa

Detalles Bibliográficos
Autores principales:	Viana dos Santos Santana, Íris, CM da Silveira, Andressa, Sobrinho, Álvaro, Chaves e Silva, Lenardo, Dias da Silva, Leandro, Santos, Danilo F S, Gurjão, Edmar C, Perkusich, Angelo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2021
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8034680/ https://www.ncbi.nlm.nih.gov/pubmed/33750734 http://dx.doi.org/10.2196/27293

_version_	1783676578542125056
author	Viana dos Santos Santana, Íris CM da Silveira, Andressa Sobrinho, Álvaro Chaves e Silva, Lenardo Dias da Silva, Leandro Santos, Danilo F S Gurjão, Edmar C Perkusich, Angelo
author_facet	Viana dos Santos Santana, Íris CM da Silveira, Andressa Sobrinho, Álvaro Chaves e Silva, Lenardo Dias da Silva, Leandro Santos, Danilo F S Gurjão, Edmar C Perkusich, Angelo
author_sort	Viana dos Santos Santana, Íris
collection	PubMed
description	BACKGROUND: Controlling the COVID-19 outbreak in Brazil is a challenge due to the population’s size and urban density, inefficient maintenance of social distancing and testing strategies, and limited availability of testing resources. OBJECTIVE: The purpose of this study is to effectively prioritize patients who are symptomatic for testing to assist early COVID-19 detection in Brazil, addressing problems related to inefficient testing and control strategies. METHODS: Raw data from 55,676 Brazilians were preprocessed, and the chi-square test was used to confirm the relevance of the following features: gender, health professional, fever, sore throat, dyspnea, olfactory disorders, cough, coryza, taste disorders, and headache. Classification models were implemented relying on preprocessed data sets; supervised learning; and the algorithms multilayer perceptron (MLP), gradient boosting machine (GBM), decision tree (DT), random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbors (KNN), support vector machine (SVM), and logistic regression (LR). The models’ performances were analyzed using 10-fold cross-validation, classification metrics, and the Friedman and Nemenyi statistical tests. The permutation feature importance method was applied for ranking the features used by the classification models with the highest performances. RESULTS: Gender, fever, and dyspnea were among the highest-ranked features used by the classification models. The comparative analysis presents MLP, GBM, DT, RF, XGBoost, and SVM as the highest performance models with similar results. KNN and LR were outperformed by the other algorithms. Applying the easy interpretability as an additional comparison criterion, the DT was considered the most suitable model. CONCLUSIONS: The DT classification model can effectively (with a mean accuracy≥89.12%) assist COVID-19 test prioritization in Brazil. The model can be applied to recommend the prioritizing of a patient who is symptomatic for COVID-19 testing.
format	Online Article Text
id	pubmed-8034680
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-80346802021-04-14 Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach Viana dos Santos Santana, Íris CM da Silveira, Andressa Sobrinho, Álvaro Chaves e Silva, Lenardo Dias da Silva, Leandro Santos, Danilo F S Gurjão, Edmar C Perkusich, Angelo J Med Internet Res Original Paper BACKGROUND: Controlling the COVID-19 outbreak in Brazil is a challenge due to the population’s size and urban density, inefficient maintenance of social distancing and testing strategies, and limited availability of testing resources. OBJECTIVE: The purpose of this study is to effectively prioritize patients who are symptomatic for testing to assist early COVID-19 detection in Brazil, addressing problems related to inefficient testing and control strategies. METHODS: Raw data from 55,676 Brazilians were preprocessed, and the chi-square test was used to confirm the relevance of the following features: gender, health professional, fever, sore throat, dyspnea, olfactory disorders, cough, coryza, taste disorders, and headache. Classification models were implemented relying on preprocessed data sets; supervised learning; and the algorithms multilayer perceptron (MLP), gradient boosting machine (GBM), decision tree (DT), random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbors (KNN), support vector machine (SVM), and logistic regression (LR). The models’ performances were analyzed using 10-fold cross-validation, classification metrics, and the Friedman and Nemenyi statistical tests. The permutation feature importance method was applied for ranking the features used by the classification models with the highest performances. RESULTS: Gender, fever, and dyspnea were among the highest-ranked features used by the classification models. The comparative analysis presents MLP, GBM, DT, RF, XGBoost, and SVM as the highest performance models with similar results. KNN and LR were outperformed by the other algorithms. Applying the easy interpretability as an additional comparison criterion, the DT was considered the most suitable model. CONCLUSIONS: The DT classification model can effectively (with a mean accuracy≥89.12%) assist COVID-19 test prioritization in Brazil. The model can be applied to recommend the prioritizing of a patient who is symptomatic for COVID-19 testing. JMIR Publications 2021-04-08 /pmc/articles/PMC8034680/ /pubmed/33750734 http://dx.doi.org/10.2196/27293 Text en ©Íris Viana dos Santos Santana, Andressa CM da Silveira, Álvaro Sobrinho, Lenardo Chaves e Silva, Leandro Dias da Silva, Danilo F S Santos, Edmar C Gurjão, Angelo Perkusich. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 08.04.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Viana dos Santos Santana, Íris CM da Silveira, Andressa Sobrinho, Álvaro Chaves e Silva, Lenardo Dias da Silva, Leandro Santos, Danilo F S Gurjão, Edmar C Perkusich, Angelo Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach
title	Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach
title_full	Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach
title_fullStr	Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach
title_full_unstemmed	Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach
title_short	Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach
title_sort	classification models for covid-19 test prioritization in brazil: machine learning approach
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8034680/ https://www.ncbi.nlm.nih.gov/pubmed/33750734 http://dx.doi.org/10.2196/27293
work_keys_str_mv	AT vianadossantossantanairis classificationmodelsforcovid19testprioritizationinbrazilmachinelearningapproach AT cmdasilveiraandressa classificationmodelsforcovid19testprioritizationinbrazilmachinelearningapproach AT sobrinhoalvaro classificationmodelsforcovid19testprioritizationinbrazilmachinelearningapproach AT chavesesilvalenardo classificationmodelsforcovid19testprioritizationinbrazilmachinelearningapproach AT diasdasilvaleandro classificationmodelsforcovid19testprioritizationinbrazilmachinelearningapproach AT santosdanilofs classificationmodelsforcovid19testprioritizationinbrazilmachinelearningapproach AT gurjaoedmarc classificationmodelsforcovid19testprioritizationinbrazilmachinelearningapproach AT perkusichangelo classificationmodelsforcovid19testprioritizationinbrazilmachinelearningapproach

Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach

Ejemplares similares