Cargando…

Performance Analysis of Conventional Machine Learning Algorithms for Identification of Chronic Kidney Disease in Type 1 Diabetes Mellitus Patients

Chronic kidney disease (CKD) is one of the severe side effects of type 1 diabetes mellitus (T1DM). However, the detection and diagnosis of CKD are often delayed because of its asymptomatic nature. In addition, patients often tend to bypass the traditional urine protein (urinary albumin)-based CKD de...

Descripción completa

Detalles Bibliográficos
Autores principales: Chowdhury, Nakib Hayat, Reaz, Mamun Bin Ibne, Haque, Fahmida, Ahmad, Shamim, Ali, Sawal Hamid Md, A Bakar, Ahmad Ashrif, Bhuiyan, Mohammad Arif Sobhan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700037/
https://www.ncbi.nlm.nih.gov/pubmed/34943504
http://dx.doi.org/10.3390/diagnostics11122267
_version_ 1784620659348865024
author Chowdhury, Nakib Hayat
Reaz, Mamun Bin Ibne
Haque, Fahmida
Ahmad, Shamim
Ali, Sawal Hamid Md
A Bakar, Ahmad Ashrif
Bhuiyan, Mohammad Arif Sobhan
author_facet Chowdhury, Nakib Hayat
Reaz, Mamun Bin Ibne
Haque, Fahmida
Ahmad, Shamim
Ali, Sawal Hamid Md
A Bakar, Ahmad Ashrif
Bhuiyan, Mohammad Arif Sobhan
author_sort Chowdhury, Nakib Hayat
collection PubMed
description Chronic kidney disease (CKD) is one of the severe side effects of type 1 diabetes mellitus (T1DM). However, the detection and diagnosis of CKD are often delayed because of its asymptomatic nature. In addition, patients often tend to bypass the traditional urine protein (urinary albumin)-based CKD detection test. Even though disease detection using machine learning (ML) is a well-established field of study, it is rarely used to diagnose CKD in T1DM patients. This research aimed to employ and evaluate several ML algorithms to develop models to quickly predict CKD in patients with T1DM using easily available routine checkup data. This study analyzed 16 years of data of 1375 T1DM patients, obtained from the Epidemiology of Diabetes Interventions and Complications (EDIC) clinical trials directed by the National Institute of Diabetes, Digestive, and Kidney Diseases, USA. Three data imputation techniques (RF, KNN, and MICE) and the SMOTETomek resampling technique were used to preprocess the primary dataset. Ten ML algorithms including logistic regression (LR), k-nearest neighbor (KNN), Gaussian naïve Bayes (GNB), support vector machine (SVM), stochastic gradient descent (SGD), decision tree (DT), gradient boosting (GB), random forest (RF), extreme gradient boosting (XGB), and light gradient-boosted machine (LightGBM) were applied to developed prediction models. Each model included 19 demographic, medical history, behavioral, and biochemical features, and every feature’s effect was ranked using three feature ranking techniques (XGB, RF, and Extra Tree). Lastly, each model’s ROC, sensitivity (recall), specificity, accuracy, precision, and F-1 score were estimated to find the best-performing model. The RF classifier model exhibited the best performance with 0.96 (±0.01) accuracy, 0.98 (±0.01) sensitivity, and 0.93 (±0.02) specificity. LightGBM performed second best and was quite close to RF with 0.95 (±0.06) accuracy. In addition to these two models, KNN, SVM, DT, GB, and XGB models also achieved more than 90% accuracy.
format Online
Article
Text
id pubmed-8700037
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87000372021-12-24 Performance Analysis of Conventional Machine Learning Algorithms for Identification of Chronic Kidney Disease in Type 1 Diabetes Mellitus Patients Chowdhury, Nakib Hayat Reaz, Mamun Bin Ibne Haque, Fahmida Ahmad, Shamim Ali, Sawal Hamid Md A Bakar, Ahmad Ashrif Bhuiyan, Mohammad Arif Sobhan Diagnostics (Basel) Article Chronic kidney disease (CKD) is one of the severe side effects of type 1 diabetes mellitus (T1DM). However, the detection and diagnosis of CKD are often delayed because of its asymptomatic nature. In addition, patients often tend to bypass the traditional urine protein (urinary albumin)-based CKD detection test. Even though disease detection using machine learning (ML) is a well-established field of study, it is rarely used to diagnose CKD in T1DM patients. This research aimed to employ and evaluate several ML algorithms to develop models to quickly predict CKD in patients with T1DM using easily available routine checkup data. This study analyzed 16 years of data of 1375 T1DM patients, obtained from the Epidemiology of Diabetes Interventions and Complications (EDIC) clinical trials directed by the National Institute of Diabetes, Digestive, and Kidney Diseases, USA. Three data imputation techniques (RF, KNN, and MICE) and the SMOTETomek resampling technique were used to preprocess the primary dataset. Ten ML algorithms including logistic regression (LR), k-nearest neighbor (KNN), Gaussian naïve Bayes (GNB), support vector machine (SVM), stochastic gradient descent (SGD), decision tree (DT), gradient boosting (GB), random forest (RF), extreme gradient boosting (XGB), and light gradient-boosted machine (LightGBM) were applied to developed prediction models. Each model included 19 demographic, medical history, behavioral, and biochemical features, and every feature’s effect was ranked using three feature ranking techniques (XGB, RF, and Extra Tree). Lastly, each model’s ROC, sensitivity (recall), specificity, accuracy, precision, and F-1 score were estimated to find the best-performing model. The RF classifier model exhibited the best performance with 0.96 (±0.01) accuracy, 0.98 (±0.01) sensitivity, and 0.93 (±0.02) specificity. LightGBM performed second best and was quite close to RF with 0.95 (±0.06) accuracy. In addition to these two models, KNN, SVM, DT, GB, and XGB models also achieved more than 90% accuracy. MDPI 2021-12-03 /pmc/articles/PMC8700037/ /pubmed/34943504 http://dx.doi.org/10.3390/diagnostics11122267 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chowdhury, Nakib Hayat
Reaz, Mamun Bin Ibne
Haque, Fahmida
Ahmad, Shamim
Ali, Sawal Hamid Md
A Bakar, Ahmad Ashrif
Bhuiyan, Mohammad Arif Sobhan
Performance Analysis of Conventional Machine Learning Algorithms for Identification of Chronic Kidney Disease in Type 1 Diabetes Mellitus Patients
title Performance Analysis of Conventional Machine Learning Algorithms for Identification of Chronic Kidney Disease in Type 1 Diabetes Mellitus Patients
title_full Performance Analysis of Conventional Machine Learning Algorithms for Identification of Chronic Kidney Disease in Type 1 Diabetes Mellitus Patients
title_fullStr Performance Analysis of Conventional Machine Learning Algorithms for Identification of Chronic Kidney Disease in Type 1 Diabetes Mellitus Patients
title_full_unstemmed Performance Analysis of Conventional Machine Learning Algorithms for Identification of Chronic Kidney Disease in Type 1 Diabetes Mellitus Patients
title_short Performance Analysis of Conventional Machine Learning Algorithms for Identification of Chronic Kidney Disease in Type 1 Diabetes Mellitus Patients
title_sort performance analysis of conventional machine learning algorithms for identification of chronic kidney disease in type 1 diabetes mellitus patients
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700037/
https://www.ncbi.nlm.nih.gov/pubmed/34943504
http://dx.doi.org/10.3390/diagnostics11122267
work_keys_str_mv AT chowdhurynakibhayat performanceanalysisofconventionalmachinelearningalgorithmsforidentificationofchronickidneydiseaseintype1diabetesmellituspatients
AT reazmamunbinibne performanceanalysisofconventionalmachinelearningalgorithmsforidentificationofchronickidneydiseaseintype1diabetesmellituspatients
AT haquefahmida performanceanalysisofconventionalmachinelearningalgorithmsforidentificationofchronickidneydiseaseintype1diabetesmellituspatients
AT ahmadshamim performanceanalysisofconventionalmachinelearningalgorithmsforidentificationofchronickidneydiseaseintype1diabetesmellituspatients
AT alisawalhamidmd performanceanalysisofconventionalmachinelearningalgorithmsforidentificationofchronickidneydiseaseintype1diabetesmellituspatients
AT abakarahmadashrif performanceanalysisofconventionalmachinelearningalgorithmsforidentificationofchronickidneydiseaseintype1diabetesmellituspatients
AT bhuiyanmohammadarifsobhan performanceanalysisofconventionalmachinelearningalgorithmsforidentificationofchronickidneydiseaseintype1diabetesmellituspatients