Cargando…

Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods

Diabetes is a chronic disease that can cause several forms of chronic damage to the human body, including heart problems, kidney failure, depression, eye damage, and nerve damage. There are several risk factors involved in causing this disease, with some of the most common being obesity, age, insuli...

Descripción completa

Detalles Bibliográficos
Autores principales: Ullah, Zahid, Saleem, Farrukh, Jamjoom, Mona, Fakieh, Bahjat, Kateb, Faris, Ali, Abdullah Marish, Shah, Babar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536939/
https://www.ncbi.nlm.nih.gov/pubmed/36210985
http://dx.doi.org/10.1155/2022/2557795
_version_ 1784803085440253952
author Ullah, Zahid
Saleem, Farrukh
Jamjoom, Mona
Fakieh, Bahjat
Kateb, Faris
Ali, Abdullah Marish
Shah, Babar
author_facet Ullah, Zahid
Saleem, Farrukh
Jamjoom, Mona
Fakieh, Bahjat
Kateb, Faris
Ali, Abdullah Marish
Shah, Babar
author_sort Ullah, Zahid
collection PubMed
description Diabetes is a chronic disease that can cause several forms of chronic damage to the human body, including heart problems, kidney failure, depression, eye damage, and nerve damage. There are several risk factors involved in causing this disease, with some of the most common being obesity, age, insulin resistance, and hypertension. Therefore, early detection of these risk factors is vital in helping patients reverse diabetes from the early stage to live healthy lives. Machine learning (ML) is a useful tool that can easily detect diabetes from several risk factors and, based on the findings, provide a decision-based model that can help in diagnosing the disease. This study aims to detect the risk factors of diabetes using ML methods and to provide a decision support system for medical practitioners that can help them in diagnosing diabetes. Moreover, besides various other preprocessing steps, this study has used the synthetic minority over-sampling technique integrated with the edited nearest neighbor (SMOTE-ENN) method for balancing the BRFSS dataset. The SMOTE-ENN is a more powerful method than the individual SMOTE method. Several ML methods were applied to the processed BRFSS dataset and built prediction models for detecting the risk factors that can help in diagnosing diabetes patients in the early stage. The prediction models were evaluated using various measures that show the high performance of the models. The experimental results show the reliability of the proposed models, demonstrating that k-nearest neighbor (KNN) outperformed other methods with an accuracy of 98.38%, sensitivity, specificity, and ROC/AUC score of 98%. Moreover, compared with the existing state-of-the-art methods, the results confirm the efficacy of the proposed models in terms of accuracy and other evaluation measures. The use of SMOTE-ENN is more beneficial for balancing the dataset to build more accurate prediction models. This was the main reason it was possible to achieve models more accurate than the existing ones.
format Online
Article
Text
id pubmed-9536939
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-95369392022-10-07 Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods Ullah, Zahid Saleem, Farrukh Jamjoom, Mona Fakieh, Bahjat Kateb, Faris Ali, Abdullah Marish Shah, Babar Comput Intell Neurosci Research Article Diabetes is a chronic disease that can cause several forms of chronic damage to the human body, including heart problems, kidney failure, depression, eye damage, and nerve damage. There are several risk factors involved in causing this disease, with some of the most common being obesity, age, insulin resistance, and hypertension. Therefore, early detection of these risk factors is vital in helping patients reverse diabetes from the early stage to live healthy lives. Machine learning (ML) is a useful tool that can easily detect diabetes from several risk factors and, based on the findings, provide a decision-based model that can help in diagnosing the disease. This study aims to detect the risk factors of diabetes using ML methods and to provide a decision support system for medical practitioners that can help them in diagnosing diabetes. Moreover, besides various other preprocessing steps, this study has used the synthetic minority over-sampling technique integrated with the edited nearest neighbor (SMOTE-ENN) method for balancing the BRFSS dataset. The SMOTE-ENN is a more powerful method than the individual SMOTE method. Several ML methods were applied to the processed BRFSS dataset and built prediction models for detecting the risk factors that can help in diagnosing diabetes patients in the early stage. The prediction models were evaluated using various measures that show the high performance of the models. The experimental results show the reliability of the proposed models, demonstrating that k-nearest neighbor (KNN) outperformed other methods with an accuracy of 98.38%, sensitivity, specificity, and ROC/AUC score of 98%. Moreover, compared with the existing state-of-the-art methods, the results confirm the efficacy of the proposed models in terms of accuracy and other evaluation measures. The use of SMOTE-ENN is more beneficial for balancing the dataset to build more accurate prediction models. This was the main reason it was possible to achieve models more accurate than the existing ones. Hindawi 2022-09-29 /pmc/articles/PMC9536939/ /pubmed/36210985 http://dx.doi.org/10.1155/2022/2557795 Text en Copyright © 2022 Zahid Ullah et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ullah, Zahid
Saleem, Farrukh
Jamjoom, Mona
Fakieh, Bahjat
Kateb, Faris
Ali, Abdullah Marish
Shah, Babar
Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods
title Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods
title_full Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods
title_fullStr Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods
title_full_unstemmed Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods
title_short Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods
title_sort detecting high-risk factors and early diagnosis of diabetes using machine learning methods
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536939/
https://www.ncbi.nlm.nih.gov/pubmed/36210985
http://dx.doi.org/10.1155/2022/2557795
work_keys_str_mv AT ullahzahid detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods
AT saleemfarrukh detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods
AT jamjoommona detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods
AT fakiehbahjat detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods
AT katebfaris detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods
AT aliabdullahmarish detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods
AT shahbabar detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods