Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods
Diabetes is a chronic disease that can cause several forms of chronic damage to the human body, including heart problems, kidney failure, depression, eye damage, and nerve damage. There are several risk factors involved in causing this disease, with some of the most common being obesity, age, insuli...
Main Authors: | Ullah, Zahid; Saleem, Farrukh; Jamjoom, Mona; Fakieh, Bahjat; Kateb, Faris; Ali, Abdullah Marish; Shah, Babar |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Hindawi 2022 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536939/ https://www.ncbi.nlm.nih.gov/pubmed/36210985 http://dx.doi.org/10.1155/2022/2557795 |
_version_ | 1784803085440253952 |
---|---|
author | Ullah, Zahid; Saleem, Farrukh; Jamjoom, Mona; Fakieh, Bahjat; Kateb, Faris; Ali, Abdullah Marish; Shah, Babar |
author_facet | Ullah, Zahid; Saleem, Farrukh; Jamjoom, Mona; Fakieh, Bahjat; Kateb, Faris; Ali, Abdullah Marish; Shah, Babar |
author_sort | Ullah, Zahid |
collection | PubMed |
description | Diabetes is a chronic disease that can cause several forms of chronic damage to the human body, including heart problems, kidney failure, depression, eye damage, and nerve damage. There are several risk factors involved in causing this disease, with some of the most common being obesity, age, insulin resistance, and hypertension. Therefore, early detection of these risk factors is vital in helping patients reverse diabetes at an early stage and live healthy lives. Machine learning (ML) is a useful tool that can detect diabetes from several risk factors and, based on the findings, provide a decision-based model that can help in diagnosing the disease. This study aims to detect the risk factors of diabetes using ML methods and to provide a decision support system for medical practitioners that can help them in diagnosing diabetes. Besides various other preprocessing steps, this study used the synthetic minority over-sampling technique integrated with the edited nearest neighbor (SMOTE-ENN) method to balance the BRFSS dataset. SMOTE-ENN is a more powerful method than SMOTE alone. Several ML methods were applied to the processed BRFSS dataset to build prediction models for detecting the risk factors that can help in diagnosing diabetes patients at an early stage. The prediction models were evaluated using various measures that show the high performance of the models. The experimental results show the reliability of the proposed models, demonstrating that k-nearest neighbor (KNN) outperformed the other methods with an accuracy of 98.38% and sensitivity, specificity, and ROC/AUC scores of 98%. Moreover, compared with the existing state-of-the-art methods, the results confirm the efficacy of the proposed models in terms of accuracy and other evaluation measures. The use of SMOTE-ENN is more beneficial for balancing the dataset to build more accurate prediction models. This was the main reason it was possible to achieve models more accurate than the existing ones. |
format | Online Article Text |
id | pubmed-9536939 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-9536939 2022-10-07 Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods Ullah, Zahid; Saleem, Farrukh; Jamjoom, Mona; Fakieh, Bahjat; Kateb, Faris; Ali, Abdullah Marish; Shah, Babar Comput Intell Neurosci Research Article Hindawi 2022-09-29 /pmc/articles/PMC9536939/ /pubmed/36210985 http://dx.doi.org/10.1155/2022/2557795 Text en Copyright © 2022 Zahid Ullah et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Ullah, Zahid Saleem, Farrukh Jamjoom, Mona Fakieh, Bahjat Kateb, Faris Ali, Abdullah Marish Shah, Babar Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods |
title | Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods |
title_full | Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods |
title_fullStr | Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods |
title_full_unstemmed | Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods |
title_short | Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods |
title_sort | detecting high-risk factors and early diagnosis of diabetes using machine learning methods |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536939/ https://www.ncbi.nlm.nih.gov/pubmed/36210985 http://dx.doi.org/10.1155/2022/2557795 |
work_keys_str_mv | AT ullahzahid detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods AT saleemfarrukh detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods AT jamjoommona detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods AT fakiehbahjat detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods AT katebfaris detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods AT aliabdullahmarish detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods AT shahbabar detectinghighriskfactorsandearlydiagnosisofdiabetesusingmachinelearningmethods |
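
Note on SMOTE-ENN: the description above names SMOTE-ENN as the balancing step but does not spell out how it works. The sketch below is one minimal, pure-Python reading of the general technique, not the authors' implementation and not the BRFSS pipeline. It assumes two classes, Euclidean distance, and a majority-vote ENN rule; the function names (`smote`, `enn`, `smote_enn`) and the toy points are hypothetical and chosen only for illustration.

```python
import math
import random
from collections import Counter


def smote(minority, n_new, k=3, rng=None):
    """Create n_new synthetic points, each interpolated between a minority
    sample and one of its k nearest minority neighbours.
    Assumes at least two minority samples."""
    rng = rng or random.Random(0)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        others = minority[:i] + minority[i + 1:]
        nearest = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(nearest)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(tuple(b + gap * (n - b) for b, n in zip(base, neighbour)))
    return synthetic


def enn(samples, labels, k=3):
    """Edited nearest neighbours (majority-vote variant): drop every sample
    whose label disagrees with the most common label among its k nearest
    neighbours."""
    pairs = list(zip(samples, labels))
    kept = []
    for i, (x, y) in enumerate(pairs):
        rest = pairs[:i] + pairs[i + 1:]
        nearest = sorted(rest, key=lambda sl: math.dist(x, sl[0]))[:k]
        vote = Counter(label for _, label in nearest).most_common(1)[0][0]
        if vote == y:
            kept.append((x, y))
    return [s for s, _ in kept], [l for _, l in kept]


def smote_enn(samples, labels, minority_label, k=3, seed=0):
    """Oversample the minority class to parity with SMOTE, then clean the
    combined set with ENN. Binary labels are assumed."""
    minority = [s for s, l in zip(samples, labels) if l == minority_label]
    n_new = max(len(samples) - 2 * len(minority), 0)
    new_points = smote(minority, n_new, k, random.Random(seed))
    all_samples = list(samples) + new_points
    all_labels = list(labels) + [minority_label] * len(new_points)
    return enn(all_samples, all_labels, k)


# Toy data (hypothetical): class 0 is the majority, class 1 the minority.
# The last class-0 point sits inside the class-1 cluster and acts as noise.
X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (0.3, 0.3), (0.2, 0.3), (0.1, 0.1),
     (2.1, 2.1),
     (2.0, 2.0), (2.2, 2.1), (2.1, 2.3)]
y = [0, 0, 0, 0, 0, 0, 0, 1, 1, 1]

X_bal, y_bal = smote_enn(X, y, minority_label=1)
print(Counter(y_bal))
```

On this toy set, SMOTE adds four synthetic class-1 points so that both classes have seven samples, and ENN then removes the class-0 point lying inside the class-1 cluster. Library implementations can differ in detail, for example in the neighbour-agreement rule used by ENN, so results on real data should not be expected to match this sketch exactly.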