Cargando…

Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease

To diagnose an illness in healthcare, doctors typically conduct physical exams and review the patient's medical history, followed by diagnostic tests and procedures to determine the underlying cause of symptoms. Chronic kidney disease (CKD) is currently the leading cause of death, with a rapidl...

Descripción completa

Detalles Bibliográficos
Autores principales: Khalid, Hira, Khan, Ajab, Zahid Khan, Muhammad, Mehmood, Gulzar, Shuaib Qureshi, Muhammad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10030216/
https://www.ncbi.nlm.nih.gov/pubmed/36959840
http://dx.doi.org/10.1155/2023/9266889
_version_ 1784910307558162432
author Khalid, Hira
Khan, Ajab
Zahid Khan, Muhammad
Mehmood, Gulzar
Shuaib Qureshi, Muhammad
author_facet Khalid, Hira
Khan, Ajab
Zahid Khan, Muhammad
Mehmood, Gulzar
Shuaib Qureshi, Muhammad
author_sort Khalid, Hira
collection PubMed
description To diagnose an illness in healthcare, doctors typically conduct physical exams and review the patient's medical history, followed by diagnostic tests and procedures to determine the underlying cause of symptoms. Chronic kidney disease (CKD) is currently the leading cause of death, with a rapidly increasing number of patients, resulting in 1.7 million deaths annually. While various diagnostic methods are available, this study utilizes machine learning due to its high accuracy. In this study, we have used the hybrid technique to build our proposed model. In our proposed model, we have used the Pearson correlation for feature selection. In the first step, the best models were selected on the basis of critical literature analysis. In the second step, the combination of these models is used in our proposed hybrid model. Gaussian Naïve Bayes, gradient boosting, and decision tree classifier are used as a base classifier, and the random forest classifier is used as a meta-classifier in the proposed hybrid model. The objective of this study is to evaluate the best machine learning classification techniques and identify the best-used machine learning classifier in terms of accuracy. This provides a solution for overfitting and achieves the highest accuracy. It also highlights some of the challenges that affect the result of better performance. In this study, we critically review the existing available machine learning classification techniques. We evaluate in terms of accuracy, and a comprehensive analytical evaluation of the related work is presented with a tabular system. In implementation, we have used the top four models and built a hybrid model using UCI chronic kidney disease dataset for prediction. Gradient boosting achieves around 99% accuracy, random forest achieves 98%, decision tree classifier achieves 96% accuracy, and our proposed hybrid model performs best getting 100% accuracy on the same dataset. Some of the main machine learning algorithms used to predict the occurrence of CKD are Naïve Bayes, decision tree, K-nearest neighbor, random forest, support vector machine, LDA, GB, and neural network. In this study, we apply GB (gradient boosting), Gaussian Naïve Bayes, and decision tree along with random forest on the same set of features and compare the accuracy score.
format Online
Article
Text
id pubmed-10030216
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-100302162023-03-22 Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease Khalid, Hira Khan, Ajab Zahid Khan, Muhammad Mehmood, Gulzar Shuaib Qureshi, Muhammad Comput Intell Neurosci Research Article To diagnose an illness in healthcare, doctors typically conduct physical exams and review the patient's medical history, followed by diagnostic tests and procedures to determine the underlying cause of symptoms. Chronic kidney disease (CKD) is currently the leading cause of death, with a rapidly increasing number of patients, resulting in 1.7 million deaths annually. While various diagnostic methods are available, this study utilizes machine learning due to its high accuracy. In this study, we have used the hybrid technique to build our proposed model. In our proposed model, we have used the Pearson correlation for feature selection. In the first step, the best models were selected on the basis of critical literature analysis. In the second step, the combination of these models is used in our proposed hybrid model. Gaussian Naïve Bayes, gradient boosting, and decision tree classifier are used as a base classifier, and the random forest classifier is used as a meta-classifier in the proposed hybrid model. The objective of this study is to evaluate the best machine learning classification techniques and identify the best-used machine learning classifier in terms of accuracy. This provides a solution for overfitting and achieves the highest accuracy. It also highlights some of the challenges that affect the result of better performance. In this study, we critically review the existing available machine learning classification techniques. We evaluate in terms of accuracy, and a comprehensive analytical evaluation of the related work is presented with a tabular system. In implementation, we have used the top four models and built a hybrid model using UCI chronic kidney disease dataset for prediction. Gradient boosting achieves around 99% accuracy, random forest achieves 98%, decision tree classifier achieves 96% accuracy, and our proposed hybrid model performs best getting 100% accuracy on the same dataset. Some of the main machine learning algorithms used to predict the occurrence of CKD are Naïve Bayes, decision tree, K-nearest neighbor, random forest, support vector machine, LDA, GB, and neural network. In this study, we apply GB (gradient boosting), Gaussian Naïve Bayes, and decision tree along with random forest on the same set of features and compare the accuracy score. Hindawi 2023-03-14 /pmc/articles/PMC10030216/ /pubmed/36959840 http://dx.doi.org/10.1155/2023/9266889 Text en Copyright © 2023 Hira Khalid et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Khalid, Hira
Khan, Ajab
Zahid Khan, Muhammad
Mehmood, Gulzar
Shuaib Qureshi, Muhammad
Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease
title Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease
title_full Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease
title_fullStr Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease
title_full_unstemmed Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease
title_short Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease
title_sort machine learning hybrid model for the prediction of chronic kidney disease
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10030216/
https://www.ncbi.nlm.nih.gov/pubmed/36959840
http://dx.doi.org/10.1155/2023/9266889
work_keys_str_mv AT khalidhira machinelearninghybridmodelforthepredictionofchronickidneydisease
AT khanajab machinelearninghybridmodelforthepredictionofchronickidneydisease
AT zahidkhanmuhammad machinelearninghybridmodelforthepredictionofchronickidneydisease
AT mehmoodgulzar machinelearninghybridmodelforthepredictionofchronickidneydisease
AT shuaibqureshimuhammad machinelearninghybridmodelforthepredictionofchronickidneydisease