Cargando…

An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction

The number of diabetic patients is increasing yearly worldwide, requiring the need for a quick intervention to help these people. Mortality rates are higher for diabetic patients with other serious health complications. Thus, early prediction for such diseases positively impacts healthcare quality a...

Descripción completa

Detalles Bibliográficos
Autores principales: Khafaga, Doaa Sami, Alharbi, Amal H., Mohamed, Israa, Hosny, Khalid M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602561/
https://www.ncbi.nlm.nih.gov/pubmed/36292517
http://dx.doi.org/10.3390/healthcare10102070
_version_ 1784817348965826560
author Khafaga, Doaa Sami
Alharbi, Amal H.
Mohamed, Israa
Hosny, Khalid M.
author_facet Khafaga, Doaa Sami
Alharbi, Amal H.
Mohamed, Israa
Hosny, Khalid M.
author_sort Khafaga, Doaa Sami
collection PubMed
description The number of diabetic patients is increasing yearly worldwide, requiring the need for a quick intervention to help these people. Mortality rates are higher for diabetic patients with other serious health complications. Thus, early prediction for such diseases positively impacts healthcare quality and can prevent serious health complications later. This paper constructs an efficient prediction system for predicting diabetes in its early stage. The proposed system starts with a Local Outlier Factor (LOF)-based outlier detection technique to detect outlier data. A Balanced Bagging Classifier (BBC) technique is used to balance data distribution. Finally, integration between association rules and classification algorithms is used to develop a prediction model based on real data. Four classification algorithms were utilized in addition to an a priori algorithm that discovered relationships between various factors. The named algorithms are Artificial Neural Network (ANN), Decision Trees (DT), Support Vector Machines (SVM), and K Nearest Neighbor (KNN) for data classification. Results revealed that KNN provided the highest accuracy of 97.36% compared to the other applied algorithms. An a priori algorithm extracted association rules based on the Lift matrix. Four association rules from 12 attributes with the highest correlation and information gain scores relative to the class attribute were produced.
format Online
Article
Text
id pubmed-9602561
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96025612022-10-27 An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction Khafaga, Doaa Sami Alharbi, Amal H. Mohamed, Israa Hosny, Khalid M. Healthcare (Basel) Article The number of diabetic patients is increasing yearly worldwide, requiring the need for a quick intervention to help these people. Mortality rates are higher for diabetic patients with other serious health complications. Thus, early prediction for such diseases positively impacts healthcare quality and can prevent serious health complications later. This paper constructs an efficient prediction system for predicting diabetes in its early stage. The proposed system starts with a Local Outlier Factor (LOF)-based outlier detection technique to detect outlier data. A Balanced Bagging Classifier (BBC) technique is used to balance data distribution. Finally, integration between association rules and classification algorithms is used to develop a prediction model based on real data. Four classification algorithms were utilized in addition to an a priori algorithm that discovered relationships between various factors. The named algorithms are Artificial Neural Network (ANN), Decision Trees (DT), Support Vector Machines (SVM), and K Nearest Neighbor (KNN) for data classification. Results revealed that KNN provided the highest accuracy of 97.36% compared to the other applied algorithms. An a priori algorithm extracted association rules based on the Lift matrix. Four association rules from 12 attributes with the highest correlation and information gain scores relative to the class attribute were produced. MDPI 2022-10-18 /pmc/articles/PMC9602561/ /pubmed/36292517 http://dx.doi.org/10.3390/healthcare10102070 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Khafaga, Doaa Sami
Alharbi, Amal H.
Mohamed, Israa
Hosny, Khalid M.
An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction
title An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction
title_full An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction
title_fullStr An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction
title_full_unstemmed An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction
title_short An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction
title_sort integrated classification and association rule technique for early-stage diabetes risk prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602561/
https://www.ncbi.nlm.nih.gov/pubmed/36292517
http://dx.doi.org/10.3390/healthcare10102070
work_keys_str_mv AT khafagadoaasami anintegratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction
AT alharbiamalh anintegratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction
AT mohamedisraa anintegratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction
AT hosnykhalidm anintegratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction
AT khafagadoaasami integratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction
AT alharbiamalh integratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction
AT mohamedisraa integratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction
AT hosnykhalidm integratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction