Cargando…
An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction
The number of diabetic patients is increasing yearly worldwide, requiring the need for a quick intervention to help these people. Mortality rates are higher for diabetic patients with other serious health complications. Thus, early prediction for such diseases positively impacts healthcare quality a...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602561/ https://www.ncbi.nlm.nih.gov/pubmed/36292517 http://dx.doi.org/10.3390/healthcare10102070 |
_version_ | 1784817348965826560 |
---|---|
author | Khafaga, Doaa Sami Alharbi, Amal H. Mohamed, Israa Hosny, Khalid M. |
author_facet | Khafaga, Doaa Sami Alharbi, Amal H. Mohamed, Israa Hosny, Khalid M. |
author_sort | Khafaga, Doaa Sami |
collection | PubMed |
description | The number of diabetic patients is increasing yearly worldwide, requiring the need for a quick intervention to help these people. Mortality rates are higher for diabetic patients with other serious health complications. Thus, early prediction for such diseases positively impacts healthcare quality and can prevent serious health complications later. This paper constructs an efficient prediction system for predicting diabetes in its early stage. The proposed system starts with a Local Outlier Factor (LOF)-based outlier detection technique to detect outlier data. A Balanced Bagging Classifier (BBC) technique is used to balance data distribution. Finally, integration between association rules and classification algorithms is used to develop a prediction model based on real data. Four classification algorithms were utilized in addition to an a priori algorithm that discovered relationships between various factors. The named algorithms are Artificial Neural Network (ANN), Decision Trees (DT), Support Vector Machines (SVM), and K Nearest Neighbor (KNN) for data classification. Results revealed that KNN provided the highest accuracy of 97.36% compared to the other applied algorithms. An a priori algorithm extracted association rules based on the Lift matrix. Four association rules from 12 attributes with the highest correlation and information gain scores relative to the class attribute were produced. |
format | Online Article Text |
id | pubmed-9602561 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96025612022-10-27 An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction Khafaga, Doaa Sami Alharbi, Amal H. Mohamed, Israa Hosny, Khalid M. Healthcare (Basel) Article The number of diabetic patients is increasing yearly worldwide, requiring the need for a quick intervention to help these people. Mortality rates are higher for diabetic patients with other serious health complications. Thus, early prediction for such diseases positively impacts healthcare quality and can prevent serious health complications later. This paper constructs an efficient prediction system for predicting diabetes in its early stage. The proposed system starts with a Local Outlier Factor (LOF)-based outlier detection technique to detect outlier data. A Balanced Bagging Classifier (BBC) technique is used to balance data distribution. Finally, integration between association rules and classification algorithms is used to develop a prediction model based on real data. Four classification algorithms were utilized in addition to an a priori algorithm that discovered relationships between various factors. The named algorithms are Artificial Neural Network (ANN), Decision Trees (DT), Support Vector Machines (SVM), and K Nearest Neighbor (KNN) for data classification. Results revealed that KNN provided the highest accuracy of 97.36% compared to the other applied algorithms. An a priori algorithm extracted association rules based on the Lift matrix. Four association rules from 12 attributes with the highest correlation and information gain scores relative to the class attribute were produced. MDPI 2022-10-18 /pmc/articles/PMC9602561/ /pubmed/36292517 http://dx.doi.org/10.3390/healthcare10102070 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Khafaga, Doaa Sami Alharbi, Amal H. Mohamed, Israa Hosny, Khalid M. An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction |
title | An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction |
title_full | An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction |
title_fullStr | An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction |
title_full_unstemmed | An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction |
title_short | An Integrated Classification and Association Rule Technique for Early-Stage Diabetes Risk Prediction |
title_sort | integrated classification and association rule technique for early-stage diabetes risk prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602561/ https://www.ncbi.nlm.nih.gov/pubmed/36292517 http://dx.doi.org/10.3390/healthcare10102070 |
work_keys_str_mv | AT khafagadoaasami anintegratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction AT alharbiamalh anintegratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction AT mohamedisraa anintegratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction AT hosnykhalidm anintegratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction AT khafagadoaasami integratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction AT alharbiamalh integratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction AT mohamedisraa integratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction AT hosnykhalidm integratedclassificationandassociationruletechniqueforearlystagediabetesriskprediction |