Cargando…

Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification

PURPOSE: Classification of medical data is essential to determine diabetic treatment options; therefore, the objective of the study was to develop a model to classify the three diabetes type diagnoses according to multiple patient attributes. METHODS: Three different datasets are used to develop a n...

Descripción completa

Detalles Bibliográficos
Autores principales: P, Nagaraj, P, Deepalakshmi, Mansour, Romany F, Almazroa, Ahmed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232854/
https://www.ncbi.nlm.nih.gov/pubmed/34188504
http://dx.doi.org/10.2147/DMSO.S312787
_version_ 1783713725080928256
author P, Nagaraj
P, Deepalakshmi
Mansour, Romany F
Almazroa, Ahmed
author_facet P, Nagaraj
P, Deepalakshmi
Mansour, Romany F
Almazroa, Ahmed
author_sort P, Nagaraj
collection PubMed
description PURPOSE: Classification of medical data is essential to determine diabetic treatment options; therefore, the objective of the study was to develop a model to classify the three diabetes type diagnoses according to multiple patient attributes. METHODS: Three different datasets are used to develop a novel medical data classification model. The proposed model involved preprocessing, artificial flora algorithm (AFA)-based feature selection, and gradient boosted tree (GBT)-based classification. Then, the processing occurred in two steps, namely, format conversion and data transformation. AFA was applied for selecting features, such as demographics, vital signs, laboratory tests, medications, from the patients’ electronic health records. Lastly, the GBT-based classification model was applied for classifying the patients’ cases to type I, type II, or gestational diabetes mellitus. RESULTS: The effectiveness of the proposed AFA-GBT model was validated using three diabetes datasets to classify patient cases into one of the three different types of diabetes. The proposed model showed a maximum average precision of 91.64%, a recall of 97.46%, an accuracy of 99.93%, an F-score of 94.19%, and a kappa of 96.61%. CONCLUSION: The AFA-GBT model could classify patient diagnoses into the three diabetes types efficiently.
format Online
Article
Text
id pubmed-8232854
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-82328542021-06-28 Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification P, Nagaraj P, Deepalakshmi Mansour, Romany F Almazroa, Ahmed Diabetes Metab Syndr Obes Original Research PURPOSE: Classification of medical data is essential to determine diabetic treatment options; therefore, the objective of the study was to develop a model to classify the three diabetes type diagnoses according to multiple patient attributes. METHODS: Three different datasets are used to develop a novel medical data classification model. The proposed model involved preprocessing, artificial flora algorithm (AFA)-based feature selection, and gradient boosted tree (GBT)-based classification. Then, the processing occurred in two steps, namely, format conversion and data transformation. AFA was applied for selecting features, such as demographics, vital signs, laboratory tests, medications, from the patients’ electronic health records. Lastly, the GBT-based classification model was applied for classifying the patients’ cases to type I, type II, or gestational diabetes mellitus. RESULTS: The effectiveness of the proposed AFA-GBT model was validated using three diabetes datasets to classify patient cases into one of the three different types of diabetes. The proposed model showed a maximum average precision of 91.64%, a recall of 97.46%, an accuracy of 99.93%, an F-score of 94.19%, and a kappa of 96.61%. CONCLUSION: The AFA-GBT model could classify patient diagnoses into the three diabetes types efficiently. Dove 2021-06-21 /pmc/articles/PMC8232854/ /pubmed/34188504 http://dx.doi.org/10.2147/DMSO.S312787 Text en © 2021 P et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
P, Nagaraj
P, Deepalakshmi
Mansour, Romany F
Almazroa, Ahmed
Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification
title Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification
title_full Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification
title_fullStr Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification
title_full_unstemmed Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification
title_short Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification
title_sort artificial flora algorithm-based feature selection with gradient boosted tree model for diabetes classification
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232854/
https://www.ncbi.nlm.nih.gov/pubmed/34188504
http://dx.doi.org/10.2147/DMSO.S312787
work_keys_str_mv AT pnagaraj artificialfloraalgorithmbasedfeatureselectionwithgradientboostedtreemodelfordiabetesclassification
AT pdeepalakshmi artificialfloraalgorithmbasedfeatureselectionwithgradientboostedtreemodelfordiabetesclassification
AT mansourromanyf artificialfloraalgorithmbasedfeatureselectionwithgradientboostedtreemodelfordiabetesclassification
AT almazroaahmed artificialfloraalgorithmbasedfeatureselectionwithgradientboostedtreemodelfordiabetesclassification