Cargando…
Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification
PURPOSE: Classification of medical data is essential to determine diabetic treatment options; therefore, the objective of the study was to develop a model to classify the three diabetes type diagnoses according to multiple patient attributes. METHODS: Three different datasets are used to develop a n...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Dove
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232854/ https://www.ncbi.nlm.nih.gov/pubmed/34188504 http://dx.doi.org/10.2147/DMSO.S312787 |
_version_ | 1783713725080928256 |
---|---|
author | P, Nagaraj P, Deepalakshmi Mansour, Romany F Almazroa, Ahmed |
author_facet | P, Nagaraj P, Deepalakshmi Mansour, Romany F Almazroa, Ahmed |
author_sort | P, Nagaraj |
collection | PubMed |
description | PURPOSE: Classification of medical data is essential to determine diabetic treatment options; therefore, the objective of the study was to develop a model to classify the three diabetes type diagnoses according to multiple patient attributes. METHODS: Three different datasets are used to develop a novel medical data classification model. The proposed model involved preprocessing, artificial flora algorithm (AFA)-based feature selection, and gradient boosted tree (GBT)-based classification. Then, the processing occurred in two steps, namely, format conversion and data transformation. AFA was applied for selecting features, such as demographics, vital signs, laboratory tests, medications, from the patients’ electronic health records. Lastly, the GBT-based classification model was applied for classifying the patients’ cases to type I, type II, or gestational diabetes mellitus. RESULTS: The effectiveness of the proposed AFA-GBT model was validated using three diabetes datasets to classify patient cases into one of the three different types of diabetes. The proposed model showed a maximum average precision of 91.64%, a recall of 97.46%, an accuracy of 99.93%, an F-score of 94.19%, and a kappa of 96.61%. CONCLUSION: The AFA-GBT model could classify patient diagnoses into the three diabetes types efficiently. |
format | Online Article Text |
id | pubmed-8232854 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Dove |
record_format | MEDLINE/PubMed |
spelling | pubmed-82328542021-06-28 Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification P, Nagaraj P, Deepalakshmi Mansour, Romany F Almazroa, Ahmed Diabetes Metab Syndr Obes Original Research PURPOSE: Classification of medical data is essential to determine diabetic treatment options; therefore, the objective of the study was to develop a model to classify the three diabetes type diagnoses according to multiple patient attributes. METHODS: Three different datasets are used to develop a novel medical data classification model. The proposed model involved preprocessing, artificial flora algorithm (AFA)-based feature selection, and gradient boosted tree (GBT)-based classification. Then, the processing occurred in two steps, namely, format conversion and data transformation. AFA was applied for selecting features, such as demographics, vital signs, laboratory tests, medications, from the patients’ electronic health records. Lastly, the GBT-based classification model was applied for classifying the patients’ cases to type I, type II, or gestational diabetes mellitus. RESULTS: The effectiveness of the proposed AFA-GBT model was validated using three diabetes datasets to classify patient cases into one of the three different types of diabetes. The proposed model showed a maximum average precision of 91.64%, a recall of 97.46%, an accuracy of 99.93%, an F-score of 94.19%, and a kappa of 96.61%. CONCLUSION: The AFA-GBT model could classify patient diagnoses into the three diabetes types efficiently. Dove 2021-06-21 /pmc/articles/PMC8232854/ /pubmed/34188504 http://dx.doi.org/10.2147/DMSO.S312787 Text en © 2021 P et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php). |
spellingShingle | Original Research P, Nagaraj P, Deepalakshmi Mansour, Romany F Almazroa, Ahmed Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification |
title | Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification |
title_full | Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification |
title_fullStr | Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification |
title_full_unstemmed | Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification |
title_short | Artificial Flora Algorithm-Based Feature Selection with Gradient Boosted Tree Model for Diabetes Classification |
title_sort | artificial flora algorithm-based feature selection with gradient boosted tree model for diabetes classification |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232854/ https://www.ncbi.nlm.nih.gov/pubmed/34188504 http://dx.doi.org/10.2147/DMSO.S312787 |
work_keys_str_mv | AT pnagaraj artificialfloraalgorithmbasedfeatureselectionwithgradientboostedtreemodelfordiabetesclassification AT pdeepalakshmi artificialfloraalgorithmbasedfeatureselectionwithgradientboostedtreemodelfordiabetesclassification AT mansourromanyf artificialfloraalgorithmbasedfeatureselectionwithgradientboostedtreemodelfordiabetesclassification AT almazroaahmed artificialfloraalgorithmbasedfeatureselectionwithgradientboostedtreemodelfordiabetesclassification |