Cargando…
A multi-class classification model for supporting the diagnosis of type II diabetes mellitus
BACKGROUND: Numerous studies have utilized machine-learning techniques to predict the early onset of type 2 diabetes mellitus. However, fewer studies have been conducted to predict an appropriate diagnosis code for the type 2 diabetes mellitus condition. Further, ensemble techniques such as bagging...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7487151/ https://www.ncbi.nlm.nih.gov/pubmed/32974105 http://dx.doi.org/10.7717/peerj.9920 |
_version_ | 1783581433528320000 |
---|---|
author | Kuo, Kuang-Ming Talley, Paul Kao, YuHsi Huang, Chi Hsien |
author_facet | Kuo, Kuang-Ming Talley, Paul Kao, YuHsi Huang, Chi Hsien |
author_sort | Kuo, Kuang-Ming |
collection | PubMed |
description | BACKGROUND: Numerous studies have utilized machine-learning techniques to predict the early onset of type 2 diabetes mellitus. However, fewer studies have been conducted to predict an appropriate diagnosis code for the type 2 diabetes mellitus condition. Further, ensemble techniques such as bagging and boosting have likewise been utilized to an even lesser extent. The present study aims to identify appropriate diagnosis codes for type 2 diabetes mellitus patients by means of building a multi-class prediction model which is both parsimonious and possessing minimum features. In addition, the importance of features for predicting diagnose code is provided. METHODS: This study included 149 patients who have contracted type 2 diabetes mellitus. The sample was collected from a large hospital in Taiwan from November, 2017 to May, 2018. Machine learning algorithms including instance-based, decision trees, deep neural network, and ensemble algorithms were all used to build the predictive models utilized in this study. Average accuracy, area under receiver operating characteristic curve, Matthew correlation coefficient, macro-precision, recall, weighted average of precision and recall, and model process time were subsequently used to assess the performance of the built models. Information gain and gain ratio were used in order to demonstrate feature importance. RESULTS: The results showed that most algorithms, except for deep neural network, performed well in terms of all performance indices regardless of either the training or testing dataset that were used. Ten features and their importance to determine the diagnosis code of type 2 diabetes mellitus were identified. Our proposed predictive model can be further developed into a clinical diagnosis support system or integrated into existing healthcare information systems. Both methods of application can effectively support physicians whenever they are diagnosing type 2 diabetes mellitus patients in order to foster better patient-care planning. |
format | Online Article Text |
id | pubmed-7487151 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-74871512020-09-23 A multi-class classification model for supporting the diagnosis of type II diabetes mellitus Kuo, Kuang-Ming Talley, Paul Kao, YuHsi Huang, Chi Hsien PeerJ Diabetes and Endocrinology BACKGROUND: Numerous studies have utilized machine-learning techniques to predict the early onset of type 2 diabetes mellitus. However, fewer studies have been conducted to predict an appropriate diagnosis code for the type 2 diabetes mellitus condition. Further, ensemble techniques such as bagging and boosting have likewise been utilized to an even lesser extent. The present study aims to identify appropriate diagnosis codes for type 2 diabetes mellitus patients by means of building a multi-class prediction model which is both parsimonious and possessing minimum features. In addition, the importance of features for predicting diagnose code is provided. METHODS: This study included 149 patients who have contracted type 2 diabetes mellitus. The sample was collected from a large hospital in Taiwan from November, 2017 to May, 2018. Machine learning algorithms including instance-based, decision trees, deep neural network, and ensemble algorithms were all used to build the predictive models utilized in this study. Average accuracy, area under receiver operating characteristic curve, Matthew correlation coefficient, macro-precision, recall, weighted average of precision and recall, and model process time were subsequently used to assess the performance of the built models. Information gain and gain ratio were used in order to demonstrate feature importance. RESULTS: The results showed that most algorithms, except for deep neural network, performed well in terms of all performance indices regardless of either the training or testing dataset that were used. Ten features and their importance to determine the diagnosis code of type 2 diabetes mellitus were identified. Our proposed predictive model can be further developed into a clinical diagnosis support system or integrated into existing healthcare information systems. Both methods of application can effectively support physicians whenever they are diagnosing type 2 diabetes mellitus patients in order to foster better patient-care planning. PeerJ Inc. 2020-09-10 /pmc/articles/PMC7487151/ /pubmed/32974105 http://dx.doi.org/10.7717/peerj.9920 Text en © 2020 Kuo et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Diabetes and Endocrinology Kuo, Kuang-Ming Talley, Paul Kao, YuHsi Huang, Chi Hsien A multi-class classification model for supporting the diagnosis of type II diabetes mellitus |
title | A multi-class classification model for supporting the diagnosis of type II diabetes mellitus |
title_full | A multi-class classification model for supporting the diagnosis of type II diabetes mellitus |
title_fullStr | A multi-class classification model for supporting the diagnosis of type II diabetes mellitus |
title_full_unstemmed | A multi-class classification model for supporting the diagnosis of type II diabetes mellitus |
title_short | A multi-class classification model for supporting the diagnosis of type II diabetes mellitus |
title_sort | multi-class classification model for supporting the diagnosis of type ii diabetes mellitus |
topic | Diabetes and Endocrinology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7487151/ https://www.ncbi.nlm.nih.gov/pubmed/32974105 http://dx.doi.org/10.7717/peerj.9920 |
work_keys_str_mv | AT kuokuangming amulticlassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus AT talleypaul amulticlassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus AT kaoyuhsi amulticlassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus AT huangchihsien amulticlassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus AT kuokuangming multiclassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus AT talleypaul multiclassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus AT kaoyuhsi multiclassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus AT huangchihsien multiclassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus |