Cargando…

A multi-class classification model for supporting the diagnosis of type II diabetes mellitus

BACKGROUND: Numerous studies have utilized machine-learning techniques to predict the early onset of type 2 diabetes mellitus. However, fewer studies have been conducted to predict an appropriate diagnosis code for the type 2 diabetes mellitus condition. Further, ensemble techniques such as bagging...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuo, Kuang-Ming, Talley, Paul, Kao, YuHsi, Huang, Chi Hsien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7487151/
https://www.ncbi.nlm.nih.gov/pubmed/32974105
http://dx.doi.org/10.7717/peerj.9920
_version_ 1783581433528320000
author Kuo, Kuang-Ming
Talley, Paul
Kao, YuHsi
Huang, Chi Hsien
author_facet Kuo, Kuang-Ming
Talley, Paul
Kao, YuHsi
Huang, Chi Hsien
author_sort Kuo, Kuang-Ming
collection PubMed
description BACKGROUND: Numerous studies have utilized machine-learning techniques to predict the early onset of type 2 diabetes mellitus. However, fewer studies have been conducted to predict an appropriate diagnosis code for the type 2 diabetes mellitus condition. Further, ensemble techniques such as bagging and boosting have likewise been utilized to an even lesser extent. The present study aims to identify appropriate diagnosis codes for type 2 diabetes mellitus patients by means of building a multi-class prediction model which is both parsimonious and possessing minimum features. In addition, the importance of features for predicting diagnose code is provided. METHODS: This study included 149 patients who have contracted type 2 diabetes mellitus. The sample was collected from a large hospital in Taiwan from November, 2017 to May, 2018. Machine learning algorithms including instance-based, decision trees, deep neural network, and ensemble algorithms were all used to build the predictive models utilized in this study. Average accuracy, area under receiver operating characteristic curve, Matthew correlation coefficient, macro-precision, recall, weighted average of precision and recall, and model process time were subsequently used to assess the performance of the built models. Information gain and gain ratio were used in order to demonstrate feature importance. RESULTS: The results showed that most algorithms, except for deep neural network, performed well in terms of all performance indices regardless of either the training or testing dataset that were used. Ten features and their importance to determine the diagnosis code of type 2 diabetes mellitus were identified. Our proposed predictive model can be further developed into a clinical diagnosis support system or integrated into existing healthcare information systems. Both methods of application can effectively support physicians whenever they are diagnosing type 2 diabetes mellitus patients in order to foster better patient-care planning.
format Online
Article
Text
id pubmed-7487151
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-74871512020-09-23 A multi-class classification model for supporting the diagnosis of type II diabetes mellitus Kuo, Kuang-Ming Talley, Paul Kao, YuHsi Huang, Chi Hsien PeerJ Diabetes and Endocrinology BACKGROUND: Numerous studies have utilized machine-learning techniques to predict the early onset of type 2 diabetes mellitus. However, fewer studies have been conducted to predict an appropriate diagnosis code for the type 2 diabetes mellitus condition. Further, ensemble techniques such as bagging and boosting have likewise been utilized to an even lesser extent. The present study aims to identify appropriate diagnosis codes for type 2 diabetes mellitus patients by means of building a multi-class prediction model which is both parsimonious and possessing minimum features. In addition, the importance of features for predicting diagnose code is provided. METHODS: This study included 149 patients who have contracted type 2 diabetes mellitus. The sample was collected from a large hospital in Taiwan from November, 2017 to May, 2018. Machine learning algorithms including instance-based, decision trees, deep neural network, and ensemble algorithms were all used to build the predictive models utilized in this study. Average accuracy, area under receiver operating characteristic curve, Matthew correlation coefficient, macro-precision, recall, weighted average of precision and recall, and model process time were subsequently used to assess the performance of the built models. Information gain and gain ratio were used in order to demonstrate feature importance. RESULTS: The results showed that most algorithms, except for deep neural network, performed well in terms of all performance indices regardless of either the training or testing dataset that were used. Ten features and their importance to determine the diagnosis code of type 2 diabetes mellitus were identified. Our proposed predictive model can be further developed into a clinical diagnosis support system or integrated into existing healthcare information systems. Both methods of application can effectively support physicians whenever they are diagnosing type 2 diabetes mellitus patients in order to foster better patient-care planning. PeerJ Inc. 2020-09-10 /pmc/articles/PMC7487151/ /pubmed/32974105 http://dx.doi.org/10.7717/peerj.9920 Text en © 2020 Kuo et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Diabetes and Endocrinology
Kuo, Kuang-Ming
Talley, Paul
Kao, YuHsi
Huang, Chi Hsien
A multi-class classification model for supporting the diagnosis of type II diabetes mellitus
title A multi-class classification model for supporting the diagnosis of type II diabetes mellitus
title_full A multi-class classification model for supporting the diagnosis of type II diabetes mellitus
title_fullStr A multi-class classification model for supporting the diagnosis of type II diabetes mellitus
title_full_unstemmed A multi-class classification model for supporting the diagnosis of type II diabetes mellitus
title_short A multi-class classification model for supporting the diagnosis of type II diabetes mellitus
title_sort multi-class classification model for supporting the diagnosis of type ii diabetes mellitus
topic Diabetes and Endocrinology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7487151/
https://www.ncbi.nlm.nih.gov/pubmed/32974105
http://dx.doi.org/10.7717/peerj.9920
work_keys_str_mv AT kuokuangming amulticlassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus
AT talleypaul amulticlassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus
AT kaoyuhsi amulticlassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus
AT huangchihsien amulticlassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus
AT kuokuangming multiclassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus
AT talleypaul multiclassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus
AT kaoyuhsi multiclassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus
AT huangchihsien multiclassclassificationmodelforsupportingthediagnosisoftypeiidiabetesmellitus