Cargando…

Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data

This study aims to compare the classification performance of statistical models on highly imbalanced kidney data. The health examination cohort database provided by the National Health Insurance Service in Korea is utilized to build models with various machine learning methods. The glomerular filtra...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jeong, Bomi, Cho, Hyunjeong, Kim, Jieun, Kwon, Soon Kil, Hong, SeungWoo, Lee, ChangSik, Kim, TaeYeon, Park, Man Sik, Hong, Seoksu, Heo, Tae-Young
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7345590/ https://www.ncbi.nlm.nih.gov/pubmed/32570782 http://dx.doi.org/10.3390/diagnostics10060415

_version_	1783556219072413696
author	Jeong, Bomi Cho, Hyunjeong Kim, Jieun Kwon, Soon Kil Hong, SeungWoo Lee, ChangSik Kim, TaeYeon Park, Man Sik Hong, Seoksu Heo, Tae-Young
author_facet	Jeong, Bomi Cho, Hyunjeong Kim, Jieun Kwon, Soon Kil Hong, SeungWoo Lee, ChangSik Kim, TaeYeon Park, Man Sik Hong, Seoksu Heo, Tae-Young
author_sort	Jeong, Bomi
collection	PubMed
description	This study aims to compare the classification performance of statistical models on highly imbalanced kidney data. The health examination cohort database provided by the National Health Insurance Service in Korea is utilized to build models with various machine learning methods. The glomerular filtration rate (GFR) is used to diagnose chronic kidney disease (CKD). It is calculated using the Modification of Diet in Renal Disease method and classified into five stages (1, 2, 3A and 3B, 4, and 5). Different CKD stages based on the estimated GFR are considered as six classes of the response variable. This study utilizes two representative generalized linear models for classification, namely, multinomial logistic regression (multinomial LR) and ordinal logistic regression (ordinal LR), as well as two machine learning models, namely, random forest (RF) and autoencoder (AE). The classification performance of the four models is compared in terms of accuracy, sensitivity, specificity, precision, and F1-Measure. To find the best model that classifies CKD stages correctly, the data are divided into a 10-fold dataset with the same rate for each CKD stage. Results indicate that RF and AE show better performance in accuracy than the multinomial and ordinal LR models when classifying the response variable. However, when a highly imbalanced dataset is modeled, the accuracy of the model performance can distort the actual performance. This occurs because accuracy is high even if a statistical model classifies a minority class into a majority class. To solve this problem in performance interpretation, we not only consider accuracy from the confusion matrix but also sensitivity, specificity, precision, and F-1 measure for each class. To present classification performance with a single value for each model, we calculate the macro-average and micro-weighted values for each model. We conclude that AE is the best model classifying CKD stages correctly for all performance indices.
format	Online Article Text
id	pubmed-7345590
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-73455902020-07-09 Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data Jeong, Bomi Cho, Hyunjeong Kim, Jieun Kwon, Soon Kil Hong, SeungWoo Lee, ChangSik Kim, TaeYeon Park, Man Sik Hong, Seoksu Heo, Tae-Young Diagnostics (Basel) Article This study aims to compare the classification performance of statistical models on highly imbalanced kidney data. The health examination cohort database provided by the National Health Insurance Service in Korea is utilized to build models with various machine learning methods. The glomerular filtration rate (GFR) is used to diagnose chronic kidney disease (CKD). It is calculated using the Modification of Diet in Renal Disease method and classified into five stages (1, 2, 3A and 3B, 4, and 5). Different CKD stages based on the estimated GFR are considered as six classes of the response variable. This study utilizes two representative generalized linear models for classification, namely, multinomial logistic regression (multinomial LR) and ordinal logistic regression (ordinal LR), as well as two machine learning models, namely, random forest (RF) and autoencoder (AE). The classification performance of the four models is compared in terms of accuracy, sensitivity, specificity, precision, and F1-Measure. To find the best model that classifies CKD stages correctly, the data are divided into a 10-fold dataset with the same rate for each CKD stage. Results indicate that RF and AE show better performance in accuracy than the multinomial and ordinal LR models when classifying the response variable. However, when a highly imbalanced dataset is modeled, the accuracy of the model performance can distort the actual performance. This occurs because accuracy is high even if a statistical model classifies a minority class into a majority class. To solve this problem in performance interpretation, we not only consider accuracy from the confusion matrix but also sensitivity, specificity, precision, and F-1 measure for each class. To present classification performance with a single value for each model, we calculate the macro-average and micro-weighted values for each model. We conclude that AE is the best model classifying CKD stages correctly for all performance indices. MDPI 2020-06-18 /pmc/articles/PMC7345590/ /pubmed/32570782 http://dx.doi.org/10.3390/diagnostics10060415 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Jeong, Bomi Cho, Hyunjeong Kim, Jieun Kwon, Soon Kil Hong, SeungWoo Lee, ChangSik Kim, TaeYeon Park, Man Sik Hong, Seoksu Heo, Tae-Young Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data
title	Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data
title_full	Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data
title_fullStr	Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data
title_full_unstemmed	Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data
title_short	Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data
title_sort	comparison between statistical models and machine learning methods on classification for highly imbalanced multiclass kidney data
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7345590/ https://www.ncbi.nlm.nih.gov/pubmed/32570782 http://dx.doi.org/10.3390/diagnostics10060415
work_keys_str_mv	AT jeongbomi comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata AT chohyunjeong comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata AT kimjieun comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata AT kwonsoonkil comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata AT hongseungwoo comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata AT leechangsik comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata AT kimtaeyeon comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata AT parkmansik comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata AT hongseoksu comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata AT heotaeyoung comparisonbetweenstatisticalmodelsandmachinelearningmethodsonclassificationforhighlyimbalancedmulticlasskidneydata

Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data

Ejemplares similares