Cargando…

Detection of factors affecting kidney function using machine learning methods

Due to the increasing prevalence of chronic kidney disease and its high mortality rate, study of risk factors affecting the progression of the disease is of great importance. Here in this work, we aim to develop a framework for using machine learning methods to identify factors affecting kidney func...

Descripción completa

Detalles Bibliográficos
Autores principales: Haratian, Arezoo, Maleki, Zeinab, Shayegh, Farzaneh, Safaeian, Alireza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9758148/
https://www.ncbi.nlm.nih.gov/pubmed/36526702
http://dx.doi.org/10.1038/s41598-022-26160-8
_version_ 1784851980501385216
author Haratian, Arezoo
Maleki, Zeinab
Shayegh, Farzaneh
Safaeian, Alireza
author_facet Haratian, Arezoo
Maleki, Zeinab
Shayegh, Farzaneh
Safaeian, Alireza
author_sort Haratian, Arezoo
collection PubMed
description Due to the increasing prevalence of chronic kidney disease and its high mortality rate, study of risk factors affecting the progression of the disease is of great importance. Here in this work, we aim to develop a framework for using machine learning methods to identify factors affecting kidney function. To this end classification methods are trained to predict the serum creatinine level based on numerical values of other blood test parameters in one of the three classes representing different ranges of the variable values. Models are trained using the data from blood test results of healthy and patient subjects including 46 different blood test parameters. The best developed models are random forest and LightGBM. Interpretation of the resulting model reveals a direct relationship between vitamin D and blood creatinine level. The detected analogy between these two parameters is reliable, regarding the relatively high predictive accuracy of the random forest model reaching the AUC of 0.90 and the accuracy of 0.74. Moreover, in this paper we develop a Bayesian network to infer the direct relationships between blood test parameters which have consistent results with the classification models. The proposed framework uses an inclusive set of advanced imputation methods to deal with the main challenge of working with electronic health data, missing values. Hence it can be applied to similar clinical studies to investigate and discover the relationships between the factors under study.
format Online
Article
Text
id pubmed-9758148
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97581482022-12-18 Detection of factors affecting kidney function using machine learning methods Haratian, Arezoo Maleki, Zeinab Shayegh, Farzaneh Safaeian, Alireza Sci Rep Article Due to the increasing prevalence of chronic kidney disease and its high mortality rate, study of risk factors affecting the progression of the disease is of great importance. Here in this work, we aim to develop a framework for using machine learning methods to identify factors affecting kidney function. To this end classification methods are trained to predict the serum creatinine level based on numerical values of other blood test parameters in one of the three classes representing different ranges of the variable values. Models are trained using the data from blood test results of healthy and patient subjects including 46 different blood test parameters. The best developed models are random forest and LightGBM. Interpretation of the resulting model reveals a direct relationship between vitamin D and blood creatinine level. The detected analogy between these two parameters is reliable, regarding the relatively high predictive accuracy of the random forest model reaching the AUC of 0.90 and the accuracy of 0.74. Moreover, in this paper we develop a Bayesian network to infer the direct relationships between blood test parameters which have consistent results with the classification models. The proposed framework uses an inclusive set of advanced imputation methods to deal with the main challenge of working with electronic health data, missing values. Hence it can be applied to similar clinical studies to investigate and discover the relationships between the factors under study. Nature Publishing Group UK 2022-12-16 /pmc/articles/PMC9758148/ /pubmed/36526702 http://dx.doi.org/10.1038/s41598-022-26160-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Haratian, Arezoo
Maleki, Zeinab
Shayegh, Farzaneh
Safaeian, Alireza
Detection of factors affecting kidney function using machine learning methods
title Detection of factors affecting kidney function using machine learning methods
title_full Detection of factors affecting kidney function using machine learning methods
title_fullStr Detection of factors affecting kidney function using machine learning methods
title_full_unstemmed Detection of factors affecting kidney function using machine learning methods
title_short Detection of factors affecting kidney function using machine learning methods
title_sort detection of factors affecting kidney function using machine learning methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9758148/
https://www.ncbi.nlm.nih.gov/pubmed/36526702
http://dx.doi.org/10.1038/s41598-022-26160-8
work_keys_str_mv AT haratianarezoo detectionoffactorsaffectingkidneyfunctionusingmachinelearningmethods
AT malekizeinab detectionoffactorsaffectingkidneyfunctionusingmachinelearningmethods
AT shayeghfarzaneh detectionoffactorsaffectingkidneyfunctionusingmachinelearningmethods
AT safaeianalireza detectionoffactorsaffectingkidneyfunctionusingmachinelearningmethods