Cargando…
Detection of factors affecting kidney function using machine learning methods
Due to the increasing prevalence of chronic kidney disease and its high mortality rate, study of risk factors affecting the progression of the disease is of great importance. Here in this work, we aim to develop a framework for using machine learning methods to identify factors affecting kidney func...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9758148/ https://www.ncbi.nlm.nih.gov/pubmed/36526702 http://dx.doi.org/10.1038/s41598-022-26160-8 |
_version_ | 1784851980501385216 |
---|---|
author | Haratian, Arezoo Maleki, Zeinab Shayegh, Farzaneh Safaeian, Alireza |
author_facet | Haratian, Arezoo Maleki, Zeinab Shayegh, Farzaneh Safaeian, Alireza |
author_sort | Haratian, Arezoo |
collection | PubMed |
description | Due to the increasing prevalence of chronic kidney disease and its high mortality rate, study of risk factors affecting the progression of the disease is of great importance. Here in this work, we aim to develop a framework for using machine learning methods to identify factors affecting kidney function. To this end classification methods are trained to predict the serum creatinine level based on numerical values of other blood test parameters in one of the three classes representing different ranges of the variable values. Models are trained using the data from blood test results of healthy and patient subjects including 46 different blood test parameters. The best developed models are random forest and LightGBM. Interpretation of the resulting model reveals a direct relationship between vitamin D and blood creatinine level. The detected analogy between these two parameters is reliable, regarding the relatively high predictive accuracy of the random forest model reaching the AUC of 0.90 and the accuracy of 0.74. Moreover, in this paper we develop a Bayesian network to infer the direct relationships between blood test parameters which have consistent results with the classification models. The proposed framework uses an inclusive set of advanced imputation methods to deal with the main challenge of working with electronic health data, missing values. Hence it can be applied to similar clinical studies to investigate and discover the relationships between the factors under study. |
format | Online Article Text |
id | pubmed-9758148 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-97581482022-12-18 Detection of factors affecting kidney function using machine learning methods Haratian, Arezoo Maleki, Zeinab Shayegh, Farzaneh Safaeian, Alireza Sci Rep Article Due to the increasing prevalence of chronic kidney disease and its high mortality rate, study of risk factors affecting the progression of the disease is of great importance. Here in this work, we aim to develop a framework for using machine learning methods to identify factors affecting kidney function. To this end classification methods are trained to predict the serum creatinine level based on numerical values of other blood test parameters in one of the three classes representing different ranges of the variable values. Models are trained using the data from blood test results of healthy and patient subjects including 46 different blood test parameters. The best developed models are random forest and LightGBM. Interpretation of the resulting model reveals a direct relationship between vitamin D and blood creatinine level. The detected analogy between these two parameters is reliable, regarding the relatively high predictive accuracy of the random forest model reaching the AUC of 0.90 and the accuracy of 0.74. Moreover, in this paper we develop a Bayesian network to infer the direct relationships between blood test parameters which have consistent results with the classification models. The proposed framework uses an inclusive set of advanced imputation methods to deal with the main challenge of working with electronic health data, missing values. Hence it can be applied to similar clinical studies to investigate and discover the relationships between the factors under study. Nature Publishing Group UK 2022-12-16 /pmc/articles/PMC9758148/ /pubmed/36526702 http://dx.doi.org/10.1038/s41598-022-26160-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Haratian, Arezoo Maleki, Zeinab Shayegh, Farzaneh Safaeian, Alireza Detection of factors affecting kidney function using machine learning methods |
title | Detection of factors affecting kidney function using machine learning methods |
title_full | Detection of factors affecting kidney function using machine learning methods |
title_fullStr | Detection of factors affecting kidney function using machine learning methods |
title_full_unstemmed | Detection of factors affecting kidney function using machine learning methods |
title_short | Detection of factors affecting kidney function using machine learning methods |
title_sort | detection of factors affecting kidney function using machine learning methods |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9758148/ https://www.ncbi.nlm.nih.gov/pubmed/36526702 http://dx.doi.org/10.1038/s41598-022-26160-8 |
work_keys_str_mv | AT haratianarezoo detectionoffactorsaffectingkidneyfunctionusingmachinelearningmethods AT malekizeinab detectionoffactorsaffectingkidneyfunctionusingmachinelearningmethods AT shayeghfarzaneh detectionoffactorsaffectingkidneyfunctionusingmachinelearningmethods AT safaeianalireza detectionoffactorsaffectingkidneyfunctionusingmachinelearningmethods |