Cargando…

Application of explainable ensemble artificial intelligence model to categorization of hemodialysis-patient and treatment using nationwide-real-world data in Japan

BACKGROUND: Although dialysis patients are at a high risk of death, it is difficult for medical practitioners to simultaneously evaluate many inter-related risk factors. In this study, we evaluated the characteristics of hemodialysis patients using machine learning model, and its usefulness for scre...

Descripción completa

Detalles Bibliográficos
Autores principales: Kanda, Eiichiro, Epureanu, Bogdan I., Adachi, Taiji, Tsuruta, Yuki, Kikuchi, Kan, Kashihara, Naoki, Abe, Masanori, Masakane, Ikuto, Nitta, Kosaku
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7259704/
https://www.ncbi.nlm.nih.gov/pubmed/32469924
http://dx.doi.org/10.1371/journal.pone.0233491
_version_ 1783540187659239424
author Kanda, Eiichiro
Epureanu, Bogdan I.
Adachi, Taiji
Tsuruta, Yuki
Kikuchi, Kan
Kashihara, Naoki
Abe, Masanori
Masakane, Ikuto
Nitta, Kosaku
author_facet Kanda, Eiichiro
Epureanu, Bogdan I.
Adachi, Taiji
Tsuruta, Yuki
Kikuchi, Kan
Kashihara, Naoki
Abe, Masanori
Masakane, Ikuto
Nitta, Kosaku
author_sort Kanda, Eiichiro
collection PubMed
description BACKGROUND: Although dialysis patients are at a high risk of death, it is difficult for medical practitioners to simultaneously evaluate many inter-related risk factors. In this study, we evaluated the characteristics of hemodialysis patients using machine learning model, and its usefulness for screening hemodialysis patients at a high risk of one-year death using the nation-wide database of the Japanese Society for Dialysis Therapy. MATERIALS AND METHODS: The patients were separated into two datasets (n = 39,930, 39,930, respectively). We categorized hemodialysis patients in Japan into new clusters generated by the K-means clustering method using the development dataset. The association between a cluster and the risk of death was evaluated using multivariate Cox proportional hazards models. Then, we developed an ensemble model composed of the clusters and support vector machine models in the model development phase, and compared the accuracy of the prediction of mortality between the machine learning models in the model validation phase. RESULTS: Average age of the subjects was 65.7±12.2 years; 32.7% had diabetes mellitus. The five clusters clearly distinguished the groups on the basis of their characteristics: Cluster 1, young male, and chronic glomerulonephritis; Cluster 2, female, and chronic glomerulonephritis; Cluster 3, diabetes mellitus; Cluster 4, elderly and nephrosclerosis; Cluster 5, elderly and protein energy wasting. These clusters were associated with the risk of death; Cluster 5 compared with Cluster 1, hazard ratio 8.86 (95% CI 7.68, 10.21). The accuracy of the ensemble model for the prediction of 1-year death was 0.948 and higher than those of logistic regression model (0.938), support vector machine model (0.937), and deep learning model (0.936). CONCLUSIONS: The clusters clearly categorized patient on their characteristics, and reflected their prognosis. Our real-world-data-based machine learning system is applicable to identifying high-risk hemodialysis patients in clinical settings, and has a strong potential to guide treatments and improve their prognosis.
format Online
Article
Text
id pubmed-7259704
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-72597042020-06-08 Application of explainable ensemble artificial intelligence model to categorization of hemodialysis-patient and treatment using nationwide-real-world data in Japan Kanda, Eiichiro Epureanu, Bogdan I. Adachi, Taiji Tsuruta, Yuki Kikuchi, Kan Kashihara, Naoki Abe, Masanori Masakane, Ikuto Nitta, Kosaku PLoS One Research Article BACKGROUND: Although dialysis patients are at a high risk of death, it is difficult for medical practitioners to simultaneously evaluate many inter-related risk factors. In this study, we evaluated the characteristics of hemodialysis patients using machine learning model, and its usefulness for screening hemodialysis patients at a high risk of one-year death using the nation-wide database of the Japanese Society for Dialysis Therapy. MATERIALS AND METHODS: The patients were separated into two datasets (n = 39,930, 39,930, respectively). We categorized hemodialysis patients in Japan into new clusters generated by the K-means clustering method using the development dataset. The association between a cluster and the risk of death was evaluated using multivariate Cox proportional hazards models. Then, we developed an ensemble model composed of the clusters and support vector machine models in the model development phase, and compared the accuracy of the prediction of mortality between the machine learning models in the model validation phase. RESULTS: Average age of the subjects was 65.7±12.2 years; 32.7% had diabetes mellitus. The five clusters clearly distinguished the groups on the basis of their characteristics: Cluster 1, young male, and chronic glomerulonephritis; Cluster 2, female, and chronic glomerulonephritis; Cluster 3, diabetes mellitus; Cluster 4, elderly and nephrosclerosis; Cluster 5, elderly and protein energy wasting. These clusters were associated with the risk of death; Cluster 5 compared with Cluster 1, hazard ratio 8.86 (95% CI 7.68, 10.21). The accuracy of the ensemble model for the prediction of 1-year death was 0.948 and higher than those of logistic regression model (0.938), support vector machine model (0.937), and deep learning model (0.936). CONCLUSIONS: The clusters clearly categorized patient on their characteristics, and reflected their prognosis. Our real-world-data-based machine learning system is applicable to identifying high-risk hemodialysis patients in clinical settings, and has a strong potential to guide treatments and improve their prognosis. Public Library of Science 2020-05-29 /pmc/articles/PMC7259704/ /pubmed/32469924 http://dx.doi.org/10.1371/journal.pone.0233491 Text en © 2020 Kanda et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kanda, Eiichiro
Epureanu, Bogdan I.
Adachi, Taiji
Tsuruta, Yuki
Kikuchi, Kan
Kashihara, Naoki
Abe, Masanori
Masakane, Ikuto
Nitta, Kosaku
Application of explainable ensemble artificial intelligence model to categorization of hemodialysis-patient and treatment using nationwide-real-world data in Japan
title Application of explainable ensemble artificial intelligence model to categorization of hemodialysis-patient and treatment using nationwide-real-world data in Japan
title_full Application of explainable ensemble artificial intelligence model to categorization of hemodialysis-patient and treatment using nationwide-real-world data in Japan
title_fullStr Application of explainable ensemble artificial intelligence model to categorization of hemodialysis-patient and treatment using nationwide-real-world data in Japan
title_full_unstemmed Application of explainable ensemble artificial intelligence model to categorization of hemodialysis-patient and treatment using nationwide-real-world data in Japan
title_short Application of explainable ensemble artificial intelligence model to categorization of hemodialysis-patient and treatment using nationwide-real-world data in Japan
title_sort application of explainable ensemble artificial intelligence model to categorization of hemodialysis-patient and treatment using nationwide-real-world data in japan
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7259704/
https://www.ncbi.nlm.nih.gov/pubmed/32469924
http://dx.doi.org/10.1371/journal.pone.0233491
work_keys_str_mv AT kandaeiichiro applicationofexplainableensembleartificialintelligencemodeltocategorizationofhemodialysispatientandtreatmentusingnationwiderealworlddatainjapan
AT epureanubogdani applicationofexplainableensembleartificialintelligencemodeltocategorizationofhemodialysispatientandtreatmentusingnationwiderealworlddatainjapan
AT adachitaiji applicationofexplainableensembleartificialintelligencemodeltocategorizationofhemodialysispatientandtreatmentusingnationwiderealworlddatainjapan
AT tsurutayuki applicationofexplainableensembleartificialintelligencemodeltocategorizationofhemodialysispatientandtreatmentusingnationwiderealworlddatainjapan
AT kikuchikan applicationofexplainableensembleartificialintelligencemodeltocategorizationofhemodialysispatientandtreatmentusingnationwiderealworlddatainjapan
AT kashiharanaoki applicationofexplainableensembleartificialintelligencemodeltocategorizationofhemodialysispatientandtreatmentusingnationwiderealworlddatainjapan
AT abemasanori applicationofexplainableensembleartificialintelligencemodeltocategorizationofhemodialysispatientandtreatmentusingnationwiderealworlddatainjapan
AT masakaneikuto applicationofexplainableensembleartificialintelligencemodeltocategorizationofhemodialysispatientandtreatmentusingnationwiderealworlddatainjapan
AT nittakosaku applicationofexplainableensembleartificialintelligencemodeltocategorizationofhemodialysispatientandtreatmentusingnationwiderealworlddatainjapan