Cargando…

A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis

Deep Reinforcement Learning (DRL) has been increasingly attempted in assisting clinicians for real-time treatment of sepsis. While a value function quantifies the performance of policies in such decision-making processes, most value-based DRL algorithms cannot evaluate the target value function prec...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, XiaoDan, Li, RuiChang, He, Zhen, Yu, TianZhi, Cheng, ChangQing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9894526/
https://www.ncbi.nlm.nih.gov/pubmed/36732666
http://dx.doi.org/10.1038/s41746-023-00755-5
_version_ 1784881761335902208
author Wu, XiaoDan
Li, RuiChang
He, Zhen
Yu, TianZhi
Cheng, ChangQing
author_facet Wu, XiaoDan
Li, RuiChang
He, Zhen
Yu, TianZhi
Cheng, ChangQing
author_sort Wu, XiaoDan
collection PubMed
description Deep Reinforcement Learning (DRL) has been increasingly attempted in assisting clinicians for real-time treatment of sepsis. While a value function quantifies the performance of policies in such decision-making processes, most value-based DRL algorithms cannot evaluate the target value function precisely and are not as safe as clinical experts. In this study, we propose a Weighted Dueling Double Deep Q-Network with embedded human Expertise (WD3QNE). A target Q value function with adaptive dynamic weight is designed to improve the estimate accuracy and human expertise in decision-making is leveraged. In addition, the random forest algorithm is employed for feature selection to improve model interpretability. We test our algorithm against state-of-the-art value function methods in terms of expected return, survival rate, action distribution and external validation. The results demonstrate that WD3QNE obtains the highest survival rate of 97.81% in MIMIC-III dataset. Our proposed method is capable of providing reliable treatment decisions with embedded clinician expertise.
format Online
Article
Text
id pubmed-9894526
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-98945262023-02-02 A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis Wu, XiaoDan Li, RuiChang He, Zhen Yu, TianZhi Cheng, ChangQing NPJ Digit Med Article Deep Reinforcement Learning (DRL) has been increasingly attempted in assisting clinicians for real-time treatment of sepsis. While a value function quantifies the performance of policies in such decision-making processes, most value-based DRL algorithms cannot evaluate the target value function precisely and are not as safe as clinical experts. In this study, we propose a Weighted Dueling Double Deep Q-Network with embedded human Expertise (WD3QNE). A target Q value function with adaptive dynamic weight is designed to improve the estimate accuracy and human expertise in decision-making is leveraged. In addition, the random forest algorithm is employed for feature selection to improve model interpretability. We test our algorithm against state-of-the-art value function methods in terms of expected return, survival rate, action distribution and external validation. The results demonstrate that WD3QNE obtains the highest survival rate of 97.81% in MIMIC-III dataset. Our proposed method is capable of providing reliable treatment decisions with embedded clinician expertise. Nature Publishing Group UK 2023-02-02 /pmc/articles/PMC9894526/ /pubmed/36732666 http://dx.doi.org/10.1038/s41746-023-00755-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Wu, XiaoDan
Li, RuiChang
He, Zhen
Yu, TianZhi
Cheng, ChangQing
A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis
title A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis
title_full A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis
title_fullStr A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis
title_full_unstemmed A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis
title_short A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis
title_sort value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9894526/
https://www.ncbi.nlm.nih.gov/pubmed/36732666
http://dx.doi.org/10.1038/s41746-023-00755-5
work_keys_str_mv AT wuxiaodan avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis
AT liruichang avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis
AT hezhen avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis
AT yutianzhi avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis
AT chengchangqing avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis
AT wuxiaodan valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis
AT liruichang valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis
AT hezhen valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis
AT yutianzhi valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis
AT chengchangqing valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis