Cargando…
A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis
Deep Reinforcement Learning (DRL) has been increasingly attempted in assisting clinicians for real-time treatment of sepsis. While a value function quantifies the performance of policies in such decision-making processes, most value-based DRL algorithms cannot evaluate the target value function prec...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9894526/ https://www.ncbi.nlm.nih.gov/pubmed/36732666 http://dx.doi.org/10.1038/s41746-023-00755-5 |
_version_ | 1784881761335902208 |
---|---|
author | Wu, XiaoDan Li, RuiChang He, Zhen Yu, TianZhi Cheng, ChangQing |
author_facet | Wu, XiaoDan Li, RuiChang He, Zhen Yu, TianZhi Cheng, ChangQing |
author_sort | Wu, XiaoDan |
collection | PubMed |
description | Deep Reinforcement Learning (DRL) has been increasingly attempted in assisting clinicians for real-time treatment of sepsis. While a value function quantifies the performance of policies in such decision-making processes, most value-based DRL algorithms cannot evaluate the target value function precisely and are not as safe as clinical experts. In this study, we propose a Weighted Dueling Double Deep Q-Network with embedded human Expertise (WD3QNE). A target Q value function with adaptive dynamic weight is designed to improve the estimate accuracy and human expertise in decision-making is leveraged. In addition, the random forest algorithm is employed for feature selection to improve model interpretability. We test our algorithm against state-of-the-art value function methods in terms of expected return, survival rate, action distribution and external validation. The results demonstrate that WD3QNE obtains the highest survival rate of 97.81% in MIMIC-III dataset. Our proposed method is capable of providing reliable treatment decisions with embedded clinician expertise. |
format | Online Article Text |
id | pubmed-9894526 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-98945262023-02-02 A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis Wu, XiaoDan Li, RuiChang He, Zhen Yu, TianZhi Cheng, ChangQing NPJ Digit Med Article Deep Reinforcement Learning (DRL) has been increasingly attempted in assisting clinicians for real-time treatment of sepsis. While a value function quantifies the performance of policies in such decision-making processes, most value-based DRL algorithms cannot evaluate the target value function precisely and are not as safe as clinical experts. In this study, we propose a Weighted Dueling Double Deep Q-Network with embedded human Expertise (WD3QNE). A target Q value function with adaptive dynamic weight is designed to improve the estimate accuracy and human expertise in decision-making is leveraged. In addition, the random forest algorithm is employed for feature selection to improve model interpretability. We test our algorithm against state-of-the-art value function methods in terms of expected return, survival rate, action distribution and external validation. The results demonstrate that WD3QNE obtains the highest survival rate of 97.81% in MIMIC-III dataset. Our proposed method is capable of providing reliable treatment decisions with embedded clinician expertise. Nature Publishing Group UK 2023-02-02 /pmc/articles/PMC9894526/ /pubmed/36732666 http://dx.doi.org/10.1038/s41746-023-00755-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Wu, XiaoDan Li, RuiChang He, Zhen Yu, TianZhi Cheng, ChangQing A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis |
title | A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis |
title_full | A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis |
title_fullStr | A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis |
title_full_unstemmed | A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis |
title_short | A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis |
title_sort | value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9894526/ https://www.ncbi.nlm.nih.gov/pubmed/36732666 http://dx.doi.org/10.1038/s41746-023-00755-5 |
work_keys_str_mv | AT wuxiaodan avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis AT liruichang avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis AT hezhen avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis AT yutianzhi avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis AT chengchangqing avaluebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis AT wuxiaodan valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis AT liruichang valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis AT hezhen valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis AT yutianzhi valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis AT chengchangqing valuebaseddeepreinforcementlearningmodelwithhumanexpertiseinoptimaltreatmentofsepsis |