Cargando…
Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network
This paper presents the deep reinforcement learning (DRL) framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real-...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6365640/ https://www.ncbi.nlm.nih.gov/pubmed/30728403 http://dx.doi.org/10.1038/s41598-018-37142-0 |
_version_ | 1783393462443311104 |
---|---|
author | Liu, Ning Liu, Ying Logan, Brent Xu, Zhiyuan Tang, Jian Wang, Yanzhi |
author_facet | Liu, Ning Liu, Ying Logan, Brent Xu, Zhiyuan Tang, Jian Wang, Yanzhi |
author_sort | Liu, Ning |
collection | PubMed |
description | This paper presents the deep reinforcement learning (DRL) framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real-life complexity in heterogeneous disease progression and treatment choices, with the goal of providing doctors and patients the data-driven personalized decision recommendations. The proposed DRL framework comprises (i) a supervised learning step to predict expert actions, and (ii) a deep reinforcement learning step to estimate the long-term value function of Dynamic Treatment Regimes. Both steps depend on deep neural networks. As a key motivational example, we have implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease after transplantation. In the experimental results, we have demonstrated promising accuracy in predicting human experts’ decisions, as well as the high expected reward function in the DRL-based dynamic treatment regimes. |
format | Online Article Text |
id | pubmed-6365640 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-63656402019-02-08 Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network Liu, Ning Liu, Ying Logan, Brent Xu, Zhiyuan Tang, Jian Wang, Yanzhi Sci Rep Article This paper presents the deep reinforcement learning (DRL) framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real-life complexity in heterogeneous disease progression and treatment choices, with the goal of providing doctors and patients the data-driven personalized decision recommendations. The proposed DRL framework comprises (i) a supervised learning step to predict expert actions, and (ii) a deep reinforcement learning step to estimate the long-term value function of Dynamic Treatment Regimes. Both steps depend on deep neural networks. As a key motivational example, we have implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease after transplantation. In the experimental results, we have demonstrated promising accuracy in predicting human experts’ decisions, as well as the high expected reward function in the DRL-based dynamic treatment regimes. Nature Publishing Group UK 2019-02-06 /pmc/articles/PMC6365640/ /pubmed/30728403 http://dx.doi.org/10.1038/s41598-018-37142-0 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Liu, Ning Liu, Ying Logan, Brent Xu, Zhiyuan Tang, Jian Wang, Yanzhi Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network |
title | Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network |
title_full | Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network |
title_fullStr | Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network |
title_full_unstemmed | Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network |
title_short | Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network |
title_sort | learning the dynamic treatment regimes from medical registry data through deep q-network |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6365640/ https://www.ncbi.nlm.nih.gov/pubmed/30728403 http://dx.doi.org/10.1038/s41598-018-37142-0 |
work_keys_str_mv | AT liuning learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork AT liuying learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork AT loganbrent learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork AT xuzhiyuan learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork AT tangjian learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork AT wangyanzhi learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork |