Cargando…

Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network

This paper presents the deep reinforcement learning (DRL) framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real-...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Ning, Liu, Ying, Logan, Brent, Xu, Zhiyuan, Tang, Jian, Wang, Yanzhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6365640/
https://www.ncbi.nlm.nih.gov/pubmed/30728403
http://dx.doi.org/10.1038/s41598-018-37142-0
_version_ 1783393462443311104
author Liu, Ning
Liu, Ying
Logan, Brent
Xu, Zhiyuan
Tang, Jian
Wang, Yanzhi
author_facet Liu, Ning
Liu, Ying
Logan, Brent
Xu, Zhiyuan
Tang, Jian
Wang, Yanzhi
author_sort Liu, Ning
collection PubMed
description This paper presents the deep reinforcement learning (DRL) framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real-life complexity in heterogeneous disease progression and treatment choices, with the goal of providing doctors and patients the data-driven personalized decision recommendations. The proposed DRL framework comprises (i) a supervised learning step to predict expert actions, and (ii) a deep reinforcement learning step to estimate the long-term value function of Dynamic Treatment Regimes. Both steps depend on deep neural networks. As a key motivational example, we have implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease after transplantation. In the experimental results, we have demonstrated promising accuracy in predicting human experts’ decisions, as well as the high expected reward function in the DRL-based dynamic treatment regimes.
format Online
Article
Text
id pubmed-6365640
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-63656402019-02-08 Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network Liu, Ning Liu, Ying Logan, Brent Xu, Zhiyuan Tang, Jian Wang, Yanzhi Sci Rep Article This paper presents the deep reinforcement learning (DRL) framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real-life complexity in heterogeneous disease progression and treatment choices, with the goal of providing doctors and patients the data-driven personalized decision recommendations. The proposed DRL framework comprises (i) a supervised learning step to predict expert actions, and (ii) a deep reinforcement learning step to estimate the long-term value function of Dynamic Treatment Regimes. Both steps depend on deep neural networks. As a key motivational example, we have implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease after transplantation. In the experimental results, we have demonstrated promising accuracy in predicting human experts’ decisions, as well as the high expected reward function in the DRL-based dynamic treatment regimes. Nature Publishing Group UK 2019-02-06 /pmc/articles/PMC6365640/ /pubmed/30728403 http://dx.doi.org/10.1038/s41598-018-37142-0 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Liu, Ning
Liu, Ying
Logan, Brent
Xu, Zhiyuan
Tang, Jian
Wang, Yanzhi
Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network
title Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network
title_full Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network
title_fullStr Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network
title_full_unstemmed Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network
title_short Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network
title_sort learning the dynamic treatment regimes from medical registry data through deep q-network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6365640/
https://www.ncbi.nlm.nih.gov/pubmed/30728403
http://dx.doi.org/10.1038/s41598-018-37142-0
work_keys_str_mv AT liuning learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork
AT liuying learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork
AT loganbrent learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork
AT xuzhiyuan learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork
AT tangjian learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork
AT wangyanzhi learningthedynamictreatmentregimesfrommedicalregistrydatathroughdeepqnetwork