Cargando…

Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV

BACKGROUND: Reinforcement learning (RL) provides a promising technique to solve complex sequential decision making problems in health care domains. However, existing studies simply apply naive RL algorithms in discovering optimal treatment strategies for a targeted problem. This kind of direct appli...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Chao, Dong, Yinzhao, Liu, Jiming, Ren, Guoqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454675/
https://www.ncbi.nlm.nih.gov/pubmed/30961606
http://dx.doi.org/10.1186/s12911-019-0755-6
_version_ 1783409585955012608
author Yu, Chao
Dong, Yinzhao
Liu, Jiming
Ren, Guoqi
author_facet Yu, Chao
Dong, Yinzhao
Liu, Jiming
Ren, Guoqi
author_sort Yu, Chao
collection PubMed
description BACKGROUND: Reinforcement learning (RL) provides a promising technique to solve complex sequential decision making problems in health care domains. However, existing studies simply apply naive RL algorithms in discovering optimal treatment strategies for a targeted problem. This kind of direct applications ignores the abundant causal relationships between treatment options and the associated outcomes that are inherent in medical domains. METHODS: This paper investigates how to integrate causal factors into an RL process in order to facilitate the final learning performance and increase explanations of learned strategies. A causal policy gradient algorithm is proposed and evaluated in dynamic treatment regimes (DTRs) for HIV based on a simulated computational model. RESULTS: Simulations prove the effectiveness of the proposed algorithm for designing more efficient treatment protocols in HIV, and different definitions of the causal factors could have significant influence on the final learning performance, indicating the necessity of human prior knowledge on defining a suitable causal relationships for a given problem. CONCLUSIONS: More efficient and robust DTRs for HIV can be derived through incorporation of causal factors between options of anti-HIV drugs and the associated treatment outcomes.
format Online
Article
Text
id pubmed-6454675
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64546752019-04-19 Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV Yu, Chao Dong, Yinzhao Liu, Jiming Ren, Guoqi BMC Med Inform Decis Mak Research BACKGROUND: Reinforcement learning (RL) provides a promising technique to solve complex sequential decision making problems in health care domains. However, existing studies simply apply naive RL algorithms in discovering optimal treatment strategies for a targeted problem. This kind of direct applications ignores the abundant causal relationships between treatment options and the associated outcomes that are inherent in medical domains. METHODS: This paper investigates how to integrate causal factors into an RL process in order to facilitate the final learning performance and increase explanations of learned strategies. A causal policy gradient algorithm is proposed and evaluated in dynamic treatment regimes (DTRs) for HIV based on a simulated computational model. RESULTS: Simulations prove the effectiveness of the proposed algorithm for designing more efficient treatment protocols in HIV, and different definitions of the causal factors could have significant influence on the final learning performance, indicating the necessity of human prior knowledge on defining a suitable causal relationships for a given problem. CONCLUSIONS: More efficient and robust DTRs for HIV can be derived through incorporation of causal factors between options of anti-HIV drugs and the associated treatment outcomes. BioMed Central 2019-04-09 /pmc/articles/PMC6454675/ /pubmed/30961606 http://dx.doi.org/10.1186/s12911-019-0755-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Yu, Chao
Dong, Yinzhao
Liu, Jiming
Ren, Guoqi
Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV
title Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV
title_full Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV
title_fullStr Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV
title_full_unstemmed Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV
title_short Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV
title_sort incorporating causal factors into reinforcement learning for dynamic treatment regimes in hiv
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454675/
https://www.ncbi.nlm.nih.gov/pubmed/30961606
http://dx.doi.org/10.1186/s12911-019-0755-6
work_keys_str_mv AT yuchao incorporatingcausalfactorsintoreinforcementlearningfordynamictreatmentregimesinhiv
AT dongyinzhao incorporatingcausalfactorsintoreinforcementlearningfordynamictreatmentregimesinhiv
AT liujiming incorporatingcausalfactorsintoreinforcementlearningfordynamictreatmentregimesinhiv
AT renguoqi incorporatingcausalfactorsintoreinforcementlearningfordynamictreatmentregimesinhiv