Cargando…

Reinforcement learning assisted oxygen therapy for COVID-19 patients under intensive care

BACKGROUND: Patients with severe Coronavirus disease 19 (COVID-19) typically require supplemental oxygen as an essential treatment. We developed a machine learning algorithm, based on deep Reinforcement Learning (RL), for continuous management of oxygen flow rate for critically ill patients under in...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Hua, Zhu, Jiahao, Xie, Wei, Zhong, Judy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8678583/
https://www.ncbi.nlm.nih.gov/pubmed/34920724
http://dx.doi.org/10.1186/s12911-021-01712-6
_version_ 1784616336665608192
author Zheng, Hua
Zhu, Jiahao
Xie, Wei
Zhong, Judy
author_facet Zheng, Hua
Zhu, Jiahao
Xie, Wei
Zhong, Judy
author_sort Zheng, Hua
collection PubMed
description BACKGROUND: Patients with severe Coronavirus disease 19 (COVID-19) typically require supplemental oxygen as an essential treatment. We developed a machine learning algorithm, based on deep Reinforcement Learning (RL), for continuous management of oxygen flow rate for critically ill patients under intensive care, which can identify the optimal personalized oxygen flow rate with strong potentials to reduce mortality rate relative to the current clinical practice. METHODS: We modeled the oxygen flow trajectory of COVID-19 patients and their health outcomes as a Markov decision process. Based on individual patient characteristics and health status, an optimal oxygen control policy is learned by using deep deterministic policy gradient (DDPG) and real-time recommends the oxygen flow rate to reduce the mortality rate. We assessed the performance of proposed methods through cross validation by using a retrospective cohort of 1372 critically ill patients with COVID-19 from New York University Langone Health ambulatory care with electronic health records from April 2020 to January 2021. RESULTS: The mean mortality rate under the RL algorithm is lower than the standard of care by 2.57% (95% CI: 2.08–3.06) reduction (P < 0.001) from 7.94% under the standard of care to 5.37% under our proposed algorithm. The averaged recommended oxygen flow rate is 1.28 L/min (95% CI: 1.14–1.42) lower than the rate delivered to patients. Thus, the RL algorithm could potentially lead to better intensive care treatment that can reduce the mortality rate, while saving the oxygen scarce resources. It can reduce the oxygen shortage issue and improve public health during the COVID-19 pandemic. CONCLUSIONS: A personalized reinforcement learning oxygen flow control algorithm for COVID-19 patients under intensive care showed a substantial reduction in 7-day mortality rate as compared to the standard of care. In the overall cross validation cohort independent of the training data, mortality was lowest in patients for whom intensivists’ actual flow rate matched the RL decisions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-021-01712-6.
format Online
Article
Text
id pubmed-8678583
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-86785832021-12-17 Reinforcement learning assisted oxygen therapy for COVID-19 patients under intensive care Zheng, Hua Zhu, Jiahao Xie, Wei Zhong, Judy BMC Med Inform Decis Mak Research BACKGROUND: Patients with severe Coronavirus disease 19 (COVID-19) typically require supplemental oxygen as an essential treatment. We developed a machine learning algorithm, based on deep Reinforcement Learning (RL), for continuous management of oxygen flow rate for critically ill patients under intensive care, which can identify the optimal personalized oxygen flow rate with strong potentials to reduce mortality rate relative to the current clinical practice. METHODS: We modeled the oxygen flow trajectory of COVID-19 patients and their health outcomes as a Markov decision process. Based on individual patient characteristics and health status, an optimal oxygen control policy is learned by using deep deterministic policy gradient (DDPG) and real-time recommends the oxygen flow rate to reduce the mortality rate. We assessed the performance of proposed methods through cross validation by using a retrospective cohort of 1372 critically ill patients with COVID-19 from New York University Langone Health ambulatory care with electronic health records from April 2020 to January 2021. RESULTS: The mean mortality rate under the RL algorithm is lower than the standard of care by 2.57% (95% CI: 2.08–3.06) reduction (P < 0.001) from 7.94% under the standard of care to 5.37% under our proposed algorithm. The averaged recommended oxygen flow rate is 1.28 L/min (95% CI: 1.14–1.42) lower than the rate delivered to patients. Thus, the RL algorithm could potentially lead to better intensive care treatment that can reduce the mortality rate, while saving the oxygen scarce resources. It can reduce the oxygen shortage issue and improve public health during the COVID-19 pandemic. CONCLUSIONS: A personalized reinforcement learning oxygen flow control algorithm for COVID-19 patients under intensive care showed a substantial reduction in 7-day mortality rate as compared to the standard of care. In the overall cross validation cohort independent of the training data, mortality was lowest in patients for whom intensivists’ actual flow rate matched the RL decisions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-021-01712-6. BioMed Central 2021-12-17 /pmc/articles/PMC8678583/ /pubmed/34920724 http://dx.doi.org/10.1186/s12911-021-01712-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zheng, Hua
Zhu, Jiahao
Xie, Wei
Zhong, Judy
Reinforcement learning assisted oxygen therapy for COVID-19 patients under intensive care
title Reinforcement learning assisted oxygen therapy for COVID-19 patients under intensive care
title_full Reinforcement learning assisted oxygen therapy for COVID-19 patients under intensive care
title_fullStr Reinforcement learning assisted oxygen therapy for COVID-19 patients under intensive care
title_full_unstemmed Reinforcement learning assisted oxygen therapy for COVID-19 patients under intensive care
title_short Reinforcement learning assisted oxygen therapy for COVID-19 patients under intensive care
title_sort reinforcement learning assisted oxygen therapy for covid-19 patients under intensive care
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8678583/
https://www.ncbi.nlm.nih.gov/pubmed/34920724
http://dx.doi.org/10.1186/s12911-021-01712-6
work_keys_str_mv AT zhenghua reinforcementlearningassistedoxygentherapyforcovid19patientsunderintensivecare
AT zhujiahao reinforcementlearningassistedoxygentherapyforcovid19patientsunderintensivecare
AT xiewei reinforcementlearningassistedoxygentherapyforcovid19patientsunderintensivecare
AT zhongjudy reinforcementlearningassistedoxygentherapyforcovid19patientsunderintensivecare