Cargando…
Exploring optimal control of epidemic spread using reinforcement learning
Pandemic defines the global outbreak of a disease having a high transmission rate. The impact of a pandemic situation can be lessened by restricting the movement of the mass. However, one of its concomitant circumstances is an economic crisis. In this article, we demonstrate what actions an agent (t...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7744528/ https://www.ncbi.nlm.nih.gov/pubmed/33328551 http://dx.doi.org/10.1038/s41598-020-79147-8 |
_version_ | 1783624437760786432 |
---|---|
author | Ohi, Abu Quwsar Mridha, M. F. Monowar, Muhammad Mostafa Hamid, Md. Abdul |
author_facet | Ohi, Abu Quwsar Mridha, M. F. Monowar, Muhammad Mostafa Hamid, Md. Abdul |
author_sort | Ohi, Abu Quwsar |
collection | PubMed |
description | Pandemic defines the global outbreak of a disease having a high transmission rate. The impact of a pandemic situation can be lessened by restricting the movement of the mass. However, one of its concomitant circumstances is an economic crisis. In this article, we demonstrate what actions an agent (trained using reinforcement learning) may take in different possible scenarios of a pandemic depending on the spread of disease and economic factors. To train the agent, we design a virtual pandemic scenario closely related to the present COVID-19 crisis. Then, we apply reinforcement learning, a branch of artificial intelligence, that deals with how an individual (human/machine) should interact on an environment (real/virtual) to achieve the cherished goal. Finally, we demonstrate what optimal actions the agent perform to reduce the spread of disease while considering the economic factors. In our experiment, we let the agent find an optimal solution without providing any prior knowledge. After training, we observed that the agent places a long length lockdown to reduce the first surge of a disease. Furthermore, the agent places a combination of cyclic lockdowns and short length lockdowns to halt the resurgence of the disease. Analyzing the agent’s performed actions, we discover that the agent decides movement restrictions not only based on the number of the infectious population but also considering the reproduction rate of the disease. The estimation and policy of the agent may improve the human-strategy of placing lockdown so that an economic crisis may be avoided while mitigating an infectious disease. |
format | Online Article Text |
id | pubmed-7744528 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-77445282020-12-17 Exploring optimal control of epidemic spread using reinforcement learning Ohi, Abu Quwsar Mridha, M. F. Monowar, Muhammad Mostafa Hamid, Md. Abdul Sci Rep Article Pandemic defines the global outbreak of a disease having a high transmission rate. The impact of a pandemic situation can be lessened by restricting the movement of the mass. However, one of its concomitant circumstances is an economic crisis. In this article, we demonstrate what actions an agent (trained using reinforcement learning) may take in different possible scenarios of a pandemic depending on the spread of disease and economic factors. To train the agent, we design a virtual pandemic scenario closely related to the present COVID-19 crisis. Then, we apply reinforcement learning, a branch of artificial intelligence, that deals with how an individual (human/machine) should interact on an environment (real/virtual) to achieve the cherished goal. Finally, we demonstrate what optimal actions the agent perform to reduce the spread of disease while considering the economic factors. In our experiment, we let the agent find an optimal solution without providing any prior knowledge. After training, we observed that the agent places a long length lockdown to reduce the first surge of a disease. Furthermore, the agent places a combination of cyclic lockdowns and short length lockdowns to halt the resurgence of the disease. Analyzing the agent’s performed actions, we discover that the agent decides movement restrictions not only based on the number of the infectious population but also considering the reproduction rate of the disease. The estimation and policy of the agent may improve the human-strategy of placing lockdown so that an economic crisis may be avoided while mitigating an infectious disease. Nature Publishing Group UK 2020-12-16 /pmc/articles/PMC7744528/ /pubmed/33328551 http://dx.doi.org/10.1038/s41598-020-79147-8 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Ohi, Abu Quwsar Mridha, M. F. Monowar, Muhammad Mostafa Hamid, Md. Abdul Exploring optimal control of epidemic spread using reinforcement learning |
title | Exploring optimal control of epidemic spread using reinforcement learning |
title_full | Exploring optimal control of epidemic spread using reinforcement learning |
title_fullStr | Exploring optimal control of epidemic spread using reinforcement learning |
title_full_unstemmed | Exploring optimal control of epidemic spread using reinforcement learning |
title_short | Exploring optimal control of epidemic spread using reinforcement learning |
title_sort | exploring optimal control of epidemic spread using reinforcement learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7744528/ https://www.ncbi.nlm.nih.gov/pubmed/33328551 http://dx.doi.org/10.1038/s41598-020-79147-8 |
work_keys_str_mv | AT ohiabuquwsar exploringoptimalcontrolofepidemicspreadusingreinforcementlearning AT mridhamf exploringoptimalcontrolofepidemicspreadusingreinforcementlearning AT monowarmuhammadmostafa exploringoptimalcontrolofepidemicspreadusingreinforcementlearning AT hamidmdabdul exploringoptimalcontrolofepidemicspreadusingreinforcementlearning |