Cargando…
Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies
The number of all possible epidemics of a given infectious disease that could occur on a given landscape is large for systems of real-world complexity. Furthermore, there is no guarantee that the control actions that are optimal, on average, over all possible epidemics are also best for each possibl...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6558555/ https://www.ncbi.nlm.nih.gov/pubmed/31104604 http://dx.doi.org/10.1098/rstb.2018.0277 |
_version_ | 1783425651164839936 |
---|---|
author | Probert, W. J. M. Lakkur, S. Fonnesbeck, C. J. Shea, K. Runge, M. C. Tildesley, M. J. Ferrari, M. J. |
author_facet | Probert, W. J. M. Lakkur, S. Fonnesbeck, C. J. Shea, K. Runge, M. C. Tildesley, M. J. Ferrari, M. J. |
author_sort | Probert, W. J. M. |
collection | PubMed |
description | The number of all possible epidemics of a given infectious disease that could occur on a given landscape is large for systems of real-world complexity. Furthermore, there is no guarantee that the control actions that are optimal, on average, over all possible epidemics are also best for each possible epidemic. Reinforcement learning (RL) and Monte Carlo control have been used to develop machine-readable context-dependent solutions for complex problems with many possible realizations ranging from video-games to the game of Go. RL could be a valuable tool to generate context-dependent policies for outbreak response, though translating the resulting policies into simple rules that can be read and interpreted by human decision-makers remains a challenge. Here we illustrate the application of RL to the development of context-dependent outbreak response policies to minimize outbreaks of foot-and-mouth disease. We show that control based on the resulting context-dependent policies, which adapt interventions to the specific outbreak, result in smaller outbreaks than static policies. We further illustrate two approaches for translating the complex machine-readable policies into simple heuristics that can be evaluated by human decision-makers. This article is part of the theme issue ‘Modelling infectious disease outbreaks in humans, animals and plants: epidemic forecasting and control’. This theme issue is linked with the earlier issue ‘Modelling infectious disease outbreaks in humans, animals and plants: approaches and important themes’. |
format | Online Article Text |
id | pubmed-6558555 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-65585552019-06-26 Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies Probert, W. J. M. Lakkur, S. Fonnesbeck, C. J. Shea, K. Runge, M. C. Tildesley, M. J. Ferrari, M. J. Philos Trans R Soc Lond B Biol Sci Articles The number of all possible epidemics of a given infectious disease that could occur on a given landscape is large for systems of real-world complexity. Furthermore, there is no guarantee that the control actions that are optimal, on average, over all possible epidemics are also best for each possible epidemic. Reinforcement learning (RL) and Monte Carlo control have been used to develop machine-readable context-dependent solutions for complex problems with many possible realizations ranging from video-games to the game of Go. RL could be a valuable tool to generate context-dependent policies for outbreak response, though translating the resulting policies into simple rules that can be read and interpreted by human decision-makers remains a challenge. Here we illustrate the application of RL to the development of context-dependent outbreak response policies to minimize outbreaks of foot-and-mouth disease. We show that control based on the resulting context-dependent policies, which adapt interventions to the specific outbreak, result in smaller outbreaks than static policies. We further illustrate two approaches for translating the complex machine-readable policies into simple heuristics that can be evaluated by human decision-makers. This article is part of the theme issue ‘Modelling infectious disease outbreaks in humans, animals and plants: epidemic forecasting and control’. This theme issue is linked with the earlier issue ‘Modelling infectious disease outbreaks in humans, animals and plants: approaches and important themes’. The Royal Society 2019-07-08 2019-05-20 /pmc/articles/PMC6558555/ /pubmed/31104604 http://dx.doi.org/10.1098/rstb.2018.0277 Text en © 2019 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Articles Probert, W. J. M. Lakkur, S. Fonnesbeck, C. J. Shea, K. Runge, M. C. Tildesley, M. J. Ferrari, M. J. Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies |
title | Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies |
title_full | Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies |
title_fullStr | Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies |
title_full_unstemmed | Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies |
title_short | Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies |
title_sort | context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6558555/ https://www.ncbi.nlm.nih.gov/pubmed/31104604 http://dx.doi.org/10.1098/rstb.2018.0277 |
work_keys_str_mv | AT probertwjm contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT lakkurs contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT fonnesbeckcj contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT sheak contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT rungemc contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT tildesleymj contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT ferrarimj contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies |