Cargando…

Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies

The number of all possible epidemics of a given infectious disease that could occur on a given landscape is large for systems of real-world complexity. Furthermore, there is no guarantee that the control actions that are optimal, on average, over all possible epidemics are also best for each possibl...

Descripción completa

Detalles Bibliográficos
Autores principales:	Probert, W. J. M., Lakkur, S., Fonnesbeck, C. J., Shea, K., Runge, M. C., Tildesley, M. J., Ferrari, M. J.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	The Royal Society 2019
Materias:	Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6558555/ https://www.ncbi.nlm.nih.gov/pubmed/31104604 http://dx.doi.org/10.1098/rstb.2018.0277

_version_	1783425651164839936
author	Probert, W. J. M. Lakkur, S. Fonnesbeck, C. J. Shea, K. Runge, M. C. Tildesley, M. J. Ferrari, M. J.
author_facet	Probert, W. J. M. Lakkur, S. Fonnesbeck, C. J. Shea, K. Runge, M. C. Tildesley, M. J. Ferrari, M. J.
author_sort	Probert, W. J. M.
collection	PubMed
description	The number of all possible epidemics of a given infectious disease that could occur on a given landscape is large for systems of real-world complexity. Furthermore, there is no guarantee that the control actions that are optimal, on average, over all possible epidemics are also best for each possible epidemic. Reinforcement learning (RL) and Monte Carlo control have been used to develop machine-readable context-dependent solutions for complex problems with many possible realizations ranging from video-games to the game of Go. RL could be a valuable tool to generate context-dependent policies for outbreak response, though translating the resulting policies into simple rules that can be read and interpreted by human decision-makers remains a challenge. Here we illustrate the application of RL to the development of context-dependent outbreak response policies to minimize outbreaks of foot-and-mouth disease. We show that control based on the resulting context-dependent policies, which adapt interventions to the specific outbreak, result in smaller outbreaks than static policies. We further illustrate two approaches for translating the complex machine-readable policies into simple heuristics that can be evaluated by human decision-makers. This article is part of the theme issue ‘Modelling infectious disease outbreaks in humans, animals and plants: epidemic forecasting and control’. This theme issue is linked with the earlier issue ‘Modelling infectious disease outbreaks in humans, animals and plants: approaches and important themes’.
format	Online Article Text
id	pubmed-6558555
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	The Royal Society
record_format	MEDLINE/PubMed
spelling	pubmed-65585552019-06-26 Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies Probert, W. J. M. Lakkur, S. Fonnesbeck, C. J. Shea, K. Runge, M. C. Tildesley, M. J. Ferrari, M. J. Philos Trans R Soc Lond B Biol Sci Articles The number of all possible epidemics of a given infectious disease that could occur on a given landscape is large for systems of real-world complexity. Furthermore, there is no guarantee that the control actions that are optimal, on average, over all possible epidemics are also best for each possible epidemic. Reinforcement learning (RL) and Monte Carlo control have been used to develop machine-readable context-dependent solutions for complex problems with many possible realizations ranging from video-games to the game of Go. RL could be a valuable tool to generate context-dependent policies for outbreak response, though translating the resulting policies into simple rules that can be read and interpreted by human decision-makers remains a challenge. Here we illustrate the application of RL to the development of context-dependent outbreak response policies to minimize outbreaks of foot-and-mouth disease. We show that control based on the resulting context-dependent policies, which adapt interventions to the specific outbreak, result in smaller outbreaks than static policies. We further illustrate two approaches for translating the complex machine-readable policies into simple heuristics that can be evaluated by human decision-makers. This article is part of the theme issue ‘Modelling infectious disease outbreaks in humans, animals and plants: epidemic forecasting and control’. This theme issue is linked with the earlier issue ‘Modelling infectious disease outbreaks in humans, animals and plants: approaches and important themes’. The Royal Society 2019-07-08 2019-05-20 /pmc/articles/PMC6558555/ /pubmed/31104604 http://dx.doi.org/10.1098/rstb.2018.0277 Text en © 2019 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle	Articles Probert, W. J. M. Lakkur, S. Fonnesbeck, C. J. Shea, K. Runge, M. C. Tildesley, M. J. Ferrari, M. J. Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies
title	Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies
title_full	Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies
title_fullStr	Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies
title_full_unstemmed	Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies
title_short	Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies
title_sort	context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies
topic	Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6558555/ https://www.ncbi.nlm.nih.gov/pubmed/31104604 http://dx.doi.org/10.1098/rstb.2018.0277
work_keys_str_mv	AT probertwjm contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT lakkurs contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT fonnesbeckcj contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT sheak contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT rungemc contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT tildesleymj contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies AT ferrarimj contextmattersusingreinforcementlearningtodevelophumanreadablestatedependentoutbreakresponsepolicies

Context matters: using reinforcement learning to develop human-readable, state-dependent outbreak response policies

Ejemplares similares