Cargando…

Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments

The ability of an agent to detect changes in an environment is key to successful adaptation. This ability involves at least two phases: learning a model of an environment, and detecting that a change is likely to have occurred when this model is no longer accurate. This task is particularly challeng...

Descripción completa

Detalles Bibliográficos
Autores principales:	Dick, Jeffery, Ladosz, Pawel, Ben-Iwhiwhu, Eseoghene, Shimadzu, Hideyasu, Kinnell, Peter, Pilly, Praveen K., Kolouri, Soheil, Soltoggio, Andrea
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2020
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7787001/ https://www.ncbi.nlm.nih.gov/pubmed/33424575 http://dx.doi.org/10.3389/fnbot.2020.578675

_version_	1783632742650478592
author	Dick, Jeffery Ladosz, Pawel Ben-Iwhiwhu, Eseoghene Shimadzu, Hideyasu Kinnell, Peter Pilly, Praveen K. Kolouri, Soheil Soltoggio, Andrea
author_facet	Dick, Jeffery Ladosz, Pawel Ben-Iwhiwhu, Eseoghene Shimadzu, Hideyasu Kinnell, Peter Pilly, Praveen K. Kolouri, Soheil Soltoggio, Andrea
author_sort	Dick, Jeffery
collection	PubMed
description	The ability of an agent to detect changes in an environment is key to successful adaptation. This ability involves at least two phases: learning a model of an environment, and detecting that a change is likely to have occurred when this model is no longer accurate. This task is particularly challenging in partially observable environments, such as those modeled with partially observable Markov decision processes (POMDPs). Some predictive learners are able to infer the state from observations and thus perform better with partial observability. Predictive state representations (PSRs) and neural networks are two such tools that can be trained to predict the probabilities of future observations. However, most such existing methods focus primarily on static problems in which only one environment is learned. In this paper, we propose an algorithm that uses statistical tests to estimate the probability of different predictive models to fit the current environment. We exploit the underlying probability distributions of predictive models to provide a fast and explainable method to assess and justify the model's beliefs about the current environment. Crucially, by doing so, the method can label incoming data as fitting different models, and thus can continuously train separate models in different environments. This new method is shown to prevent catastrophic forgetting when new environments, or tasks, are encountered. The method can also be of use when AI-informed decisions require justifications because its beliefs are based on statistical evidence from observations. We empirically demonstrate the benefit of the novel method with simulations in a set of POMDP environments.
format	Online Article Text
id	pubmed-7787001
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-77870012021-01-07 Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments Dick, Jeffery Ladosz, Pawel Ben-Iwhiwhu, Eseoghene Shimadzu, Hideyasu Kinnell, Peter Pilly, Praveen K. Kolouri, Soheil Soltoggio, Andrea Front Neurorobot Neuroscience The ability of an agent to detect changes in an environment is key to successful adaptation. This ability involves at least two phases: learning a model of an environment, and detecting that a change is likely to have occurred when this model is no longer accurate. This task is particularly challenging in partially observable environments, such as those modeled with partially observable Markov decision processes (POMDPs). Some predictive learners are able to infer the state from observations and thus perform better with partial observability. Predictive state representations (PSRs) and neural networks are two such tools that can be trained to predict the probabilities of future observations. However, most such existing methods focus primarily on static problems in which only one environment is learned. In this paper, we propose an algorithm that uses statistical tests to estimate the probability of different predictive models to fit the current environment. We exploit the underlying probability distributions of predictive models to provide a fast and explainable method to assess and justify the model's beliefs about the current environment. Crucially, by doing so, the method can label incoming data as fitting different models, and thus can continuously train separate models in different environments. This new method is shown to prevent catastrophic forgetting when new environments, or tasks, are encountered. The method can also be of use when AI-informed decisions require justifications because its beliefs are based on statistical evidence from observations. We empirically demonstrate the benefit of the novel method with simulations in a set of POMDP environments. Frontiers Media S.A. 2020-12-23 /pmc/articles/PMC7787001/ /pubmed/33424575 http://dx.doi.org/10.3389/fnbot.2020.578675 Text en Copyright © 2020 Dick, Ladosz, Ben-Iwhiwhu, Shimadzu, Kinnell, Pilly, Kolouri and Soltoggio. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Dick, Jeffery Ladosz, Pawel Ben-Iwhiwhu, Eseoghene Shimadzu, Hideyasu Kinnell, Peter Pilly, Praveen K. Kolouri, Soheil Soltoggio, Andrea Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments
title	Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments
title_full	Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments
title_fullStr	Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments
title_full_unstemmed	Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments
title_short	Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments
title_sort	detecting changes and avoiding catastrophic forgetting in dynamic partially observable environments
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7787001/ https://www.ncbi.nlm.nih.gov/pubmed/33424575 http://dx.doi.org/10.3389/fnbot.2020.578675
work_keys_str_mv	AT dickjeffery detectingchangesandavoidingcatastrophicforgettingindynamicpartiallyobservableenvironments AT ladoszpawel detectingchangesandavoidingcatastrophicforgettingindynamicpartiallyobservableenvironments AT beniwhiwhueseoghene detectingchangesandavoidingcatastrophicforgettingindynamicpartiallyobservableenvironments AT shimadzuhideyasu detectingchangesandavoidingcatastrophicforgettingindynamicpartiallyobservableenvironments AT kinnellpeter detectingchangesandavoidingcatastrophicforgettingindynamicpartiallyobservableenvironments AT pillypraveenk detectingchangesandavoidingcatastrophicforgettingindynamicpartiallyobservableenvironments AT kolourisoheil detectingchangesandavoidingcatastrophicforgettingindynamicpartiallyobservableenvironments AT soltoggioandrea detectingchangesandavoidingcatastrophicforgettingindynamicpartiallyobservableenvironments

Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments

Ejemplares similares