Cargando…

Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients

This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning. We offer an instrumental view of action selection, in which an ac...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Yen, Chang, Acer Y. C., Kanai, Ryota
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6349823/
https://www.ncbi.nlm.nih.gov/pubmed/30723402
http://dx.doi.org/10.3389/fnbot.2018.00088
_version_ 1783390325476163584
author Yu, Yen
Chang, Acer Y. C.
Kanai, Ryota
author_facet Yu, Yen
Chang, Acer Y. C.
Kanai, Ryota
author_sort Yu, Yen
collection PubMed
description This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning. We offer an instrumental view of action selection, in which an action serves to disclose outcomes that have intrinsic meaningfulness to an agent itself. This motivated two central algorithmic ingredients: devaluation and devaluation progress, both underpin agent's cognition concerning intrinsically generated rewards. The two serve as an instantiation of homeostatic and heterostatic intrinsic motivation. A key insight from our algorithm is that the two seemingly opposite motivations can be reconciled—without which exploration and information-gathering cannot be effectively carried out. We supported this claim with empirical evidence, showing that boredom-enabled agents consistently outperformed other curious or explorative agent variants in model building benchmarks based on self-assisted experience accumulation.
format Online
Article
Text
id pubmed-6349823
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-63498232019-02-05 Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients Yu, Yen Chang, Acer Y. C. Kanai, Ryota Front Neurorobot Neuroscience This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning. We offer an instrumental view of action selection, in which an action serves to disclose outcomes that have intrinsic meaningfulness to an agent itself. This motivated two central algorithmic ingredients: devaluation and devaluation progress, both underpin agent's cognition concerning intrinsically generated rewards. The two serve as an instantiation of homeostatic and heterostatic intrinsic motivation. A key insight from our algorithm is that the two seemingly opposite motivations can be reconciled—without which exploration and information-gathering cannot be effectively carried out. We supported this claim with empirical evidence, showing that boredom-enabled agents consistently outperformed other curious or explorative agent variants in model building benchmarks based on self-assisted experience accumulation. Frontiers Media S.A. 2019-01-22 /pmc/articles/PMC6349823/ /pubmed/30723402 http://dx.doi.org/10.3389/fnbot.2018.00088 Text en Copyright © 2019 Yu, Chang and Kanai. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Yu, Yen
Chang, Acer Y. C.
Kanai, Ryota
Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_full Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_fullStr Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_full_unstemmed Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_short Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_sort boredom-driven curious learning by homeo-heterostatic value gradients
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6349823/
https://www.ncbi.nlm.nih.gov/pubmed/30723402
http://dx.doi.org/10.3389/fnbot.2018.00088
work_keys_str_mv AT yuyen boredomdrivencuriouslearningbyhomeoheterostaticvaluegradients
AT changaceryc boredomdrivencuriouslearningbyhomeoheterostaticvaluegradients
AT kanairyota boredomdrivencuriouslearningbyhomeoheterostaticvaluegradients