Cargando…

From eye-blinks to state construction: Diagnostic benchmarks for online representation learning

We present three new diagnostic prediction problems inspired by classical-conditioning experiments to facilitate research in online prediction learning. Experiments in classical conditioning show that animals such as rabbits, pigeons, and dogs can make long temporal associations that enable multi-st...

Descripción completa

Detalles Bibliográficos
Autores principales: Rafiee, Banafsheh, Abbas, Zaheer, Ghiassian, Sina, Kumaraswamy, Raksha, Sutton, Richard S, Ludvig, Elliot A, White, Adam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9814020/
https://www.ncbi.nlm.nih.gov/pubmed/36618906
http://dx.doi.org/10.1177/10597123221085039
_version_ 1784864042425253888
author Rafiee, Banafsheh
Abbas, Zaheer
Ghiassian, Sina
Kumaraswamy, Raksha
Sutton, Richard S
Ludvig, Elliot A
White, Adam
author_facet Rafiee, Banafsheh
Abbas, Zaheer
Ghiassian, Sina
Kumaraswamy, Raksha
Sutton, Richard S
Ludvig, Elliot A
White, Adam
author_sort Rafiee, Banafsheh
collection PubMed
description We present three new diagnostic prediction problems inspired by classical-conditioning experiments to facilitate research in online prediction learning. Experiments in classical conditioning show that animals such as rabbits, pigeons, and dogs can make long temporal associations that enable multi-step prediction. To replicate this remarkable ability, an agent must construct an internal state representation that summarizes its interaction history. Recurrent neural networks can automatically construct state and learn temporal associations. However, the current training methods are prohibitively expensive for online prediction—continual learning on every time step—which is the focus of this paper. Our proposed problems test the learning capabilities that animals readily exhibit and highlight the limitations of the current recurrent learning methods. While the proposed problems are nontrivial, they are still amenable to extensive testing and analysis in the small-compute regime, thereby enabling researchers to study issues in isolation, ultimately accelerating progress towards scalable online representation learning methods.
format Online
Article
Text
id pubmed-9814020
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-98140202023-01-06 From eye-blinks to state construction: Diagnostic benchmarks for online representation learning Rafiee, Banafsheh Abbas, Zaheer Ghiassian, Sina Kumaraswamy, Raksha Sutton, Richard S Ludvig, Elliot A White, Adam Adapt Behav Articles We present three new diagnostic prediction problems inspired by classical-conditioning experiments to facilitate research in online prediction learning. Experiments in classical conditioning show that animals such as rabbits, pigeons, and dogs can make long temporal associations that enable multi-step prediction. To replicate this remarkable ability, an agent must construct an internal state representation that summarizes its interaction history. Recurrent neural networks can automatically construct state and learn temporal associations. However, the current training methods are prohibitively expensive for online prediction—continual learning on every time step—which is the focus of this paper. Our proposed problems test the learning capabilities that animals readily exhibit and highlight the limitations of the current recurrent learning methods. While the proposed problems are nontrivial, they are still amenable to extensive testing and analysis in the small-compute regime, thereby enabling researchers to study issues in isolation, ultimately accelerating progress towards scalable online representation learning methods. SAGE Publications 2022-04-27 2023-02 /pmc/articles/PMC9814020/ /pubmed/36618906 http://dx.doi.org/10.1177/10597123221085039 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Articles
Rafiee, Banafsheh
Abbas, Zaheer
Ghiassian, Sina
Kumaraswamy, Raksha
Sutton, Richard S
Ludvig, Elliot A
White, Adam
From eye-blinks to state construction: Diagnostic benchmarks for online representation learning
title From eye-blinks to state construction: Diagnostic benchmarks for online representation learning
title_full From eye-blinks to state construction: Diagnostic benchmarks for online representation learning
title_fullStr From eye-blinks to state construction: Diagnostic benchmarks for online representation learning
title_full_unstemmed From eye-blinks to state construction: Diagnostic benchmarks for online representation learning
title_short From eye-blinks to state construction: Diagnostic benchmarks for online representation learning
title_sort from eye-blinks to state construction: diagnostic benchmarks for online representation learning
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9814020/
https://www.ncbi.nlm.nih.gov/pubmed/36618906
http://dx.doi.org/10.1177/10597123221085039
work_keys_str_mv AT rafieebanafsheh fromeyeblinkstostateconstructiondiagnosticbenchmarksforonlinerepresentationlearning
AT abbaszaheer fromeyeblinkstostateconstructiondiagnosticbenchmarksforonlinerepresentationlearning
AT ghiassiansina fromeyeblinkstostateconstructiondiagnosticbenchmarksforonlinerepresentationlearning
AT kumaraswamyraksha fromeyeblinkstostateconstructiondiagnosticbenchmarksforonlinerepresentationlearning
AT suttonrichards fromeyeblinkstostateconstructiondiagnosticbenchmarksforonlinerepresentationlearning
AT ludvigelliota fromeyeblinkstostateconstructiondiagnosticbenchmarksforonlinerepresentationlearning
AT whiteadam fromeyeblinkstostateconstructiondiagnosticbenchmarksforonlinerepresentationlearning