Cargando…

Place preference and vocal learning rely on distinct reinforcers in songbirds

In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-age...

Descripción completa

Detalles Bibliográficos
Autores principales: Murdoch, Don, Chen, Ruidong, Goldberg, Jesse H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5928228/
https://www.ncbi.nlm.nih.gov/pubmed/29712967
http://dx.doi.org/10.1038/s41598-018-25112-5
_version_ 1783319202136850432
author Murdoch, Don
Chen, Ruidong
Goldberg, Jesse H.
author_facet Murdoch, Don
Chen, Ruidong
Goldberg, Jesse H.
author_sort Murdoch, Don
collection PubMed
description In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-agents, each of which is shaped by agent-specific reinforcement, but it remains unknown if animals adopt this strategy. Here we use songbirds to test if navigation and singing, two behaviors with distinct objectives, can be differentially reinforced. We demonstrate that strobe flashes aversively condition place preference but not song syllables. Brief noise bursts aversively condition song syllables but positively reinforce place preference. Thus distinct behavior-generating systems, or agencies, within a single animal can be shaped by correspondingly distinct reinforcement signals. Our findings suggest that spatially segregated vocal circuits can solve a credit assignment problem associated with multiobjective learning.
format Online
Article
Text
id pubmed-5928228
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-59282282018-05-07 Place preference and vocal learning rely on distinct reinforcers in songbirds Murdoch, Don Chen, Ruidong Goldberg, Jesse H. Sci Rep Article In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-agents, each of which is shaped by agent-specific reinforcement, but it remains unknown if animals adopt this strategy. Here we use songbirds to test if navigation and singing, two behaviors with distinct objectives, can be differentially reinforced. We demonstrate that strobe flashes aversively condition place preference but not song syllables. Brief noise bursts aversively condition song syllables but positively reinforce place preference. Thus distinct behavior-generating systems, or agencies, within a single animal can be shaped by correspondingly distinct reinforcement signals. Our findings suggest that spatially segregated vocal circuits can solve a credit assignment problem associated with multiobjective learning. Nature Publishing Group UK 2018-04-30 /pmc/articles/PMC5928228/ /pubmed/29712967 http://dx.doi.org/10.1038/s41598-018-25112-5 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Murdoch, Don
Chen, Ruidong
Goldberg, Jesse H.
Place preference and vocal learning rely on distinct reinforcers in songbirds
title Place preference and vocal learning rely on distinct reinforcers in songbirds
title_full Place preference and vocal learning rely on distinct reinforcers in songbirds
title_fullStr Place preference and vocal learning rely on distinct reinforcers in songbirds
title_full_unstemmed Place preference and vocal learning rely on distinct reinforcers in songbirds
title_short Place preference and vocal learning rely on distinct reinforcers in songbirds
title_sort place preference and vocal learning rely on distinct reinforcers in songbirds
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5928228/
https://www.ncbi.nlm.nih.gov/pubmed/29712967
http://dx.doi.org/10.1038/s41598-018-25112-5
work_keys_str_mv AT murdochdon placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds
AT chenruidong placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds
AT goldbergjesseh placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds