Cargando…

Place preference and vocal learning rely on distinct reinforcers in songbirds

In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-age...

Descripción completa

Detalles Bibliográficos
Autores principales:	Murdoch, Don, Chen, Ruidong, Goldberg, Jesse H.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5928228/ https://www.ncbi.nlm.nih.gov/pubmed/29712967 http://dx.doi.org/10.1038/s41598-018-25112-5

_version_	1783319202136850432
author	Murdoch, Don Chen, Ruidong Goldberg, Jesse H.
author_facet	Murdoch, Don Chen, Ruidong Goldberg, Jesse H.
author_sort	Murdoch, Don
collection	PubMed
description	In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-agents, each of which is shaped by agent-specific reinforcement, but it remains unknown if animals adopt this strategy. Here we use songbirds to test if navigation and singing, two behaviors with distinct objectives, can be differentially reinforced. We demonstrate that strobe flashes aversively condition place preference but not song syllables. Brief noise bursts aversively condition song syllables but positively reinforce place preference. Thus distinct behavior-generating systems, or agencies, within a single animal can be shaped by correspondingly distinct reinforcement signals. Our findings suggest that spatially segregated vocal circuits can solve a credit assignment problem associated with multiobjective learning.
format	Online Article Text
id	pubmed-5928228
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-59282282018-05-07 Place preference and vocal learning rely on distinct reinforcers in songbirds Murdoch, Don Chen, Ruidong Goldberg, Jesse H. Sci Rep Article In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-agents, each of which is shaped by agent-specific reinforcement, but it remains unknown if animals adopt this strategy. Here we use songbirds to test if navigation and singing, two behaviors with distinct objectives, can be differentially reinforced. We demonstrate that strobe flashes aversively condition place preference but not song syllables. Brief noise bursts aversively condition song syllables but positively reinforce place preference. Thus distinct behavior-generating systems, or agencies, within a single animal can be shaped by correspondingly distinct reinforcement signals. Our findings suggest that spatially segregated vocal circuits can solve a credit assignment problem associated with multiobjective learning. Nature Publishing Group UK 2018-04-30 /pmc/articles/PMC5928228/ /pubmed/29712967 http://dx.doi.org/10.1038/s41598-018-25112-5 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Murdoch, Don Chen, Ruidong Goldberg, Jesse H. Place preference and vocal learning rely on distinct reinforcers in songbirds
title	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_full	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_fullStr	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_full_unstemmed	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_short	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_sort	place preference and vocal learning rely on distinct reinforcers in songbirds
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5928228/ https://www.ncbi.nlm.nih.gov/pubmed/29712967 http://dx.doi.org/10.1038/s41598-018-25112-5
work_keys_str_mv	AT murdochdon placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds AT chenruidong placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds AT goldbergjesseh placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds

Place preference and vocal learning rely on distinct reinforcers in songbirds

Ejemplares similares