Cargando…
Place preference and vocal learning rely on distinct reinforcers in songbirds
In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-age...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5928228/ https://www.ncbi.nlm.nih.gov/pubmed/29712967 http://dx.doi.org/10.1038/s41598-018-25112-5 |
_version_ | 1783319202136850432 |
---|---|
author | Murdoch, Don Chen, Ruidong Goldberg, Jesse H. |
author_facet | Murdoch, Don Chen, Ruidong Goldberg, Jesse H. |
author_sort | Murdoch, Don |
collection | PubMed |
description | In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-agents, each of which is shaped by agent-specific reinforcement, but it remains unknown if animals adopt this strategy. Here we use songbirds to test if navigation and singing, two behaviors with distinct objectives, can be differentially reinforced. We demonstrate that strobe flashes aversively condition place preference but not song syllables. Brief noise bursts aversively condition song syllables but positively reinforce place preference. Thus distinct behavior-generating systems, or agencies, within a single animal can be shaped by correspondingly distinct reinforcement signals. Our findings suggest that spatially segregated vocal circuits can solve a credit assignment problem associated with multiobjective learning. |
format | Online Article Text |
id | pubmed-5928228 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-59282282018-05-07 Place preference and vocal learning rely on distinct reinforcers in songbirds Murdoch, Don Chen, Ruidong Goldberg, Jesse H. Sci Rep Article In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-agents, each of which is shaped by agent-specific reinforcement, but it remains unknown if animals adopt this strategy. Here we use songbirds to test if navigation and singing, two behaviors with distinct objectives, can be differentially reinforced. We demonstrate that strobe flashes aversively condition place preference but not song syllables. Brief noise bursts aversively condition song syllables but positively reinforce place preference. Thus distinct behavior-generating systems, or agencies, within a single animal can be shaped by correspondingly distinct reinforcement signals. Our findings suggest that spatially segregated vocal circuits can solve a credit assignment problem associated with multiobjective learning. Nature Publishing Group UK 2018-04-30 /pmc/articles/PMC5928228/ /pubmed/29712967 http://dx.doi.org/10.1038/s41598-018-25112-5 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Murdoch, Don Chen, Ruidong Goldberg, Jesse H. Place preference and vocal learning rely on distinct reinforcers in songbirds |
title | Place preference and vocal learning rely on distinct reinforcers in songbirds |
title_full | Place preference and vocal learning rely on distinct reinforcers in songbirds |
title_fullStr | Place preference and vocal learning rely on distinct reinforcers in songbirds |
title_full_unstemmed | Place preference and vocal learning rely on distinct reinforcers in songbirds |
title_short | Place preference and vocal learning rely on distinct reinforcers in songbirds |
title_sort | place preference and vocal learning rely on distinct reinforcers in songbirds |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5928228/ https://www.ncbi.nlm.nih.gov/pubmed/29712967 http://dx.doi.org/10.1038/s41598-018-25112-5 |
work_keys_str_mv | AT murdochdon placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds AT chenruidong placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds AT goldbergjesseh placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds |