Cargando…

Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning

A large body of experimental evidence suggests that the hippocampal place field system is involved in reward based navigation learning in rodents. Reinforcement learning (RL) mechanisms have been used to model this, associating the state space in an RL-algorithm to the place-field map in a rat. The...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tamosiunaite, Minija, Ainge, James, Kulvicius, Tomas, Porr, Bernd, Dudchenko, Paul, Wörgötter, Florentin
Formato:	Texto
Lenguaje:	English
Publicado:	Springer US 2008
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3085791/ https://www.ncbi.nlm.nih.gov/pubmed/18446432 http://dx.doi.org/10.1007/s10827-008-0094-6

_version_	1782202661039243264
author	Tamosiunaite, Minija Ainge, James Kulvicius, Tomas Porr, Bernd Dudchenko, Paul Wörgötter, Florentin
author_facet	Tamosiunaite, Minija Ainge, James Kulvicius, Tomas Porr, Bernd Dudchenko, Paul Wörgötter, Florentin
author_sort	Tamosiunaite, Minija
collection	PubMed
description	A large body of experimental evidence suggests that the hippocampal place field system is involved in reward based navigation learning in rodents. Reinforcement learning (RL) mechanisms have been used to model this, associating the state space in an RL-algorithm to the place-field map in a rat. The convergence properties of RL-algorithms are affected by the exploration patterns of the learner. Therefore, we first analyzed the path characteristics of freely exploring rats in a test arena. We found that straight path segments with mean length 23 cm up to a maximal length of 80 cm take up a significant proportion of the total paths. Thus, rat paths are biased as compared to random exploration. Next we designed a RL system that reproduces these specific path characteristics. Our model arena is covered by overlapping, probabilistically firing place fields (PF) of realistic size and coverage. Because convergence of RL-algorithms is also influenced by the state space characteristics, different PF-sizes and densities, leading to a different degree of overlap, were also investigated. The model rat learns finding a reward opposite to its starting point. We observed that the combination of biased straight exploration, overlapping coverage and probabilistic firing will strongly impair the convergence of learning. When the degree of randomness in the exploration is increased, convergence improves, but the distribution of straight path segments becomes unrealistic and paths become ‘wiggly’. To mend this situation without affecting the path characteristic two additional mechanisms are implemented: A gradual drop of the learned weights (weight decay) and path length limitation, which prevents learning if the reward is not found after some expected time. Both mechanisms limit the memory of the system and thereby counteract effects of getting trapped on a wrong path. When using these strategies individually divergent cases get substantially reduced and for some parameter settings no divergence was found anymore at all. Using weight decay and path length limitation at the same time, convergence is not much improved but instead time to convergence increases as the memory limiting effect is getting too strong. The degree of improvement relies also on the size and degree of overlap (coverage density) in the place field system. The used combination of these two parameters leads to a trade-off between convergence and speed to convergence. Thus, this study suggests that the role of the PF-system in navigation learning cannot be considered independently from the animals’ exploration pattern.
format	Text
id	pubmed-3085791
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-30857912011-06-06 Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning Tamosiunaite, Minija Ainge, James Kulvicius, Tomas Porr, Bernd Dudchenko, Paul Wörgötter, Florentin J Comput Neurosci Article A large body of experimental evidence suggests that the hippocampal place field system is involved in reward based navigation learning in rodents. Reinforcement learning (RL) mechanisms have been used to model this, associating the state space in an RL-algorithm to the place-field map in a rat. The convergence properties of RL-algorithms are affected by the exploration patterns of the learner. Therefore, we first analyzed the path characteristics of freely exploring rats in a test arena. We found that straight path segments with mean length 23 cm up to a maximal length of 80 cm take up a significant proportion of the total paths. Thus, rat paths are biased as compared to random exploration. Next we designed a RL system that reproduces these specific path characteristics. Our model arena is covered by overlapping, probabilistically firing place fields (PF) of realistic size and coverage. Because convergence of RL-algorithms is also influenced by the state space characteristics, different PF-sizes and densities, leading to a different degree of overlap, were also investigated. The model rat learns finding a reward opposite to its starting point. We observed that the combination of biased straight exploration, overlapping coverage and probabilistic firing will strongly impair the convergence of learning. When the degree of randomness in the exploration is increased, convergence improves, but the distribution of straight path segments becomes unrealistic and paths become ‘wiggly’. To mend this situation without affecting the path characteristic two additional mechanisms are implemented: A gradual drop of the learned weights (weight decay) and path length limitation, which prevents learning if the reward is not found after some expected time. Both mechanisms limit the memory of the system and thereby counteract effects of getting trapped on a wrong path. When using these strategies individually divergent cases get substantially reduced and for some parameter settings no divergence was found anymore at all. Using weight decay and path length limitation at the same time, convergence is not much improved but instead time to convergence increases as the memory limiting effect is getting too strong. The degree of improvement relies also on the size and degree of overlap (coverage density) in the place field system. The used combination of these two parameters leads to a trade-off between convergence and speed to convergence. Thus, this study suggests that the role of the PF-system in navigation learning cannot be considered independently from the animals’ exploration pattern. Springer US 2008-04-30 2008 /pmc/articles/PMC3085791/ /pubmed/18446432 http://dx.doi.org/10.1007/s10827-008-0094-6 Text en © The Author(s) 2008 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
spellingShingle	Article Tamosiunaite, Minija Ainge, James Kulvicius, Tomas Porr, Bernd Dudchenko, Paul Wörgötter, Florentin Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning
title	Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning
title_full	Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning
title_fullStr	Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning
title_full_unstemmed	Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning
title_short	Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning
title_sort	path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3085791/ https://www.ncbi.nlm.nih.gov/pubmed/18446432 http://dx.doi.org/10.1007/s10827-008-0094-6
work_keys_str_mv	AT tamosiunaiteminija pathfindinginrealandsimulatedratsassessingtheinfluenceofpathcharacteristicsonnavigationlearning AT aingejames pathfindinginrealandsimulatedratsassessingtheinfluenceofpathcharacteristicsonnavigationlearning AT kulviciustomas pathfindinginrealandsimulatedratsassessingtheinfluenceofpathcharacteristicsonnavigationlearning AT porrbernd pathfindinginrealandsimulatedratsassessingtheinfluenceofpathcharacteristicsonnavigationlearning AT dudchenkopaul pathfindinginrealandsimulatedratsassessingtheinfluenceofpathcharacteristicsonnavigationlearning AT worgotterflorentin pathfindinginrealandsimulatedratsassessingtheinfluenceofpathcharacteristicsonnavigationlearning

Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning

Ejemplares similares