A Spiking Network Model of Decision Making Employing Rewarded STDP

Reward-modulated spike timing dependent plasticity (STDP) combines unsupervised STDP with a reinforcement signal that modulates synaptic changes. It was proposed as a learning rule capable of solving the distal reward problem in reinforcement learning. Nonetheless, performance and limitations of thi...

Bibliographic Details
Main authors: Skorheim, Steven; Lonjers, Peter; Bazhenov, Maxim
Format: Online Article Text
Language: English
Published: Public Library of Science 2014
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3954625/
https://www.ncbi.nlm.nih.gov/pubmed/24632858
http://dx.doi.org/10.1371/journal.pone.0090821
