Cargando…

Neuroprosthetic Decoder Training as Imitation Learning

Neuroprosthetic brain-computer interfaces function via an algorithm which decodes neural activity of the user into movements of an end effector, such as a cursor or robotic arm. In practice, the decoder is often learned by updating its parameters while the user performs a task. When the user’s inten...

Descripción completa

Detalles Bibliográficos
Autores principales:	Merel, Josh, Carlson, David, Paninski, Liam, Cunningham, John P.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2016
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4871564/ https://www.ncbi.nlm.nih.gov/pubmed/27191387 http://dx.doi.org/10.1371/journal.pcbi.1004948

_version_	1782432619848269824
author	Merel, Josh Carlson, David Paninski, Liam Cunningham, John P.
author_facet	Merel, Josh Carlson, David Paninski, Liam Cunningham, John P.
author_sort	Merel, Josh
collection	PubMed
description	Neuroprosthetic brain-computer interfaces function via an algorithm which decodes neural activity of the user into movements of an end effector, such as a cursor or robotic arm. In practice, the decoder is often learned by updating its parameters while the user performs a task. When the user’s intention is not directly observable, recent methods have demonstrated value in training the decoder against a surrogate for the user’s intended movement. Here we show that training a decoder in this way is a novel variant of an imitation learning problem, where an oracle or expert is employed for supervised training in lieu of direct observations, which are not available. Specifically, we describe how a generic imitation learning meta-algorithm, dataset aggregation (DAgger), can be adapted to train a generic brain-computer interface. By deriving existing learning algorithms for brain-computer interfaces in this framework, we provide a novel analysis of regret (an important metric of learning efficacy) for brain-computer interfaces. This analysis allows us to characterize the space of algorithmic variants and bounds on their regret rates. Existing approaches for decoder learning have been performed in the cursor control setting, but the available design principles for these decoders are such that it has been impossible to scale them to naturalistic settings. Leveraging our findings, we then offer an algorithm that combines imitation learning with optimal control, which should allow for training of arbitrary effectors for which optimal control can generate goal-oriented control. We demonstrate this novel and general BCI algorithm with simulated neuroprosthetic control of a 26 degree-of-freedom model of an arm, a sophisticated and realistic end effector.
format	Online Article Text
id	pubmed-4871564
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-48715642016-05-31 Neuroprosthetic Decoder Training as Imitation Learning Merel, Josh Carlson, David Paninski, Liam Cunningham, John P. PLoS Comput Biol Research Article Neuroprosthetic brain-computer interfaces function via an algorithm which decodes neural activity of the user into movements of an end effector, such as a cursor or robotic arm. In practice, the decoder is often learned by updating its parameters while the user performs a task. When the user’s intention is not directly observable, recent methods have demonstrated value in training the decoder against a surrogate for the user’s intended movement. Here we show that training a decoder in this way is a novel variant of an imitation learning problem, where an oracle or expert is employed for supervised training in lieu of direct observations, which are not available. Specifically, we describe how a generic imitation learning meta-algorithm, dataset aggregation (DAgger), can be adapted to train a generic brain-computer interface. By deriving existing learning algorithms for brain-computer interfaces in this framework, we provide a novel analysis of regret (an important metric of learning efficacy) for brain-computer interfaces. This analysis allows us to characterize the space of algorithmic variants and bounds on their regret rates. Existing approaches for decoder learning have been performed in the cursor control setting, but the available design principles for these decoders are such that it has been impossible to scale them to naturalistic settings. Leveraging our findings, we then offer an algorithm that combines imitation learning with optimal control, which should allow for training of arbitrary effectors for which optimal control can generate goal-oriented control. We demonstrate this novel and general BCI algorithm with simulated neuroprosthetic control of a 26 degree-of-freedom model of an arm, a sophisticated and realistic end effector. Public Library of Science 2016-05-18 /pmc/articles/PMC4871564/ /pubmed/27191387 http://dx.doi.org/10.1371/journal.pcbi.1004948 Text en © 2016 Merel et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Merel, Josh Carlson, David Paninski, Liam Cunningham, John P. Neuroprosthetic Decoder Training as Imitation Learning
title	Neuroprosthetic Decoder Training as Imitation Learning
title_full	Neuroprosthetic Decoder Training as Imitation Learning
title_fullStr	Neuroprosthetic Decoder Training as Imitation Learning
title_full_unstemmed	Neuroprosthetic Decoder Training as Imitation Learning
title_short	Neuroprosthetic Decoder Training as Imitation Learning
title_sort	neuroprosthetic decoder training as imitation learning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4871564/ https://www.ncbi.nlm.nih.gov/pubmed/27191387 http://dx.doi.org/10.1371/journal.pcbi.1004948
work_keys_str_mv	AT mereljosh neuroprostheticdecodertrainingasimitationlearning AT carlsondavid neuroprostheticdecodertrainingasimitationlearning AT paninskiliam neuroprostheticdecodertrainingasimitationlearning AT cunninghamjohnp neuroprostheticdecodertrainingasimitationlearning

Neuroprosthetic Decoder Training as Imitation Learning

Ejemplares similares