Cargando…

Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models

Objective. Development of brain–computer interface (BCI) technology is key for enabling communication in individuals who have lost the faculty of speech due to severe motor paralysis. A BCI control strategy that is gaining attention employs speech decoding from neural data. Recent studies have shown...

Descripción completa

Detalles Bibliográficos
Autores principales: Berezutskaya, Julia, Freudenburg, Zachary V, Vansteensel, Mariska J, Aarnoutse, Erik J, Ramsey, Nick F, van Gerven, Marcel A J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: IOP Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10510111/
https://www.ncbi.nlm.nih.gov/pubmed/37467739
http://dx.doi.org/10.1088/1741-2552/ace8be
_version_ 1785107894728916992
author Berezutskaya, Julia
Freudenburg, Zachary V
Vansteensel, Mariska J
Aarnoutse, Erik J
Ramsey, Nick F
van Gerven, Marcel A J
author_facet Berezutskaya, Julia
Freudenburg, Zachary V
Vansteensel, Mariska J
Aarnoutse, Erik J
Ramsey, Nick F
van Gerven, Marcel A J
author_sort Berezutskaya, Julia
collection PubMed
description Objective. Development of brain–computer interface (BCI) technology is key for enabling communication in individuals who have lost the faculty of speech due to severe motor paralysis. A BCI control strategy that is gaining attention employs speech decoding from neural data. Recent studies have shown that a combination of direct neural recordings and advanced computational models can provide promising results. Understanding which decoding strategies deliver best and directly applicable results is crucial for advancing the field. Approach. In this paper, we optimized and validated a decoding approach based on speech reconstruction directly from high-density electrocorticography recordings from sensorimotor cortex during a speech production task. Main results. We show that (1) dedicated machine learning optimization of reconstruction models is key for achieving the best reconstruction performance; (2) individual word decoding in reconstructed speech achieves 92%–100% accuracy (chance level is 8%); (3) direct reconstruction from sensorimotor brain activity produces intelligible speech. Significance. These results underline the need for model optimization in achieving best speech decoding results and highlight the potential that reconstruction-based speech decoding from sensorimotor cortex can offer for development of next-generation BCI technology for communication.
format Online
Article
Text
id pubmed-10510111
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher IOP Publishing
record_format MEDLINE/PubMed
spelling pubmed-105101112023-09-21 Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models Berezutskaya, Julia Freudenburg, Zachary V Vansteensel, Mariska J Aarnoutse, Erik J Ramsey, Nick F van Gerven, Marcel A J J Neural Eng Paper Objective. Development of brain–computer interface (BCI) technology is key for enabling communication in individuals who have lost the faculty of speech due to severe motor paralysis. A BCI control strategy that is gaining attention employs speech decoding from neural data. Recent studies have shown that a combination of direct neural recordings and advanced computational models can provide promising results. Understanding which decoding strategies deliver best and directly applicable results is crucial for advancing the field. Approach. In this paper, we optimized and validated a decoding approach based on speech reconstruction directly from high-density electrocorticography recordings from sensorimotor cortex during a speech production task. Main results. We show that (1) dedicated machine learning optimization of reconstruction models is key for achieving the best reconstruction performance; (2) individual word decoding in reconstructed speech achieves 92%–100% accuracy (chance level is 8%); (3) direct reconstruction from sensorimotor brain activity produces intelligible speech. Significance. These results underline the need for model optimization in achieving best speech decoding results and highlight the potential that reconstruction-based speech decoding from sensorimotor cortex can offer for development of next-generation BCI technology for communication. IOP Publishing 2023-10-01 2023-09-20 /pmc/articles/PMC10510111/ /pubmed/37467739 http://dx.doi.org/10.1088/1741-2552/ace8be Text en © 2023 The Author(s). Published by IOP Publishing Ltd https://creativecommons.org/licenses/by/4.0/ Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 license (https://creativecommons.org/licenses/by/4.0/) . Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
spellingShingle Paper
Berezutskaya, Julia
Freudenburg, Zachary V
Vansteensel, Mariska J
Aarnoutse, Erik J
Ramsey, Nick F
van Gerven, Marcel A J
Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models
title Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models
title_full Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models
title_fullStr Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models
title_full_unstemmed Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models
title_short Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models
title_sort direct speech reconstruction from sensorimotor brain activity with optimized deep learning models
topic Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10510111/
https://www.ncbi.nlm.nih.gov/pubmed/37467739
http://dx.doi.org/10.1088/1741-2552/ace8be
work_keys_str_mv AT berezutskayajulia directspeechreconstructionfromsensorimotorbrainactivitywithoptimizeddeeplearningmodels
AT freudenburgzacharyv directspeechreconstructionfromsensorimotorbrainactivitywithoptimizeddeeplearningmodels
AT vansteenselmariskaj directspeechreconstructionfromsensorimotorbrainactivitywithoptimizeddeeplearningmodels
AT aarnoutseerikj directspeechreconstructionfromsensorimotorbrainactivitywithoptimizeddeeplearningmodels
AT ramseynickf directspeechreconstructionfromsensorimotorbrainactivitywithoptimizeddeeplearningmodels
AT vangervenmarcelaj directspeechreconstructionfromsensorimotorbrainactivitywithoptimizeddeeplearningmodels