Cargando…

Overt speech decoding from cortical activity: a comparison of different linear methods

INTRODUCTION: Speech BCIs aim at reconstructing speech in real time from ongoing cortical activity. Ideal BCIs would need to reconstruct speech audio signal frame by frame on a millisecond-timescale. Such approaches require fast computation. In this respect, linear decoder are good candidates and ha...

Descripción completa

Detalles Bibliográficos
Autores principales: Le Godais, Gaël, Roussel, Philémon, Bocquelet, Florent, Aubert, Marc, Kahane, Philippe, Chabardès, Stéphan, Yvert, Blaise
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10326283/
https://www.ncbi.nlm.nih.gov/pubmed/37425292
http://dx.doi.org/10.3389/fnhum.2023.1124065
_version_ 1785069391456501760
author Le Godais, Gaël
Roussel, Philémon
Bocquelet, Florent
Aubert, Marc
Kahane, Philippe
Chabardès, Stéphan
Yvert, Blaise
author_facet Le Godais, Gaël
Roussel, Philémon
Bocquelet, Florent
Aubert, Marc
Kahane, Philippe
Chabardès, Stéphan
Yvert, Blaise
author_sort Le Godais, Gaël
collection PubMed
description INTRODUCTION: Speech BCIs aim at reconstructing speech in real time from ongoing cortical activity. Ideal BCIs would need to reconstruct speech audio signal frame by frame on a millisecond-timescale. Such approaches require fast computation. In this respect, linear decoder are good candidates and have been widely used in motor BCIs. Yet, they have been very seldomly studied for speech reconstruction, and never for reconstruction of articulatory movements from intracranial activity. Here, we compared vanilla linear regression, ridge-regularized linear regressions, and partial least squares regressions for offline decoding of overt speech from cortical activity. METHODS: Two decoding paradigms were investigated: (1) direct decoding of acoustic vocoder features of speech, and (2) indirect decoding of vocoder features through an intermediate articulatory representation chained with a real-time-compatible DNN-based articulatory-to-acoustic synthesizer. Participant's articulatory trajectories were estimated from an electromagnetic-articulography dataset using dynamic time warping. The accuracy of the decoders was evaluated by computing correlations between original and reconstructed features. RESULTS: We found that similar performance was achieved by all linear methods well above chance levels, albeit without reaching intelligibility. Direct and indirect methods achieved comparable performance, with an advantage for direct decoding. DISCUSSION: Future work will address the development of an improved neural speech decoder compatible with fast frame-by-frame speech reconstruction from ongoing activity at a millisecond timescale.
format Online
Article
Text
id pubmed-10326283
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103262832023-07-08 Overt speech decoding from cortical activity: a comparison of different linear methods Le Godais, Gaël Roussel, Philémon Bocquelet, Florent Aubert, Marc Kahane, Philippe Chabardès, Stéphan Yvert, Blaise Front Hum Neurosci Human Neuroscience INTRODUCTION: Speech BCIs aim at reconstructing speech in real time from ongoing cortical activity. Ideal BCIs would need to reconstruct speech audio signal frame by frame on a millisecond-timescale. Such approaches require fast computation. In this respect, linear decoder are good candidates and have been widely used in motor BCIs. Yet, they have been very seldomly studied for speech reconstruction, and never for reconstruction of articulatory movements from intracranial activity. Here, we compared vanilla linear regression, ridge-regularized linear regressions, and partial least squares regressions for offline decoding of overt speech from cortical activity. METHODS: Two decoding paradigms were investigated: (1) direct decoding of acoustic vocoder features of speech, and (2) indirect decoding of vocoder features through an intermediate articulatory representation chained with a real-time-compatible DNN-based articulatory-to-acoustic synthesizer. Participant's articulatory trajectories were estimated from an electromagnetic-articulography dataset using dynamic time warping. The accuracy of the decoders was evaluated by computing correlations between original and reconstructed features. RESULTS: We found that similar performance was achieved by all linear methods well above chance levels, albeit without reaching intelligibility. Direct and indirect methods achieved comparable performance, with an advantage for direct decoding. DISCUSSION: Future work will address the development of an improved neural speech decoder compatible with fast frame-by-frame speech reconstruction from ongoing activity at a millisecond timescale. Frontiers Media S.A. 2023-06-23 /pmc/articles/PMC10326283/ /pubmed/37425292 http://dx.doi.org/10.3389/fnhum.2023.1124065 Text en Copyright © 2023 Le Godais, Roussel, Bocquelet, Aubert, Kahane, Chabardès and Yvert. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Human Neuroscience
Le Godais, Gaël
Roussel, Philémon
Bocquelet, Florent
Aubert, Marc
Kahane, Philippe
Chabardès, Stéphan
Yvert, Blaise
Overt speech decoding from cortical activity: a comparison of different linear methods
title Overt speech decoding from cortical activity: a comparison of different linear methods
title_full Overt speech decoding from cortical activity: a comparison of different linear methods
title_fullStr Overt speech decoding from cortical activity: a comparison of different linear methods
title_full_unstemmed Overt speech decoding from cortical activity: a comparison of different linear methods
title_short Overt speech decoding from cortical activity: a comparison of different linear methods
title_sort overt speech decoding from cortical activity: a comparison of different linear methods
topic Human Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10326283/
https://www.ncbi.nlm.nih.gov/pubmed/37425292
http://dx.doi.org/10.3389/fnhum.2023.1124065
work_keys_str_mv AT legodaisgael overtspeechdecodingfromcorticalactivityacomparisonofdifferentlinearmethods
AT rousselphilemon overtspeechdecodingfromcorticalactivityacomparisonofdifferentlinearmethods
AT bocqueletflorent overtspeechdecodingfromcorticalactivityacomparisonofdifferentlinearmethods
AT aubertmarc overtspeechdecodingfromcorticalactivityacomparisonofdifferentlinearmethods
AT kahanephilippe overtspeechdecodingfromcorticalactivityacomparisonofdifferentlinearmethods
AT chabardesstephan overtspeechdecodingfromcorticalactivityacomparisonofdifferentlinearmethods
AT yvertblaise overtspeechdecodingfromcorticalactivityacomparisonofdifferentlinearmethods