Cargando…

Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices

Neural interfaces that directly produce intelligible speech from brain activity would allow people with severe impairment from neurological disorders to communicate more naturally. Here, we record neural population activity in motor, premotor and inferior frontal cortices during speech production us...

Descripción completa

Detalles Bibliográficos
Autores principales: Herff, Christian, Diener, Lorenz, Angrick, Miguel, Mugler, Emily, Tate, Matthew C., Goldrick, Matthew A., Krusienski, Dean J., Slutzky, Marc W., Schultz, Tanja
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882773/
https://www.ncbi.nlm.nih.gov/pubmed/31824257
http://dx.doi.org/10.3389/fnins.2019.01267
_version_ 1783474233773391872
author Herff, Christian
Diener, Lorenz
Angrick, Miguel
Mugler, Emily
Tate, Matthew C.
Goldrick, Matthew A.
Krusienski, Dean J.
Slutzky, Marc W.
Schultz, Tanja
author_facet Herff, Christian
Diener, Lorenz
Angrick, Miguel
Mugler, Emily
Tate, Matthew C.
Goldrick, Matthew A.
Krusienski, Dean J.
Slutzky, Marc W.
Schultz, Tanja
author_sort Herff, Christian
collection PubMed
description Neural interfaces that directly produce intelligible speech from brain activity would allow people with severe impairment from neurological disorders to communicate more naturally. Here, we record neural population activity in motor, premotor and inferior frontal cortices during speech production using electrocorticography (ECoG) and show that ECoG signals alone can be used to generate intelligible speech output that can preserve conversational cues. To produce speech directly from neural data, we adapted a method from the field of speech synthesis called unit selection, in which units of speech are concatenated to form audible output. In our approach, which we call Brain-To-Speech, we chose subsequent units of speech based on the measured ECoG activity to generate audio waveforms directly from the neural recordings. Brain-To-Speech employed the user's own voice to generate speech that sounded very natural and included features such as prosody and accentuation. By investigating the brain areas involved in speech production separately, we found that speech motor cortex provided more information for the reconstruction process than the other cortical areas.
format Online
Article
Text
id pubmed-6882773
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-68827732019-12-10 Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices Herff, Christian Diener, Lorenz Angrick, Miguel Mugler, Emily Tate, Matthew C. Goldrick, Matthew A. Krusienski, Dean J. Slutzky, Marc W. Schultz, Tanja Front Neurosci Neuroscience Neural interfaces that directly produce intelligible speech from brain activity would allow people with severe impairment from neurological disorders to communicate more naturally. Here, we record neural population activity in motor, premotor and inferior frontal cortices during speech production using electrocorticography (ECoG) and show that ECoG signals alone can be used to generate intelligible speech output that can preserve conversational cues. To produce speech directly from neural data, we adapted a method from the field of speech synthesis called unit selection, in which units of speech are concatenated to form audible output. In our approach, which we call Brain-To-Speech, we chose subsequent units of speech based on the measured ECoG activity to generate audio waveforms directly from the neural recordings. Brain-To-Speech employed the user's own voice to generate speech that sounded very natural and included features such as prosody and accentuation. By investigating the brain areas involved in speech production separately, we found that speech motor cortex provided more information for the reconstruction process than the other cortical areas. Frontiers Media S.A. 2019-11-22 /pmc/articles/PMC6882773/ /pubmed/31824257 http://dx.doi.org/10.3389/fnins.2019.01267 Text en Copyright © 2019 Herff, Diener, Angrick, Mugler, Tate, Goldrick, Krusienski, Slutzky and Schultz. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Herff, Christian
Diener, Lorenz
Angrick, Miguel
Mugler, Emily
Tate, Matthew C.
Goldrick, Matthew A.
Krusienski, Dean J.
Slutzky, Marc W.
Schultz, Tanja
Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
title Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
title_full Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
title_fullStr Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
title_full_unstemmed Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
title_short Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
title_sort generating natural, intelligible speech from brain activity in motor, premotor, and inferior frontal cortices
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882773/
https://www.ncbi.nlm.nih.gov/pubmed/31824257
http://dx.doi.org/10.3389/fnins.2019.01267
work_keys_str_mv AT herffchristian generatingnaturalintelligiblespeechfrombrainactivityinmotorpremotorandinferiorfrontalcortices
AT dienerlorenz generatingnaturalintelligiblespeechfrombrainactivityinmotorpremotorandinferiorfrontalcortices
AT angrickmiguel generatingnaturalintelligiblespeechfrombrainactivityinmotorpremotorandinferiorfrontalcortices
AT mugleremily generatingnaturalintelligiblespeechfrombrainactivityinmotorpremotorandinferiorfrontalcortices
AT tatematthewc generatingnaturalintelligiblespeechfrombrainactivityinmotorpremotorandinferiorfrontalcortices
AT goldrickmatthewa generatingnaturalintelligiblespeechfrombrainactivityinmotorpremotorandinferiorfrontalcortices
AT krusienskideanj generatingnaturalintelligiblespeechfrombrainactivityinmotorpremotorandinferiorfrontalcortices
AT slutzkymarcw generatingnaturalintelligiblespeechfrombrainactivityinmotorpremotorandinferiorfrontalcortices
AT schultztanja generatingnaturalintelligiblespeechfrombrainactivityinmotorpremotorandinferiorfrontalcortices