Cargando…

Reconstructing Speech from Human Auditory Cortex

How the human auditory system extracts perceptually relevant acoustic features of speech is unknown. To address this question, we used intracranial recordings from nonprimary auditory cortex in the human superior temporal gyrus to determine what acoustic information in speech sounds can be reconstru...

Descripción completa

Detalles Bibliográficos
Autores principales: Pasley, Brian N., David, Stephen V., Mesgarani, Nima, Flinker, Adeen, Shamma, Shihab A., Crone, Nathan E., Knight, Robert T., Chang, Edward F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3269422/
https://www.ncbi.nlm.nih.gov/pubmed/22303281
http://dx.doi.org/10.1371/journal.pbio.1001251
_version_ 1782222475504910336
author Pasley, Brian N.
David, Stephen V.
Mesgarani, Nima
Flinker, Adeen
Shamma, Shihab A.
Crone, Nathan E.
Knight, Robert T.
Chang, Edward F.
author_facet Pasley, Brian N.
David, Stephen V.
Mesgarani, Nima
Flinker, Adeen
Shamma, Shihab A.
Crone, Nathan E.
Knight, Robert T.
Chang, Edward F.
author_sort Pasley, Brian N.
collection PubMed
description How the human auditory system extracts perceptually relevant acoustic features of speech is unknown. To address this question, we used intracranial recordings from nonprimary auditory cortex in the human superior temporal gyrus to determine what acoustic information in speech sounds can be reconstructed from population neural activity. We found that slow and intermediate temporal fluctuations, such as those corresponding to syllable rate, were accurately reconstructed using a linear model based on the auditory spectrogram. However, reconstruction of fast temporal fluctuations, such as syllable onsets and offsets, required a nonlinear sound representation based on temporal modulation energy. Reconstruction accuracy was highest within the range of spectro-temporal fluctuations that have been found to be critical for speech intelligibility. The decoded speech representations allowed readout and identification of individual words directly from brain activity during single trial sound presentations. These findings reveal neural encoding mechanisms of speech acoustic parameters in higher order human auditory cortex.
format Online
Article
Text
id pubmed-3269422
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32694222012-02-02 Reconstructing Speech from Human Auditory Cortex Pasley, Brian N. David, Stephen V. Mesgarani, Nima Flinker, Adeen Shamma, Shihab A. Crone, Nathan E. Knight, Robert T. Chang, Edward F. PLoS Biol Research Article How the human auditory system extracts perceptually relevant acoustic features of speech is unknown. To address this question, we used intracranial recordings from nonprimary auditory cortex in the human superior temporal gyrus to determine what acoustic information in speech sounds can be reconstructed from population neural activity. We found that slow and intermediate temporal fluctuations, such as those corresponding to syllable rate, were accurately reconstructed using a linear model based on the auditory spectrogram. However, reconstruction of fast temporal fluctuations, such as syllable onsets and offsets, required a nonlinear sound representation based on temporal modulation energy. Reconstruction accuracy was highest within the range of spectro-temporal fluctuations that have been found to be critical for speech intelligibility. The decoded speech representations allowed readout and identification of individual words directly from brain activity during single trial sound presentations. These findings reveal neural encoding mechanisms of speech acoustic parameters in higher order human auditory cortex. Public Library of Science 2012-01-31 /pmc/articles/PMC3269422/ /pubmed/22303281 http://dx.doi.org/10.1371/journal.pbio.1001251 Text en Pasley et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Pasley, Brian N.
David, Stephen V.
Mesgarani, Nima
Flinker, Adeen
Shamma, Shihab A.
Crone, Nathan E.
Knight, Robert T.
Chang, Edward F.
Reconstructing Speech from Human Auditory Cortex
title Reconstructing Speech from Human Auditory Cortex
title_full Reconstructing Speech from Human Auditory Cortex
title_fullStr Reconstructing Speech from Human Auditory Cortex
title_full_unstemmed Reconstructing Speech from Human Auditory Cortex
title_short Reconstructing Speech from Human Auditory Cortex
title_sort reconstructing speech from human auditory cortex
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3269422/
https://www.ncbi.nlm.nih.gov/pubmed/22303281
http://dx.doi.org/10.1371/journal.pbio.1001251
work_keys_str_mv AT pasleybriann reconstructingspeechfromhumanauditorycortex
AT davidstephenv reconstructingspeechfromhumanauditorycortex
AT mesgaraninima reconstructingspeechfromhumanauditorycortex
AT flinkeradeen reconstructingspeechfromhumanauditorycortex
AT shammashihaba reconstructingspeechfromhumanauditorycortex
AT cronenathane reconstructingspeechfromhumanauditorycortex
AT knightrobertt reconstructingspeechfromhumanauditorycortex
AT changedwardf reconstructingspeechfromhumanauditorycortex