Cargando…

Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem

There is widespread interest in the relationship between the neurobiological systems supporting human cognition and emerging computational systems capable of emulating these capacities. Human speech comprehension, poorly understood as a neurobiological process, is an important case in point. Automat...

Descripción completa

Detalles Bibliográficos
Autores principales: Wingfield, Cai, Su, Li, Liu, Xunying, Zhang, Chao, Woodland, Phil, Thwaites, Andrew, Fonteneau, Elisabeth, Marslen-Wilson, William D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5612454/
https://www.ncbi.nlm.nih.gov/pubmed/28945744
http://dx.doi.org/10.1371/journal.pcbi.1005617
_version_ 1783266089567780864
author Wingfield, Cai
Su, Li
Liu, Xunying
Zhang, Chao
Woodland, Phil
Thwaites, Andrew
Fonteneau, Elisabeth
Marslen-Wilson, William D.
author_facet Wingfield, Cai
Su, Li
Liu, Xunying
Zhang, Chao
Woodland, Phil
Thwaites, Andrew
Fonteneau, Elisabeth
Marslen-Wilson, William D.
author_sort Wingfield, Cai
collection PubMed
description There is widespread interest in the relationship between the neurobiological systems supporting human cognition and emerging computational systems capable of emulating these capacities. Human speech comprehension, poorly understood as a neurobiological process, is an important case in point. Automatic Speech Recognition (ASR) systems with near-human levels of performance are now available, which provide a computationally explicit solution for the recognition of words in continuous speech. This research aims to bridge the gap between speech recognition processes in humans and machines, using novel multivariate techniques to compare incremental ‘machine states’, generated as the ASR analysis progresses over time, to the incremental ‘brain states’, measured using combined electro- and magneto-encephalography (EMEG), generated as the same inputs are heard by human listeners. This direct comparison of dynamic human and machine internal states, as they respond to the same incrementally delivered sensory input, revealed a significant correspondence between neural response patterns in human superior temporal cortex and the structural properties of ASR-derived phonetic models. Spatially coherent patches in human temporal cortex responded selectively to individual phonetic features defined on the basis of machine-extracted regularities in the speech to lexicon mapping process. These results demonstrate the feasibility of relating human and ASR solutions to the problem of speech recognition, and suggest the potential for further studies relating complex neural computations in human speech comprehension to the rapidly evolving ASR systems that address the same problem domain.
format Online
Article
Text
id pubmed-5612454
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56124542017-10-09 Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem Wingfield, Cai Su, Li Liu, Xunying Zhang, Chao Woodland, Phil Thwaites, Andrew Fonteneau, Elisabeth Marslen-Wilson, William D. PLoS Comput Biol Research Article There is widespread interest in the relationship between the neurobiological systems supporting human cognition and emerging computational systems capable of emulating these capacities. Human speech comprehension, poorly understood as a neurobiological process, is an important case in point. Automatic Speech Recognition (ASR) systems with near-human levels of performance are now available, which provide a computationally explicit solution for the recognition of words in continuous speech. This research aims to bridge the gap between speech recognition processes in humans and machines, using novel multivariate techniques to compare incremental ‘machine states’, generated as the ASR analysis progresses over time, to the incremental ‘brain states’, measured using combined electro- and magneto-encephalography (EMEG), generated as the same inputs are heard by human listeners. This direct comparison of dynamic human and machine internal states, as they respond to the same incrementally delivered sensory input, revealed a significant correspondence between neural response patterns in human superior temporal cortex and the structural properties of ASR-derived phonetic models. Spatially coherent patches in human temporal cortex responded selectively to individual phonetic features defined on the basis of machine-extracted regularities in the speech to lexicon mapping process. These results demonstrate the feasibility of relating human and ASR solutions to the problem of speech recognition, and suggest the potential for further studies relating complex neural computations in human speech comprehension to the rapidly evolving ASR systems that address the same problem domain. Public Library of Science 2017-09-25 /pmc/articles/PMC5612454/ /pubmed/28945744 http://dx.doi.org/10.1371/journal.pcbi.1005617 Text en © 2017 Wingfield et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wingfield, Cai
Su, Li
Liu, Xunying
Zhang, Chao
Woodland, Phil
Thwaites, Andrew
Fonteneau, Elisabeth
Marslen-Wilson, William D.
Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem
title Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem
title_full Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem
title_fullStr Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem
title_full_unstemmed Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem
title_short Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem
title_sort relating dynamic brain states to dynamic machine states: human and machine solutions to the speech recognition problem
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5612454/
https://www.ncbi.nlm.nih.gov/pubmed/28945744
http://dx.doi.org/10.1371/journal.pcbi.1005617
work_keys_str_mv AT wingfieldcai relatingdynamicbrainstatestodynamicmachinestateshumanandmachinesolutionstothespeechrecognitionproblem
AT suli relatingdynamicbrainstatestodynamicmachinestateshumanandmachinesolutionstothespeechrecognitionproblem
AT liuxunying relatingdynamicbrainstatestodynamicmachinestateshumanandmachinesolutionstothespeechrecognitionproblem
AT zhangchao relatingdynamicbrainstatestodynamicmachinestateshumanandmachinesolutionstothespeechrecognitionproblem
AT woodlandphil relatingdynamicbrainstatestodynamicmachinestateshumanandmachinesolutionstothespeechrecognitionproblem
AT thwaitesandrew relatingdynamicbrainstatestodynamicmachinestateshumanandmachinesolutionstothespeechrecognitionproblem
AT fonteneauelisabeth relatingdynamicbrainstatestodynamicmachinestateshumanandmachinesolutionstothespeechrecognitionproblem
AT marslenwilsonwilliamd relatingdynamicbrainstatestodynamicmachinestateshumanandmachinesolutionstothespeechrecognitionproblem