Cargando…

Dissecting neural computations in the human auditory pathway using deep neural networks for speech

The human auditory system extracts rich linguistic abstractions from speech signals. Traditional approaches to understanding this complex process have used linear feature-encoding models, with limited success. Artificial neural networks excel in speech recognition tasks and offer promising computati...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yuanning, Anumanchipalli, Gopala K., Mohamed, Abdelrahman, Chen, Peili, Carney, Laurel H., Lu, Junfeng, Wu, Jinsong, Chang, Edward F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10689246/
https://www.ncbi.nlm.nih.gov/pubmed/37904043
http://dx.doi.org/10.1038/s41593-023-01468-4
_version_ 1785152330865311744
author Li, Yuanning
Anumanchipalli, Gopala K.
Mohamed, Abdelrahman
Chen, Peili
Carney, Laurel H.
Lu, Junfeng
Wu, Jinsong
Chang, Edward F.
author_facet Li, Yuanning
Anumanchipalli, Gopala K.
Mohamed, Abdelrahman
Chen, Peili
Carney, Laurel H.
Lu, Junfeng
Wu, Jinsong
Chang, Edward F.
author_sort Li, Yuanning
collection PubMed
description The human auditory system extracts rich linguistic abstractions from speech signals. Traditional approaches to understanding this complex process have used linear feature-encoding models, with limited success. Artificial neural networks excel in speech recognition tasks and offer promising computational models of speech processing. We used speech representations in state-of-the-art deep neural network (DNN) models to investigate neural coding from the auditory nerve to the speech cortex. Representations in hierarchical layers of the DNN correlated well with the neural activity throughout the ascending auditory system. Unsupervised speech models performed at least as well as other purely supervised or fine-tuned models. Deeper DNN layers were better correlated with the neural activity in the higher-order auditory cortex, with computations aligned with phonemic and syllabic structures in speech. Accordingly, DNN models trained on either English or Mandarin predicted cortical responses in native speakers of each language. These results reveal convergence between DNN model representations and the biological auditory pathway, offering new approaches for modeling neural coding in the auditory cortex.
format Online
Article
Text
id pubmed-10689246
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-106892462023-12-02 Dissecting neural computations in the human auditory pathway using deep neural networks for speech Li, Yuanning Anumanchipalli, Gopala K. Mohamed, Abdelrahman Chen, Peili Carney, Laurel H. Lu, Junfeng Wu, Jinsong Chang, Edward F. Nat Neurosci Article The human auditory system extracts rich linguistic abstractions from speech signals. Traditional approaches to understanding this complex process have used linear feature-encoding models, with limited success. Artificial neural networks excel in speech recognition tasks and offer promising computational models of speech processing. We used speech representations in state-of-the-art deep neural network (DNN) models to investigate neural coding from the auditory nerve to the speech cortex. Representations in hierarchical layers of the DNN correlated well with the neural activity throughout the ascending auditory system. Unsupervised speech models performed at least as well as other purely supervised or fine-tuned models. Deeper DNN layers were better correlated with the neural activity in the higher-order auditory cortex, with computations aligned with phonemic and syllabic structures in speech. Accordingly, DNN models trained on either English or Mandarin predicted cortical responses in native speakers of each language. These results reveal convergence between DNN model representations and the biological auditory pathway, offering new approaches for modeling neural coding in the auditory cortex. Nature Publishing Group US 2023-10-30 2023 /pmc/articles/PMC10689246/ /pubmed/37904043 http://dx.doi.org/10.1038/s41593-023-01468-4 Text en © The Author(s). 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Li, Yuanning
Anumanchipalli, Gopala K.
Mohamed, Abdelrahman
Chen, Peili
Carney, Laurel H.
Lu, Junfeng
Wu, Jinsong
Chang, Edward F.
Dissecting neural computations in the human auditory pathway using deep neural networks for speech
title Dissecting neural computations in the human auditory pathway using deep neural networks for speech
title_full Dissecting neural computations in the human auditory pathway using deep neural networks for speech
title_fullStr Dissecting neural computations in the human auditory pathway using deep neural networks for speech
title_full_unstemmed Dissecting neural computations in the human auditory pathway using deep neural networks for speech
title_short Dissecting neural computations in the human auditory pathway using deep neural networks for speech
title_sort dissecting neural computations in the human auditory pathway using deep neural networks for speech
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10689246/
https://www.ncbi.nlm.nih.gov/pubmed/37904043
http://dx.doi.org/10.1038/s41593-023-01468-4
work_keys_str_mv AT liyuanning dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech
AT anumanchipalligopalak dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech
AT mohamedabdelrahman dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech
AT chenpeili dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech
AT carneylaurelh dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech
AT lujunfeng dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech
AT wujinsong dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech
AT changedwardf dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech