Cargando…
Dissecting neural computations in the human auditory pathway using deep neural networks for speech
The human auditory system extracts rich linguistic abstractions from speech signals. Traditional approaches to understanding this complex process have used linear feature-encoding models, with limited success. Artificial neural networks excel in speech recognition tasks and offer promising computati...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10689246/ https://www.ncbi.nlm.nih.gov/pubmed/37904043 http://dx.doi.org/10.1038/s41593-023-01468-4 |
_version_ | 1785152330865311744 |
---|---|
author | Li, Yuanning Anumanchipalli, Gopala K. Mohamed, Abdelrahman Chen, Peili Carney, Laurel H. Lu, Junfeng Wu, Jinsong Chang, Edward F. |
author_facet | Li, Yuanning Anumanchipalli, Gopala K. Mohamed, Abdelrahman Chen, Peili Carney, Laurel H. Lu, Junfeng Wu, Jinsong Chang, Edward F. |
author_sort | Li, Yuanning |
collection | PubMed |
description | The human auditory system extracts rich linguistic abstractions from speech signals. Traditional approaches to understanding this complex process have used linear feature-encoding models, with limited success. Artificial neural networks excel in speech recognition tasks and offer promising computational models of speech processing. We used speech representations in state-of-the-art deep neural network (DNN) models to investigate neural coding from the auditory nerve to the speech cortex. Representations in hierarchical layers of the DNN correlated well with the neural activity throughout the ascending auditory system. Unsupervised speech models performed at least as well as other purely supervised or fine-tuned models. Deeper DNN layers were better correlated with the neural activity in the higher-order auditory cortex, with computations aligned with phonemic and syllabic structures in speech. Accordingly, DNN models trained on either English or Mandarin predicted cortical responses in native speakers of each language. These results reveal convergence between DNN model representations and the biological auditory pathway, offering new approaches for modeling neural coding in the auditory cortex. |
format | Online Article Text |
id | pubmed-10689246 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-106892462023-12-02 Dissecting neural computations in the human auditory pathway using deep neural networks for speech Li, Yuanning Anumanchipalli, Gopala K. Mohamed, Abdelrahman Chen, Peili Carney, Laurel H. Lu, Junfeng Wu, Jinsong Chang, Edward F. Nat Neurosci Article The human auditory system extracts rich linguistic abstractions from speech signals. Traditional approaches to understanding this complex process have used linear feature-encoding models, with limited success. Artificial neural networks excel in speech recognition tasks and offer promising computational models of speech processing. We used speech representations in state-of-the-art deep neural network (DNN) models to investigate neural coding from the auditory nerve to the speech cortex. Representations in hierarchical layers of the DNN correlated well with the neural activity throughout the ascending auditory system. Unsupervised speech models performed at least as well as other purely supervised or fine-tuned models. Deeper DNN layers were better correlated with the neural activity in the higher-order auditory cortex, with computations aligned with phonemic and syllabic structures in speech. Accordingly, DNN models trained on either English or Mandarin predicted cortical responses in native speakers of each language. These results reveal convergence between DNN model representations and the biological auditory pathway, offering new approaches for modeling neural coding in the auditory cortex. Nature Publishing Group US 2023-10-30 2023 /pmc/articles/PMC10689246/ /pubmed/37904043 http://dx.doi.org/10.1038/s41593-023-01468-4 Text en © The Author(s). 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Li, Yuanning Anumanchipalli, Gopala K. Mohamed, Abdelrahman Chen, Peili Carney, Laurel H. Lu, Junfeng Wu, Jinsong Chang, Edward F. Dissecting neural computations in the human auditory pathway using deep neural networks for speech |
title | Dissecting neural computations in the human auditory pathway using deep neural networks for speech |
title_full | Dissecting neural computations in the human auditory pathway using deep neural networks for speech |
title_fullStr | Dissecting neural computations in the human auditory pathway using deep neural networks for speech |
title_full_unstemmed | Dissecting neural computations in the human auditory pathway using deep neural networks for speech |
title_short | Dissecting neural computations in the human auditory pathway using deep neural networks for speech |
title_sort | dissecting neural computations in the human auditory pathway using deep neural networks for speech |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10689246/ https://www.ncbi.nlm.nih.gov/pubmed/37904043 http://dx.doi.org/10.1038/s41593-023-01468-4 |
work_keys_str_mv | AT liyuanning dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech AT anumanchipalligopalak dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech AT mohamedabdelrahman dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech AT chenpeili dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech AT carneylaurelh dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech AT lujunfeng dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech AT wujinsong dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech AT changedwardf dissectingneuralcomputationsinthehumanauditorypathwayusingdeepneuralnetworksforspeech |