Cargando…
A dual‐channel language decoding from brain activity with progressive transfer training
When we view a scene, the visual cortex extracts and processes visual information in the scene through various kinds of neural activities. Previous studies have decoded the neural activity into single/multiple semantic category tags which can caption the scene to some extent. However, these tags are...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons, Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449118/ https://www.ncbi.nlm.nih.gov/pubmed/34314088 http://dx.doi.org/10.1002/hbm.25603 |
_version_ | 1784569367083614208 |
---|---|
author | Huang, Wei Yan, Hongmei Cheng, Kaiwen Wang, Yuting Wang, Chong Li, Jiyi Li, Chen Li, Chaorong Zuo, Zhentao Chen, Huafu |
author_facet | Huang, Wei Yan, Hongmei Cheng, Kaiwen Wang, Yuting Wang, Chong Li, Jiyi Li, Chen Li, Chaorong Zuo, Zhentao Chen, Huafu |
author_sort | Huang, Wei |
collection | PubMed |
description | When we view a scene, the visual cortex extracts and processes visual information in the scene through various kinds of neural activities. Previous studies have decoded the neural activity into single/multiple semantic category tags which can caption the scene to some extent. However, these tags are isolated words with no grammatical structure, insufficiently conveying what the scene contains. It is well‐known that textual language (sentences/phrases) is superior to single word in disclosing the meaning of images as well as reflecting people's real understanding of the images. Here, based on artificial intelligence technologies, we attempted to build a dual‐channel language decoding model (DC‐LDM) to decode the neural activities evoked by images into language (phrases or short sentences). The DC‐LDM consisted of five modules, namely, Image‐Extractor, Image‐Encoder, Nerve‐Extractor, Nerve‐Encoder, and Language‐Decoder. In addition, we employed a strategy of progressive transfer to train the DC‐LDM for improving the performance of language decoding. The results showed that the texts decoded by DC‐LDM could describe natural image stimuli accurately and vividly. We adopted six indexes to quantitatively evaluate the difference between the decoded texts and the annotated texts of corresponding visual images, and found that Word2vec‐Cosine similarity (WCS) was the best indicator to reflect the similarity between the decoded and the annotated texts. In addition, among different visual cortices, we found that the text decoded by the higher visual cortex was more consistent with the description of the natural image than the lower one. Our decoding model may provide enlightenment in language‐based brain‐computer interface explorations. |
format | Online Article Text |
id | pubmed-8449118 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-84491182021-09-24 A dual‐channel language decoding from brain activity with progressive transfer training Huang, Wei Yan, Hongmei Cheng, Kaiwen Wang, Yuting Wang, Chong Li, Jiyi Li, Chen Li, Chaorong Zuo, Zhentao Chen, Huafu Hum Brain Mapp Research Articles When we view a scene, the visual cortex extracts and processes visual information in the scene through various kinds of neural activities. Previous studies have decoded the neural activity into single/multiple semantic category tags which can caption the scene to some extent. However, these tags are isolated words with no grammatical structure, insufficiently conveying what the scene contains. It is well‐known that textual language (sentences/phrases) is superior to single word in disclosing the meaning of images as well as reflecting people's real understanding of the images. Here, based on artificial intelligence technologies, we attempted to build a dual‐channel language decoding model (DC‐LDM) to decode the neural activities evoked by images into language (phrases or short sentences). The DC‐LDM consisted of five modules, namely, Image‐Extractor, Image‐Encoder, Nerve‐Extractor, Nerve‐Encoder, and Language‐Decoder. In addition, we employed a strategy of progressive transfer to train the DC‐LDM for improving the performance of language decoding. The results showed that the texts decoded by DC‐LDM could describe natural image stimuli accurately and vividly. We adopted six indexes to quantitatively evaluate the difference between the decoded texts and the annotated texts of corresponding visual images, and found that Word2vec‐Cosine similarity (WCS) was the best indicator to reflect the similarity between the decoded and the annotated texts. In addition, among different visual cortices, we found that the text decoded by the higher visual cortex was more consistent with the description of the natural image than the lower one. Our decoding model may provide enlightenment in language‐based brain‐computer interface explorations. John Wiley & Sons, Inc. 2021-07-27 /pmc/articles/PMC8449118/ /pubmed/34314088 http://dx.doi.org/10.1002/hbm.25603 Text en © 2021 The Authors. Human Brain Mapping published by Wiley Periodicals LLC. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Research Articles Huang, Wei Yan, Hongmei Cheng, Kaiwen Wang, Yuting Wang, Chong Li, Jiyi Li, Chen Li, Chaorong Zuo, Zhentao Chen, Huafu A dual‐channel language decoding from brain activity with progressive transfer training |
title | A dual‐channel language decoding from brain activity with progressive transfer training |
title_full | A dual‐channel language decoding from brain activity with progressive transfer training |
title_fullStr | A dual‐channel language decoding from brain activity with progressive transfer training |
title_full_unstemmed | A dual‐channel language decoding from brain activity with progressive transfer training |
title_short | A dual‐channel language decoding from brain activity with progressive transfer training |
title_sort | dual‐channel language decoding from brain activity with progressive transfer training |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449118/ https://www.ncbi.nlm.nih.gov/pubmed/34314088 http://dx.doi.org/10.1002/hbm.25603 |
work_keys_str_mv | AT huangwei adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT yanhongmei adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT chengkaiwen adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT wangyuting adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT wangchong adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT lijiyi adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT lichen adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT lichaorong adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT zuozhentao adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT chenhuafu adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT huangwei dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT yanhongmei dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT chengkaiwen dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT wangyuting dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT wangchong dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT lijiyi dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT lichen dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT lichaorong dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT zuozhentao dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining AT chenhuafu dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining |