Cargando…

A dual‐channel language decoding from brain activity with progressive transfer training

When we view a scene, the visual cortex extracts and processes visual information in the scene through various kinds of neural activities. Previous studies have decoded the neural activity into single/multiple semantic category tags which can caption the scene to some extent. However, these tags are...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Wei, Yan, Hongmei, Cheng, Kaiwen, Wang, Yuting, Wang, Chong, Li, Jiyi, Li, Chen, Li, Chaorong, Zuo, Zhentao, Chen, Huafu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449118/
https://www.ncbi.nlm.nih.gov/pubmed/34314088
http://dx.doi.org/10.1002/hbm.25603
_version_ 1784569367083614208
author Huang, Wei
Yan, Hongmei
Cheng, Kaiwen
Wang, Yuting
Wang, Chong
Li, Jiyi
Li, Chen
Li, Chaorong
Zuo, Zhentao
Chen, Huafu
author_facet Huang, Wei
Yan, Hongmei
Cheng, Kaiwen
Wang, Yuting
Wang, Chong
Li, Jiyi
Li, Chen
Li, Chaorong
Zuo, Zhentao
Chen, Huafu
author_sort Huang, Wei
collection PubMed
description When we view a scene, the visual cortex extracts and processes visual information in the scene through various kinds of neural activities. Previous studies have decoded the neural activity into single/multiple semantic category tags which can caption the scene to some extent. However, these tags are isolated words with no grammatical structure, insufficiently conveying what the scene contains. It is well‐known that textual language (sentences/phrases) is superior to single word in disclosing the meaning of images as well as reflecting people's real understanding of the images. Here, based on artificial intelligence technologies, we attempted to build a dual‐channel language decoding model (DC‐LDM) to decode the neural activities evoked by images into language (phrases or short sentences). The DC‐LDM consisted of five modules, namely, Image‐Extractor, Image‐Encoder, Nerve‐Extractor, Nerve‐Encoder, and Language‐Decoder. In addition, we employed a strategy of progressive transfer to train the DC‐LDM for improving the performance of language decoding. The results showed that the texts decoded by DC‐LDM could describe natural image stimuli accurately and vividly. We adopted six indexes to quantitatively evaluate the difference between the decoded texts and the annotated texts of corresponding visual images, and found that Word2vec‐Cosine similarity (WCS) was the best indicator to reflect the similarity between the decoded and the annotated texts. In addition, among different visual cortices, we found that the text decoded by the higher visual cortex was more consistent with the description of the natural image than the lower one. Our decoding model may provide enlightenment in language‐based brain‐computer interface explorations.
format Online
Article
Text
id pubmed-8449118
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-84491182021-09-24 A dual‐channel language decoding from brain activity with progressive transfer training Huang, Wei Yan, Hongmei Cheng, Kaiwen Wang, Yuting Wang, Chong Li, Jiyi Li, Chen Li, Chaorong Zuo, Zhentao Chen, Huafu Hum Brain Mapp Research Articles When we view a scene, the visual cortex extracts and processes visual information in the scene through various kinds of neural activities. Previous studies have decoded the neural activity into single/multiple semantic category tags which can caption the scene to some extent. However, these tags are isolated words with no grammatical structure, insufficiently conveying what the scene contains. It is well‐known that textual language (sentences/phrases) is superior to single word in disclosing the meaning of images as well as reflecting people's real understanding of the images. Here, based on artificial intelligence technologies, we attempted to build a dual‐channel language decoding model (DC‐LDM) to decode the neural activities evoked by images into language (phrases or short sentences). The DC‐LDM consisted of five modules, namely, Image‐Extractor, Image‐Encoder, Nerve‐Extractor, Nerve‐Encoder, and Language‐Decoder. In addition, we employed a strategy of progressive transfer to train the DC‐LDM for improving the performance of language decoding. The results showed that the texts decoded by DC‐LDM could describe natural image stimuli accurately and vividly. We adopted six indexes to quantitatively evaluate the difference between the decoded texts and the annotated texts of corresponding visual images, and found that Word2vec‐Cosine similarity (WCS) was the best indicator to reflect the similarity between the decoded and the annotated texts. In addition, among different visual cortices, we found that the text decoded by the higher visual cortex was more consistent with the description of the natural image than the lower one. Our decoding model may provide enlightenment in language‐based brain‐computer interface explorations. John Wiley & Sons, Inc. 2021-07-27 /pmc/articles/PMC8449118/ /pubmed/34314088 http://dx.doi.org/10.1002/hbm.25603 Text en © 2021 The Authors. Human Brain Mapping published by Wiley Periodicals LLC. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Research Articles
Huang, Wei
Yan, Hongmei
Cheng, Kaiwen
Wang, Yuting
Wang, Chong
Li, Jiyi
Li, Chen
Li, Chaorong
Zuo, Zhentao
Chen, Huafu
A dual‐channel language decoding from brain activity with progressive transfer training
title A dual‐channel language decoding from brain activity with progressive transfer training
title_full A dual‐channel language decoding from brain activity with progressive transfer training
title_fullStr A dual‐channel language decoding from brain activity with progressive transfer training
title_full_unstemmed A dual‐channel language decoding from brain activity with progressive transfer training
title_short A dual‐channel language decoding from brain activity with progressive transfer training
title_sort dual‐channel language decoding from brain activity with progressive transfer training
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449118/
https://www.ncbi.nlm.nih.gov/pubmed/34314088
http://dx.doi.org/10.1002/hbm.25603
work_keys_str_mv AT huangwei adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT yanhongmei adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT chengkaiwen adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT wangyuting adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT wangchong adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT lijiyi adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT lichen adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT lichaorong adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT zuozhentao adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT chenhuafu adualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT huangwei dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT yanhongmei dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT chengkaiwen dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT wangyuting dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT wangchong dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT lijiyi dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT lichen dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT lichaorong dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT zuozhentao dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining
AT chenhuafu dualchannellanguagedecodingfrombrainactivitywithprogressivetransfertraining