Cargando…

A synchronized multimodal neuroimaging dataset for studying brain language processing

We present a synchronized multimodal neuroimaging dataset for studying brain language processing (SMN4Lang) that contains functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) data on the same 12 healthy volunteers while the volunteers listened to 6 hours of naturalistic stor...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Shaonan, Zhang, Xiaohan, Zhang, Jiajun, Zong, Chengqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525723/
https://www.ncbi.nlm.nih.gov/pubmed/36180444
http://dx.doi.org/10.1038/s41597-022-01708-5
_version_ 1784800741612847104
author Wang, Shaonan
Zhang, Xiaohan
Zhang, Jiajun
Zong, Chengqing
author_facet Wang, Shaonan
Zhang, Xiaohan
Zhang, Jiajun
Zong, Chengqing
author_sort Wang, Shaonan
collection PubMed
description We present a synchronized multimodal neuroimaging dataset for studying brain language processing (SMN4Lang) that contains functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) data on the same 12 healthy volunteers while the volunteers listened to 6 hours of naturalistic stories, as well as high-resolution structural (T1, T2), diffusion MRI and resting-state fMRI data for each participant. We also provide rich linguistic annotations for the stimuli, including word frequencies, syntactic tree structures, time-aligned characters and words, and various types of word and character embeddings. Quality assessment indicators verify that this is a high-quality neuroimaging dataset. Such synchronized data is separately collected by the same group of participants first listening to story materials in fMRI and then in MEG which are well suited to studying the dynamic processing of language comprehension, such as the time and location of different linguistic features encoded in the brain. In addition, this dataset, comprising a large vocabulary from stories with various topics, can serve as a brain benchmark to evaluate and improve computational language models.
format Online
Article
Text
id pubmed-9525723
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-95257232022-10-02 A synchronized multimodal neuroimaging dataset for studying brain language processing Wang, Shaonan Zhang, Xiaohan Zhang, Jiajun Zong, Chengqing Sci Data Data Descriptor We present a synchronized multimodal neuroimaging dataset for studying brain language processing (SMN4Lang) that contains functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) data on the same 12 healthy volunteers while the volunteers listened to 6 hours of naturalistic stories, as well as high-resolution structural (T1, T2), diffusion MRI and resting-state fMRI data for each participant. We also provide rich linguistic annotations for the stimuli, including word frequencies, syntactic tree structures, time-aligned characters and words, and various types of word and character embeddings. Quality assessment indicators verify that this is a high-quality neuroimaging dataset. Such synchronized data is separately collected by the same group of participants first listening to story materials in fMRI and then in MEG which are well suited to studying the dynamic processing of language comprehension, such as the time and location of different linguistic features encoded in the brain. In addition, this dataset, comprising a large vocabulary from stories with various topics, can serve as a brain benchmark to evaluate and improve computational language models. Nature Publishing Group UK 2022-09-30 /pmc/articles/PMC9525723/ /pubmed/36180444 http://dx.doi.org/10.1038/s41597-022-01708-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Wang, Shaonan
Zhang, Xiaohan
Zhang, Jiajun
Zong, Chengqing
A synchronized multimodal neuroimaging dataset for studying brain language processing
title A synchronized multimodal neuroimaging dataset for studying brain language processing
title_full A synchronized multimodal neuroimaging dataset for studying brain language processing
title_fullStr A synchronized multimodal neuroimaging dataset for studying brain language processing
title_full_unstemmed A synchronized multimodal neuroimaging dataset for studying brain language processing
title_short A synchronized multimodal neuroimaging dataset for studying brain language processing
title_sort synchronized multimodal neuroimaging dataset for studying brain language processing
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525723/
https://www.ncbi.nlm.nih.gov/pubmed/36180444
http://dx.doi.org/10.1038/s41597-022-01708-5
work_keys_str_mv AT wangshaonan asynchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing
AT zhangxiaohan asynchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing
AT zhangjiajun asynchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing
AT zongchengqing asynchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing
AT wangshaonan synchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing
AT zhangxiaohan synchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing
AT zhangjiajun synchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing
AT zongchengqing synchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing