Cargando…
A synchronized multimodal neuroimaging dataset for studying brain language processing
We present a synchronized multimodal neuroimaging dataset for studying brain language processing (SMN4Lang) that contains functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) data on the same 12 healthy volunteers while the volunteers listened to 6 hours of naturalistic stor...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525723/ https://www.ncbi.nlm.nih.gov/pubmed/36180444 http://dx.doi.org/10.1038/s41597-022-01708-5 |
_version_ | 1784800741612847104 |
---|---|
author | Wang, Shaonan Zhang, Xiaohan Zhang, Jiajun Zong, Chengqing |
author_facet | Wang, Shaonan Zhang, Xiaohan Zhang, Jiajun Zong, Chengqing |
author_sort | Wang, Shaonan |
collection | PubMed |
description | We present a synchronized multimodal neuroimaging dataset for studying brain language processing (SMN4Lang) that contains functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) data on the same 12 healthy volunteers while the volunteers listened to 6 hours of naturalistic stories, as well as high-resolution structural (T1, T2), diffusion MRI and resting-state fMRI data for each participant. We also provide rich linguistic annotations for the stimuli, including word frequencies, syntactic tree structures, time-aligned characters and words, and various types of word and character embeddings. Quality assessment indicators verify that this is a high-quality neuroimaging dataset. Such synchronized data is separately collected by the same group of participants first listening to story materials in fMRI and then in MEG which are well suited to studying the dynamic processing of language comprehension, such as the time and location of different linguistic features encoded in the brain. In addition, this dataset, comprising a large vocabulary from stories with various topics, can serve as a brain benchmark to evaluate and improve computational language models. |
format | Online Article Text |
id | pubmed-9525723 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-95257232022-10-02 A synchronized multimodal neuroimaging dataset for studying brain language processing Wang, Shaonan Zhang, Xiaohan Zhang, Jiajun Zong, Chengqing Sci Data Data Descriptor We present a synchronized multimodal neuroimaging dataset for studying brain language processing (SMN4Lang) that contains functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) data on the same 12 healthy volunteers while the volunteers listened to 6 hours of naturalistic stories, as well as high-resolution structural (T1, T2), diffusion MRI and resting-state fMRI data for each participant. We also provide rich linguistic annotations for the stimuli, including word frequencies, syntactic tree structures, time-aligned characters and words, and various types of word and character embeddings. Quality assessment indicators verify that this is a high-quality neuroimaging dataset. Such synchronized data is separately collected by the same group of participants first listening to story materials in fMRI and then in MEG which are well suited to studying the dynamic processing of language comprehension, such as the time and location of different linguistic features encoded in the brain. In addition, this dataset, comprising a large vocabulary from stories with various topics, can serve as a brain benchmark to evaluate and improve computational language models. Nature Publishing Group UK 2022-09-30 /pmc/articles/PMC9525723/ /pubmed/36180444 http://dx.doi.org/10.1038/s41597-022-01708-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Data Descriptor Wang, Shaonan Zhang, Xiaohan Zhang, Jiajun Zong, Chengqing A synchronized multimodal neuroimaging dataset for studying brain language processing |
title | A synchronized multimodal neuroimaging dataset for studying brain language processing |
title_full | A synchronized multimodal neuroimaging dataset for studying brain language processing |
title_fullStr | A synchronized multimodal neuroimaging dataset for studying brain language processing |
title_full_unstemmed | A synchronized multimodal neuroimaging dataset for studying brain language processing |
title_short | A synchronized multimodal neuroimaging dataset for studying brain language processing |
title_sort | synchronized multimodal neuroimaging dataset for studying brain language processing |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525723/ https://www.ncbi.nlm.nih.gov/pubmed/36180444 http://dx.doi.org/10.1038/s41597-022-01708-5 |
work_keys_str_mv | AT wangshaonan asynchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing AT zhangxiaohan asynchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing AT zhangjiajun asynchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing AT zongchengqing asynchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing AT wangshaonan synchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing AT zhangxiaohan synchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing AT zhangjiajun synchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing AT zongchengqing synchronizedmultimodalneuroimagingdatasetforstudyingbrainlanguageprocessing |