Cargando…
Introducing MEG-MASC a high-quality magneto-encephalography dataset for evaluating natural speech processing
The “MEG-MASC” dataset provides a curated set of raw magnetoencephalography (MEG) recordings of 27 English speakers who listened to two hours of naturalistic stories. Each participant performed two identical sessions, involving listening to four fictional stories from the Manually Annotated Sub-Corp...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10695966/ https://www.ncbi.nlm.nih.gov/pubmed/38049487 http://dx.doi.org/10.1038/s41597-023-02752-5 |
_version_ | 1785154469507366912 |
---|---|
author | Gwilliams, Laura Flick, Graham Marantz, Alec Pylkkänen, Liina Poeppel, David King, Jean-Rémi |
author_facet | Gwilliams, Laura Flick, Graham Marantz, Alec Pylkkänen, Liina Poeppel, David King, Jean-Rémi |
author_sort | Gwilliams, Laura |
collection | PubMed |
description | The “MEG-MASC” dataset provides a curated set of raw magnetoencephalography (MEG) recordings of 27 English speakers who listened to two hours of naturalistic stories. Each participant performed two identical sessions, involving listening to four fictional stories from the Manually Annotated Sub-Corpus (MASC) intermixed with random word lists and comprehension questions. We time-stamp the onset and offset of each word and phoneme in the metadata of the recording, and organize the dataset according to the ‘Brain Imaging Data Structure’ (BIDS). This data collection provides a suitable benchmark to large-scale encoding and decoding analyses of temporally-resolved brain responses to speech. We provide the Python code to replicate several validations analyses of the MEG evoked responses such as the temporal decoding of phonetic features and word frequency. All code and MEG, audio and text data are publicly available to keep with best practices in transparent and reproducible research. |
format | Online Article Text |
id | pubmed-10695966 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-106959662023-12-06 Introducing MEG-MASC a high-quality magneto-encephalography dataset for evaluating natural speech processing Gwilliams, Laura Flick, Graham Marantz, Alec Pylkkänen, Liina Poeppel, David King, Jean-Rémi Sci Data Data Descriptor The “MEG-MASC” dataset provides a curated set of raw magnetoencephalography (MEG) recordings of 27 English speakers who listened to two hours of naturalistic stories. Each participant performed two identical sessions, involving listening to four fictional stories from the Manually Annotated Sub-Corpus (MASC) intermixed with random word lists and comprehension questions. We time-stamp the onset and offset of each word and phoneme in the metadata of the recording, and organize the dataset according to the ‘Brain Imaging Data Structure’ (BIDS). This data collection provides a suitable benchmark to large-scale encoding and decoding analyses of temporally-resolved brain responses to speech. We provide the Python code to replicate several validations analyses of the MEG evoked responses such as the temporal decoding of phonetic features and word frequency. All code and MEG, audio and text data are publicly available to keep with best practices in transparent and reproducible research. Nature Publishing Group UK 2023-12-04 /pmc/articles/PMC10695966/ /pubmed/38049487 http://dx.doi.org/10.1038/s41597-023-02752-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Data Descriptor Gwilliams, Laura Flick, Graham Marantz, Alec Pylkkänen, Liina Poeppel, David King, Jean-Rémi Introducing MEG-MASC a high-quality magneto-encephalography dataset for evaluating natural speech processing |
title | Introducing MEG-MASC a high-quality magneto-encephalography dataset for evaluating natural speech processing |
title_full | Introducing MEG-MASC a high-quality magneto-encephalography dataset for evaluating natural speech processing |
title_fullStr | Introducing MEG-MASC a high-quality magneto-encephalography dataset for evaluating natural speech processing |
title_full_unstemmed | Introducing MEG-MASC a high-quality magneto-encephalography dataset for evaluating natural speech processing |
title_short | Introducing MEG-MASC a high-quality magneto-encephalography dataset for evaluating natural speech processing |
title_sort | introducing meg-masc a high-quality magneto-encephalography dataset for evaluating natural speech processing |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10695966/ https://www.ncbi.nlm.nih.gov/pubmed/38049487 http://dx.doi.org/10.1038/s41597-023-02752-5 |
work_keys_str_mv | AT gwilliamslaura introducingmegmascahighqualitymagnetoencephalographydatasetforevaluatingnaturalspeechprocessing AT flickgraham introducingmegmascahighqualitymagnetoencephalographydatasetforevaluatingnaturalspeechprocessing AT marantzalec introducingmegmascahighqualitymagnetoencephalographydatasetforevaluatingnaturalspeechprocessing AT pylkkanenliina introducingmegmascahighqualitymagnetoencephalographydatasetforevaluatingnaturalspeechprocessing AT poeppeldavid introducingmegmascahighqualitymagnetoencephalographydatasetforevaluatingnaturalspeechprocessing AT kingjeanremi introducingmegmascahighqualitymagnetoencephalographydatasetforevaluatingnaturalspeechprocessing |