The NMT Scalp EEG Dataset: An Open-Source Annotated Dataset of Healthy and Pathological EEG Recordings for Predictive Modeling

Electroencephalogram (EEG) is widely used for the diagnosis of neurological conditions like epilepsy, neurodegenerative illnesses and sleep-related disorders. Proper interpretation of EEG recordings requires the expertise of trained neurologists, a resource which is scarce in the developing world. N...

Bibliographic Details
Main Authors: Khan, Hassan Aqeel, Ul Ain, Rahat, Kamboh, Awais Mehmood, Butt, Hammad Tanveer, Shafait, Saima, Alamgir, Wasim, Stricker, Didier, Shafait, Faisal
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8766964/
https://www.ncbi.nlm.nih.gov/pubmed/35069095
http://dx.doi.org/10.3389/fnins.2021.755817
_version_ 1784634622865309696
author Khan, Hassan Aqeel
Ul Ain, Rahat
Kamboh, Awais Mehmood
Butt, Hammad Tanveer
Shafait, Saima
Alamgir, Wasim
Stricker, Didier
Shafait, Faisal
collection PubMed
description Electroencephalogram (EEG) is widely used for the diagnosis of neurological conditions like epilepsy, neurodegenerative illnesses and sleep-related disorders. Proper interpretation of EEG recordings requires the expertise of trained neurologists, a resource which is scarce in the developing world. Neurologists spend a significant portion of their time sifting through EEG recordings looking for abnormalities. Most recordings turn out to be completely normal, owing to the low yield of EEG tests. To minimize such wastage of time and effort, automatic algorithms could be used to provide pre-diagnostic screening to separate normal from abnormal EEG. Data-driven machine learning offers a way forward; however, the design and verification of modern machine learning algorithms require properly curated, labeled datasets. To avoid bias, deep-learning-based methods must be trained on large datasets from diverse sources. This work presents a new open-source dataset, named the NMT Scalp EEG Dataset, consisting of 2,417 recordings from unique participants spanning almost 625 h. Each recording is labeled as normal or abnormal by a team of qualified neurologists. Demographic information, such as the gender and age of the patient, is also included. Our dataset focuses on the South Asian population. Several existing state-of-the-art deep learning architectures developed for pre-diagnostic screening of EEG are implemented and evaluated on the NMT, and referenced against baseline performance on the well-known Temple University Hospital EEG Abnormal Corpus. Generalization of deep-learning-based architectures across the NMT and the reference datasets is also investigated. The NMT dataset is being released to increase the diversity of EEG datasets and to overcome the scarcity of accurately annotated publicly available datasets for EEG research.
format Online
Article
Text
id pubmed-8766964
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-8766964 2022-01-20 The NMT Scalp EEG Dataset: An Open-Source Annotated Dataset of Healthy and Pathological EEG Recordings for Predictive Modeling Khan, Hassan Aqeel Ul Ain, Rahat Kamboh, Awais Mehmood Butt, Hammad Tanveer Shafait, Saima Alamgir, Wasim Stricker, Didier Shafait, Faisal Front Neurosci Neuroscience Frontiers Media S.A. 2022-01-05 /pmc/articles/PMC8766964/ /pubmed/35069095 http://dx.doi.org/10.3389/fnins.2021.755817 Text en Copyright © 2022 Khan, Ul Ain, Kamboh, Butt, Shafait, Alamgir, Stricker and Shafait. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title The NMT Scalp EEG Dataset: An Open-Source Annotated Dataset of Healthy and Pathological EEG Recordings for Predictive Modeling
topic Neuroscience