Cargando…

A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: Analysis of Training Strategies and Uncovering of Network Decision

Convolutional neural networks (CNNs), which automatically learn features from raw data to approximate functions, are being increasingly applied to the end-to-end analysis of electroencephalographic (EEG) signals, especially for decoding brain states in brain-computer interfaces (BCIs). Nevertheless,...

Descripción completa

Detalles Bibliográficos
Autores principales: Borra, Davide, Fantozzi, Silvia, Magosso, Elisa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8295472/
https://www.ncbi.nlm.nih.gov/pubmed/34305550
http://dx.doi.org/10.3389/fnhum.2021.655840
_version_ 1783725435579793408
author Borra, Davide
Fantozzi, Silvia
Magosso, Elisa
author_facet Borra, Davide
Fantozzi, Silvia
Magosso, Elisa
author_sort Borra, Davide
collection PubMed
description Convolutional neural networks (CNNs), which automatically learn features from raw data to approximate functions, are being increasingly applied to the end-to-end analysis of electroencephalographic (EEG) signals, especially for decoding brain states in brain-computer interfaces (BCIs). Nevertheless, CNNs introduce a large number of trainable parameters, may require long training times, and lack in interpretability of learned features. The aim of this study is to propose a CNN design for P300 decoding with emphasis on its lightweight design while guaranteeing high performance, on the effects of different training strategies, and on the use of post-hoc techniques to explain network decisions. The proposed design, named MS-EEGNet, learned temporal features in two different timescales (i.e., multi-scale, MS) in an efficient and optimized (in terms of trainable parameters) way, and was validated on three P300 datasets. The CNN was trained using different strategies (within-participant and within-session, within-participant and cross-session, leave-one-subject-out, transfer learning) and was compared with several state-of-the-art (SOA) algorithms. Furthermore, variants of the baseline MS-EEGNet were analyzed to evaluate the impact of different hyper-parameters on performance. Lastly, saliency maps were used to derive representations of the relevant spatio-temporal features that drove CNN decisions. MS-EEGNet was the lightest CNN compared with the tested SOA CNNs, despite its multiple timescales, and significantly outperformed the SOA algorithms. Post-hoc hyper-parameter analysis confirmed the benefits of the innovative aspects of MS-EEGNet. Furthermore, MS-EEGNet did benefit from transfer learning, especially using a low number of training examples, suggesting that the proposed approach could be used in BCIs to accurately decode the P300 event while reducing calibration times. Representations derived from the saliency maps matched the P300 spatio-temporal distribution, further validating the proposed decoding approach. This study, by specifically addressing the aspects of lightweight design, transfer learning, and interpretability, can contribute to advance the development of deep learning algorithms for P300-based BCIs.
format Online
Article
Text
id pubmed-8295472
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82954722021-07-23 A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: Analysis of Training Strategies and Uncovering of Network Decision Borra, Davide Fantozzi, Silvia Magosso, Elisa Front Hum Neurosci Human Neuroscience Convolutional neural networks (CNNs), which automatically learn features from raw data to approximate functions, are being increasingly applied to the end-to-end analysis of electroencephalographic (EEG) signals, especially for decoding brain states in brain-computer interfaces (BCIs). Nevertheless, CNNs introduce a large number of trainable parameters, may require long training times, and lack in interpretability of learned features. The aim of this study is to propose a CNN design for P300 decoding with emphasis on its lightweight design while guaranteeing high performance, on the effects of different training strategies, and on the use of post-hoc techniques to explain network decisions. The proposed design, named MS-EEGNet, learned temporal features in two different timescales (i.e., multi-scale, MS) in an efficient and optimized (in terms of trainable parameters) way, and was validated on three P300 datasets. The CNN was trained using different strategies (within-participant and within-session, within-participant and cross-session, leave-one-subject-out, transfer learning) and was compared with several state-of-the-art (SOA) algorithms. Furthermore, variants of the baseline MS-EEGNet were analyzed to evaluate the impact of different hyper-parameters on performance. Lastly, saliency maps were used to derive representations of the relevant spatio-temporal features that drove CNN decisions. MS-EEGNet was the lightest CNN compared with the tested SOA CNNs, despite its multiple timescales, and significantly outperformed the SOA algorithms. Post-hoc hyper-parameter analysis confirmed the benefits of the innovative aspects of MS-EEGNet. Furthermore, MS-EEGNet did benefit from transfer learning, especially using a low number of training examples, suggesting that the proposed approach could be used in BCIs to accurately decode the P300 event while reducing calibration times. Representations derived from the saliency maps matched the P300 spatio-temporal distribution, further validating the proposed decoding approach. This study, by specifically addressing the aspects of lightweight design, transfer learning, and interpretability, can contribute to advance the development of deep learning algorithms for P300-based BCIs. Frontiers Media S.A. 2021-07-08 /pmc/articles/PMC8295472/ /pubmed/34305550 http://dx.doi.org/10.3389/fnhum.2021.655840 Text en Copyright © 2021 Borra, Fantozzi and Magosso. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Human Neuroscience
Borra, Davide
Fantozzi, Silvia
Magosso, Elisa
A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: Analysis of Training Strategies and Uncovering of Network Decision
title A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: Analysis of Training Strategies and Uncovering of Network Decision
title_full A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: Analysis of Training Strategies and Uncovering of Network Decision
title_fullStr A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: Analysis of Training Strategies and Uncovering of Network Decision
title_full_unstemmed A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: Analysis of Training Strategies and Uncovering of Network Decision
title_short A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: Analysis of Training Strategies and Uncovering of Network Decision
title_sort lightweight multi-scale convolutional neural network for p300 decoding: analysis of training strategies and uncovering of network decision
topic Human Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8295472/
https://www.ncbi.nlm.nih.gov/pubmed/34305550
http://dx.doi.org/10.3389/fnhum.2021.655840
work_keys_str_mv AT borradavide alightweightmultiscaleconvolutionalneuralnetworkforp300decodinganalysisoftrainingstrategiesanduncoveringofnetworkdecision
AT fantozzisilvia alightweightmultiscaleconvolutionalneuralnetworkforp300decodinganalysisoftrainingstrategiesanduncoveringofnetworkdecision
AT magossoelisa alightweightmultiscaleconvolutionalneuralnetworkforp300decodinganalysisoftrainingstrategiesanduncoveringofnetworkdecision
AT borradavide lightweightmultiscaleconvolutionalneuralnetworkforp300decodinganalysisoftrainingstrategiesanduncoveringofnetworkdecision
AT fantozzisilvia lightweightmultiscaleconvolutionalneuralnetworkforp300decodinganalysisoftrainingstrategiesanduncoveringofnetworkdecision
AT magossoelisa lightweightmultiscaleconvolutionalneuralnetworkforp300decodinganalysisoftrainingstrategiesanduncoveringofnetworkdecision