Cargando…

Vowel speech recognition from rat electroencephalography using long short-term memory neural network

Over the years, considerable research has been conducted to investigate the mechanisms of speech perception and recognition. Electroencephalography (EEG) is a powerful tool for identifying brain activity; therefore, it has been widely used to determine the neural basis of speech recognition. In part...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ham, Jinsil, Yoo, Hyun-Joon, Kim, Jongin, Lee, Boreom
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9223328/ https://www.ncbi.nlm.nih.gov/pubmed/35737731 http://dx.doi.org/10.1371/journal.pone.0270405

_version_	1784733099504959488
author	Ham, Jinsil Yoo, Hyun-Joon Kim, Jongin Lee, Boreom
author_facet	Ham, Jinsil Yoo, Hyun-Joon Kim, Jongin Lee, Boreom
author_sort	Ham, Jinsil
collection	PubMed
description	Over the years, considerable research has been conducted to investigate the mechanisms of speech perception and recognition. Electroencephalography (EEG) is a powerful tool for identifying brain activity; therefore, it has been widely used to determine the neural basis of speech recognition. In particular, for the classification of speech recognition, deep learning-based approaches are in the spotlight because they can automatically learn and extract representative features through end-to-end learning. This study aimed to identify particular components that are potentially related to phoneme representation in the rat brain and to discriminate brain activity for each vowel stimulus on a single-trial basis using a bidirectional long short-term memory (BiLSTM) network and classical machine learning methods. Nineteen male Sprague-Dawley rats subjected to microelectrode implantation surgery to record EEG signals from the bilateral anterior auditory fields were used. Five different vowel speech stimuli were chosen, /a/, /e/, /i/, /o/, and /u/, which have highly different formant frequencies. EEG recorded under randomly given vowel stimuli was minimally preprocessed and normalized by a z-score transformation to be used as input for the classification of speech recognition. The BiLSTM network showed the best performance among the classifiers by achieving an overall accuracy, f1-score, and Cohen’s κ values of 75.18%, 0.75, and 0.68, respectively, using a 10-fold cross-validation approach. These results indicate that LSTM layers can effectively model sequential data, such as EEG; hence, informative features can be derived through BiLSTM trained with end-to-end learning without any additional hand-crafted feature extraction methods.
format	Online Article Text
id	pubmed-9223328
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-92233282022-06-24 Vowel speech recognition from rat electroencephalography using long short-term memory neural network Ham, Jinsil Yoo, Hyun-Joon Kim, Jongin Lee, Boreom PLoS One Research Article Over the years, considerable research has been conducted to investigate the mechanisms of speech perception and recognition. Electroencephalography (EEG) is a powerful tool for identifying brain activity; therefore, it has been widely used to determine the neural basis of speech recognition. In particular, for the classification of speech recognition, deep learning-based approaches are in the spotlight because they can automatically learn and extract representative features through end-to-end learning. This study aimed to identify particular components that are potentially related to phoneme representation in the rat brain and to discriminate brain activity for each vowel stimulus on a single-trial basis using a bidirectional long short-term memory (BiLSTM) network and classical machine learning methods. Nineteen male Sprague-Dawley rats subjected to microelectrode implantation surgery to record EEG signals from the bilateral anterior auditory fields were used. Five different vowel speech stimuli were chosen, /a/, /e/, /i/, /o/, and /u/, which have highly different formant frequencies. EEG recorded under randomly given vowel stimuli was minimally preprocessed and normalized by a z-score transformation to be used as input for the classification of speech recognition. The BiLSTM network showed the best performance among the classifiers by achieving an overall accuracy, f1-score, and Cohen’s κ values of 75.18%, 0.75, and 0.68, respectively, using a 10-fold cross-validation approach. These results indicate that LSTM layers can effectively model sequential data, such as EEG; hence, informative features can be derived through BiLSTM trained with end-to-end learning without any additional hand-crafted feature extraction methods. Public Library of Science 2022-06-23 /pmc/articles/PMC9223328/ /pubmed/35737731 http://dx.doi.org/10.1371/journal.pone.0270405 Text en © 2022 Ham et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Ham, Jinsil Yoo, Hyun-Joon Kim, Jongin Lee, Boreom Vowel speech recognition from rat electroencephalography using long short-term memory neural network
title	Vowel speech recognition from rat electroencephalography using long short-term memory neural network
title_full	Vowel speech recognition from rat electroencephalography using long short-term memory neural network
title_fullStr	Vowel speech recognition from rat electroencephalography using long short-term memory neural network
title_full_unstemmed	Vowel speech recognition from rat electroencephalography using long short-term memory neural network
title_short	Vowel speech recognition from rat electroencephalography using long short-term memory neural network
title_sort	vowel speech recognition from rat electroencephalography using long short-term memory neural network
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9223328/ https://www.ncbi.nlm.nih.gov/pubmed/35737731 http://dx.doi.org/10.1371/journal.pone.0270405
work_keys_str_mv	AT hamjinsil vowelspeechrecognitionfromratelectroencephalographyusinglongshorttermmemoryneuralnetwork AT yoohyunjoon vowelspeechrecognitionfromratelectroencephalographyusinglongshorttermmemoryneuralnetwork AT kimjongin vowelspeechrecognitionfromratelectroencephalographyusinglongshorttermmemoryneuralnetwork AT leeboreom vowelspeechrecognitionfromratelectroencephalographyusinglongshorttermmemoryneuralnetwork

Vowel speech recognition from rat electroencephalography using long short-term memory neural network

Ejemplares similares