Cargando…

Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features

In recent years, the use of Artificial Intelligence for emotion recognition has attracted much attention. The industrial applicability of emotion recognition is quite comprehensive and has good development potential. This research uses voice emotion recognition technology to apply it to Chinese spee...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Ming-Che, Yeh, Sheng-Cheng, Chang, Jia-Wei, Chen, Zhen-Yi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269147/ https://www.ncbi.nlm.nih.gov/pubmed/35808238 http://dx.doi.org/10.3390/s22134744

_version_	1784744161881096192
author	Lee, Ming-Che Yeh, Sheng-Cheng Chang, Jia-Wei Chen, Zhen-Yi
author_facet	Lee, Ming-Che Yeh, Sheng-Cheng Chang, Jia-Wei Chen, Zhen-Yi
author_sort	Lee, Ming-Che
collection	PubMed
description	In recent years, the use of Artificial Intelligence for emotion recognition has attracted much attention. The industrial applicability of emotion recognition is quite comprehensive and has good development potential. This research uses voice emotion recognition technology to apply it to Chinese speech emotion recognition. The main purpose of this research is to transform gradually popularized smart home voice assistants or AI system service robots from a touch-sensitive interface to a voice operation. This research proposed a specifically designed Deep Neural Network (DNN) model to develop a Chinese speech emotion recognition system. In this research, 29 acoustic characteristics in acoustic theory are used as the training attributes of the proposed model. This research also proposes a variety of audio adjustment methods to amplify datasets and enhance training accuracy, including waveform adjustment, pitch adjustment, and pre-emphasize. This study achieved an average emotion recognition accuracy of 88.9% in the CASIA Chinese sentiment corpus. The results show that the deep learning model and audio adjustment method proposed in this study can effectively identify the emotions of Chinese short sentences and can be applied to Chinese voice assistants or integrated with other dialogue applications.
format	Online Article Text
id	pubmed-9269147
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-92691472022-07-09 Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features Lee, Ming-Che Yeh, Sheng-Cheng Chang, Jia-Wei Chen, Zhen-Yi Sensors (Basel) Article In recent years, the use of Artificial Intelligence for emotion recognition has attracted much attention. The industrial applicability of emotion recognition is quite comprehensive and has good development potential. This research uses voice emotion recognition technology to apply it to Chinese speech emotion recognition. The main purpose of this research is to transform gradually popularized smart home voice assistants or AI system service robots from a touch-sensitive interface to a voice operation. This research proposed a specifically designed Deep Neural Network (DNN) model to develop a Chinese speech emotion recognition system. In this research, 29 acoustic characteristics in acoustic theory are used as the training attributes of the proposed model. This research also proposes a variety of audio adjustment methods to amplify datasets and enhance training accuracy, including waveform adjustment, pitch adjustment, and pre-emphasize. This study achieved an average emotion recognition accuracy of 88.9% in the CASIA Chinese sentiment corpus. The results show that the deep learning model and audio adjustment method proposed in this study can effectively identify the emotions of Chinese short sentences and can be applied to Chinese voice assistants or integrated with other dialogue applications. MDPI 2022-06-23 /pmc/articles/PMC9269147/ /pubmed/35808238 http://dx.doi.org/10.3390/s22134744 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Lee, Ming-Che Yeh, Sheng-Cheng Chang, Jia-Wei Chen, Zhen-Yi Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features
title	Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features
title_full	Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features
title_fullStr	Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features
title_full_unstemmed	Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features
title_short	Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features
title_sort	research on chinese speech emotion recognition based on deep neural network and acoustic features
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269147/ https://www.ncbi.nlm.nih.gov/pubmed/35808238 http://dx.doi.org/10.3390/s22134744
work_keys_str_mv	AT leemingche researchonchinesespeechemotionrecognitionbasedondeepneuralnetworkandacousticfeatures AT yehshengcheng researchonchinesespeechemotionrecognitionbasedondeepneuralnetworkandacousticfeatures AT changjiawei researchonchinesespeechemotionrecognitionbasedondeepneuralnetworkandacousticfeatures AT chenzhenyi researchonchinesespeechemotionrecognitionbasedondeepneuralnetworkandacousticfeatures

Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features

Ejemplares similares