
A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features

Human lying is influenced by cognitive neural mechanisms in the brain, and research on lie detection in speech can help to reveal those mechanisms. Inappropriate deception detection features can easily lead to the curse of dimensionality and make the generalization abili...


Bibliographic Details
Main Authors:	Fu, Hongliang; Yu, Hang; Wang, Xuemei; Lu, Xiangying; Zhu, Chunhua
Format:	Online Article Text
Language:	English
Published:	MDPI 2023
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216231/ https://www.ncbi.nlm.nih.gov/pubmed/37239197 http://dx.doi.org/10.3390/brainsci13050725

author	Fu, Hongliang; Yu, Hang; Wang, Xuemei; Lu, Xiangying; Zhu, Chunhua
author_sort	Fu, Hongliang
collection	PubMed
description	Human lying is influenced by cognitive neural mechanisms in the brain, and research on lie detection in speech can help to reveal those mechanisms. Inappropriate deception detection features can easily lead to the curse of dimensionality and degrade the generalization ability of the widely used semi-supervised speech deception detection models. This paper therefore proposes a semi-supervised speech deception detection algorithm that combines acoustic statistical features with time-frequency two-dimensional features. First, a hybrid semi-supervised neural network is built from a semi-supervised autoencoder (AE) network and a mean-teacher network. Second, the static hand-crafted statistical features are fed into the semi-supervised AE to extract more robust high-level features, and the three-dimensional (3D) mel-spectrum features are fed into the mean-teacher network to obtain features rich in time-frequency two-dimensional information. Finally, a consistency regularization method is applied after feature fusion, which reduces over-fitting and improves the generalization ability of the model. Experiments are carried out on a self-built deception detection corpus. The results show that the proposed algorithm reaches a highest recognition accuracy of 68.62%, which is 1.2% higher than the baseline system.
format	Online Article Text
id	pubmed-10216231
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-10216231 2023-05-27 Brain Sci Article MDPI 2023-04-26 /pmc/articles/PMC10216231/ /pubmed/37239197 http://dx.doi.org/10.3390/brainsci13050725 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title	A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216231/ https://www.ncbi.nlm.nih.gov/pubmed/37239197 http://dx.doi.org/10.3390/brainsci13050725
