Cargando…

A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features

Human lying is influenced by cognitive neural mechanisms in the brain, and conducting research on lie detection in speech can help to reveal the cognitive mechanisms of the human brain. Inappropriate deception detection features can easily lead to dimension disaster and make the generalization abili...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Hongliang, Yu, Hang, Wang, Xuemei, Lu, Xiangying, Zhu, Chunhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216231/
https://www.ncbi.nlm.nih.gov/pubmed/37239197
http://dx.doi.org/10.3390/brainsci13050725
_version_ 1785048248486985728
author Fu, Hongliang
Yu, Hang
Wang, Xuemei
Lu, Xiangying
Zhu, Chunhua
author_facet Fu, Hongliang
Yu, Hang
Wang, Xuemei
Lu, Xiangying
Zhu, Chunhua
author_sort Fu, Hongliang
collection PubMed
description Human lying is influenced by cognitive neural mechanisms in the brain, and conducting research on lie detection in speech can help to reveal the cognitive mechanisms of the human brain. Inappropriate deception detection features can easily lead to dimension disaster and make the generalization ability of the widely used semi-supervised speech deception detection model worse. Because of this, this paper proposes a semi-supervised speech deception detection algorithm combining acoustic statistical features and time-frequency two-dimensional features. Firstly, a hybrid semi-supervised neural network based on a semi-supervised autoencoder network (AE) and a mean-teacher network is established. Secondly, the static artificial statistical features are input into the semi-supervised AE to extract more robust advanced features, and the three-dimensional (3D) mel-spectrum features are input into the mean-teacher network to obtain features rich in time-frequency two-dimensional information. Finally, a consistency regularization method is introduced after feature fusion, effectively reducing the occurrence of over-fitting and improving the generalization ability of the model. This paper carries out experiments on the self-built corpus for deception detection. The experimental results show that the highest recognition accuracy of the algorithm proposed in this paper is 68.62% which is 1.2% higher than the baseline system and effectively improves the detection accuracy.
format Online
Article
Text
id pubmed-10216231
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102162312023-05-27 A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features Fu, Hongliang Yu, Hang Wang, Xuemei Lu, Xiangying Zhu, Chunhua Brain Sci Article Human lying is influenced by cognitive neural mechanisms in the brain, and conducting research on lie detection in speech can help to reveal the cognitive mechanisms of the human brain. Inappropriate deception detection features can easily lead to dimension disaster and make the generalization ability of the widely used semi-supervised speech deception detection model worse. Because of this, this paper proposes a semi-supervised speech deception detection algorithm combining acoustic statistical features and time-frequency two-dimensional features. Firstly, a hybrid semi-supervised neural network based on a semi-supervised autoencoder network (AE) and a mean-teacher network is established. Secondly, the static artificial statistical features are input into the semi-supervised AE to extract more robust advanced features, and the three-dimensional (3D) mel-spectrum features are input into the mean-teacher network to obtain features rich in time-frequency two-dimensional information. Finally, a consistency regularization method is introduced after feature fusion, effectively reducing the occurrence of over-fitting and improving the generalization ability of the model. This paper carries out experiments on the self-built corpus for deception detection. The experimental results show that the highest recognition accuracy of the algorithm proposed in this paper is 68.62% which is 1.2% higher than the baseline system and effectively improves the detection accuracy. MDPI 2023-04-26 /pmc/articles/PMC10216231/ /pubmed/37239197 http://dx.doi.org/10.3390/brainsci13050725 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Fu, Hongliang
Yu, Hang
Wang, Xuemei
Lu, Xiangying
Zhu, Chunhua
A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features
title A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features
title_full A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features
title_fullStr A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features
title_full_unstemmed A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features
title_short A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features
title_sort semi-supervised speech deception detection algorithm combining acoustic statistical features and time-frequency two-dimensional features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216231/
https://www.ncbi.nlm.nih.gov/pubmed/37239197
http://dx.doi.org/10.3390/brainsci13050725
work_keys_str_mv AT fuhongliang asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures
AT yuhang asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures
AT wangxuemei asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures
AT luxiangying asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures
AT zhuchunhua asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures
AT fuhongliang semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures
AT yuhang semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures
AT wangxuemei semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures
AT luxiangying semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures
AT zhuchunhua semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures