Cargando…
A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features
Human lying is influenced by cognitive neural mechanisms in the brain, and conducting research on lie detection in speech can help to reveal the cognitive mechanisms of the human brain. Inappropriate deception detection features can easily lead to dimension disaster and make the generalization abili...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216231/ https://www.ncbi.nlm.nih.gov/pubmed/37239197 http://dx.doi.org/10.3390/brainsci13050725 |
_version_ | 1785048248486985728 |
---|---|
author | Fu, Hongliang Yu, Hang Wang, Xuemei Lu, Xiangying Zhu, Chunhua |
author_facet | Fu, Hongliang Yu, Hang Wang, Xuemei Lu, Xiangying Zhu, Chunhua |
author_sort | Fu, Hongliang |
collection | PubMed |
description | Human lying is influenced by cognitive neural mechanisms in the brain, and conducting research on lie detection in speech can help to reveal the cognitive mechanisms of the human brain. Inappropriate deception detection features can easily lead to dimension disaster and make the generalization ability of the widely used semi-supervised speech deception detection model worse. Because of this, this paper proposes a semi-supervised speech deception detection algorithm combining acoustic statistical features and time-frequency two-dimensional features. Firstly, a hybrid semi-supervised neural network based on a semi-supervised autoencoder network (AE) and a mean-teacher network is established. Secondly, the static artificial statistical features are input into the semi-supervised AE to extract more robust advanced features, and the three-dimensional (3D) mel-spectrum features are input into the mean-teacher network to obtain features rich in time-frequency two-dimensional information. Finally, a consistency regularization method is introduced after feature fusion, effectively reducing the occurrence of over-fitting and improving the generalization ability of the model. This paper carries out experiments on the self-built corpus for deception detection. The experimental results show that the highest recognition accuracy of the algorithm proposed in this paper is 68.62% which is 1.2% higher than the baseline system and effectively improves the detection accuracy. |
format | Online Article Text |
id | pubmed-10216231 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102162312023-05-27 A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features Fu, Hongliang Yu, Hang Wang, Xuemei Lu, Xiangying Zhu, Chunhua Brain Sci Article Human lying is influenced by cognitive neural mechanisms in the brain, and conducting research on lie detection in speech can help to reveal the cognitive mechanisms of the human brain. Inappropriate deception detection features can easily lead to dimension disaster and make the generalization ability of the widely used semi-supervised speech deception detection model worse. Because of this, this paper proposes a semi-supervised speech deception detection algorithm combining acoustic statistical features and time-frequency two-dimensional features. Firstly, a hybrid semi-supervised neural network based on a semi-supervised autoencoder network (AE) and a mean-teacher network is established. Secondly, the static artificial statistical features are input into the semi-supervised AE to extract more robust advanced features, and the three-dimensional (3D) mel-spectrum features are input into the mean-teacher network to obtain features rich in time-frequency two-dimensional information. Finally, a consistency regularization method is introduced after feature fusion, effectively reducing the occurrence of over-fitting and improving the generalization ability of the model. This paper carries out experiments on the self-built corpus for deception detection. The experimental results show that the highest recognition accuracy of the algorithm proposed in this paper is 68.62% which is 1.2% higher than the baseline system and effectively improves the detection accuracy. MDPI 2023-04-26 /pmc/articles/PMC10216231/ /pubmed/37239197 http://dx.doi.org/10.3390/brainsci13050725 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Fu, Hongliang Yu, Hang Wang, Xuemei Lu, Xiangying Zhu, Chunhua A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features |
title | A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features |
title_full | A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features |
title_fullStr | A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features |
title_full_unstemmed | A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features |
title_short | A Semi-Supervised Speech Deception Detection Algorithm Combining Acoustic Statistical Features and Time-Frequency Two-Dimensional Features |
title_sort | semi-supervised speech deception detection algorithm combining acoustic statistical features and time-frequency two-dimensional features |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10216231/ https://www.ncbi.nlm.nih.gov/pubmed/37239197 http://dx.doi.org/10.3390/brainsci13050725 |
work_keys_str_mv | AT fuhongliang asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures AT yuhang asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures AT wangxuemei asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures AT luxiangying asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures AT zhuchunhua asemisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures AT fuhongliang semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures AT yuhang semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures AT wangxuemei semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures AT luxiangying semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures AT zhuchunhua semisupervisedspeechdeceptiondetectionalgorithmcombiningacousticstatisticalfeaturesandtimefrequencytwodimensionalfeatures |