Cargando…

Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech

Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer’s Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding stra...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Jiamin, Fu, Fan, Li, Liang, Yu, Junxiao, Zhong, Dacheng, Zhu, Songsheng, Zhou, Yuxuan, Liu, Bin, Li, Jianqing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046767/ https://www.ncbi.nlm.nih.gov/pubmed/36979287 http://dx.doi.org/10.3390/brainsci13030477

_version_	1785013755462025216
author	Liu, Jiamin Fu, Fan Li, Liang Yu, Junxiao Zhong, Dacheng Zhu, Songsheng Zhou, Yuxuan Liu, Bin Li, Jianqing
author_facet	Liu, Jiamin Fu, Fan Li, Liang Yu, Junxiao Zhong, Dacheng Zhu, Songsheng Zhou, Yuxuan Liu, Bin Li, Jianqing
author_sort	Liu, Jiamin
collection	PubMed
description	Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer’s Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding strategy for only acoustic-signal-based AD detection. First, a voice activity detection (VAD) method was constructed to detect pause/non-pause feature and encode it to binary pause sequences that are easier to calculate. Then, an ensemble machine-learning-based approach was proposed for the classification of AD from the participants’ spontaneous speech, based on the VAD Pause feature sequence and common acoustic feature sets (ComParE and eGeMAPS). The proposed pause feature sequence was verified in five machine-learning models. The validation data included two public challenge datasets (ADReSS and ADReSSo, English voice) and a local dataset (10 audio recordings containing five patients and five controls, Chinese voice). Results showed that the VAD Pause feature was more effective than common feature sets (ComParE: 6373 features and eGeMAPS: 88 features) for AD classification, and that the ensemble method improved the accuracy by more than 5% compared to several baseline methods (8% on the ADReSS dataset; 5.9% on the ADReSSo dataset). Moreover, the pause-sequence-based AD detection method could achieve 80% accuracy on the local dataset. Our study further demonstrated the potential of pause information in speech-based AD detection, and also contributed to a more accessible and general pause feature extraction and encoding method for AD detection.
format	Online Article Text
id	pubmed-10046767
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-100467672023-03-29 Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech Liu, Jiamin Fu, Fan Li, Liang Yu, Junxiao Zhong, Dacheng Zhu, Songsheng Zhou, Yuxuan Liu, Bin Li, Jianqing Brain Sci Article Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer’s Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding strategy for only acoustic-signal-based AD detection. First, a voice activity detection (VAD) method was constructed to detect pause/non-pause feature and encode it to binary pause sequences that are easier to calculate. Then, an ensemble machine-learning-based approach was proposed for the classification of AD from the participants’ spontaneous speech, based on the VAD Pause feature sequence and common acoustic feature sets (ComParE and eGeMAPS). The proposed pause feature sequence was verified in five machine-learning models. The validation data included two public challenge datasets (ADReSS and ADReSSo, English voice) and a local dataset (10 audio recordings containing five patients and five controls, Chinese voice). Results showed that the VAD Pause feature was more effective than common feature sets (ComParE: 6373 features and eGeMAPS: 88 features) for AD classification, and that the ensemble method improved the accuracy by more than 5% compared to several baseline methods (8% on the ADReSS dataset; 5.9% on the ADReSSo dataset). Moreover, the pause-sequence-based AD detection method could achieve 80% accuracy on the local dataset. Our study further demonstrated the potential of pause information in speech-based AD detection, and also contributed to a more accessible and general pause feature extraction and encoding method for AD detection. MDPI 2023-03-11 /pmc/articles/PMC10046767/ /pubmed/36979287 http://dx.doi.org/10.3390/brainsci13030477 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Liu, Jiamin Fu, Fan Li, Liang Yu, Junxiao Zhong, Dacheng Zhu, Songsheng Zhou, Yuxuan Liu, Bin Li, Jianqing Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title	Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_full	Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_fullStr	Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_full_unstemmed	Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_short	Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_sort	efficient pause extraction and encode strategy for alzheimer’s disease detection using only acoustic features from spontaneous speech
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046767/ https://www.ncbi.nlm.nih.gov/pubmed/36979287 http://dx.doi.org/10.3390/brainsci13030477
work_keys_str_mv	AT liujiamin efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT fufan efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT liliang efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT yujunxiao efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT zhongdacheng efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT zhusongsheng efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT zhouyuxuan efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT liubin efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT lijianqing efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech

Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech

Ejemplares similares