Cargando…

Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech

Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer’s Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding stra...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jiamin, Fu, Fan, Li, Liang, Yu, Junxiao, Zhong, Dacheng, Zhu, Songsheng, Zhou, Yuxuan, Liu, Bin, Li, Jianqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046767/
https://www.ncbi.nlm.nih.gov/pubmed/36979287
http://dx.doi.org/10.3390/brainsci13030477
_version_ 1785013755462025216
author Liu, Jiamin
Fu, Fan
Li, Liang
Yu, Junxiao
Zhong, Dacheng
Zhu, Songsheng
Zhou, Yuxuan
Liu, Bin
Li, Jianqing
author_facet Liu, Jiamin
Fu, Fan
Li, Liang
Yu, Junxiao
Zhong, Dacheng
Zhu, Songsheng
Zhou, Yuxuan
Liu, Bin
Li, Jianqing
author_sort Liu, Jiamin
collection PubMed
description Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer’s Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding strategy for only acoustic-signal-based AD detection. First, a voice activity detection (VAD) method was constructed to detect pause/non-pause feature and encode it to binary pause sequences that are easier to calculate. Then, an ensemble machine-learning-based approach was proposed for the classification of AD from the participants’ spontaneous speech, based on the VAD Pause feature sequence and common acoustic feature sets (ComParE and eGeMAPS). The proposed pause feature sequence was verified in five machine-learning models. The validation data included two public challenge datasets (ADReSS and ADReSSo, English voice) and a local dataset (10 audio recordings containing five patients and five controls, Chinese voice). Results showed that the VAD Pause feature was more effective than common feature sets (ComParE: 6373 features and eGeMAPS: 88 features) for AD classification, and that the ensemble method improved the accuracy by more than 5% compared to several baseline methods (8% on the ADReSS dataset; 5.9% on the ADReSSo dataset). Moreover, the pause-sequence-based AD detection method could achieve 80% accuracy on the local dataset. Our study further demonstrated the potential of pause information in speech-based AD detection, and also contributed to a more accessible and general pause feature extraction and encoding method for AD detection.
format Online
Article
Text
id pubmed-10046767
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100467672023-03-29 Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech Liu, Jiamin Fu, Fan Li, Liang Yu, Junxiao Zhong, Dacheng Zhu, Songsheng Zhou, Yuxuan Liu, Bin Li, Jianqing Brain Sci Article Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer’s Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding strategy for only acoustic-signal-based AD detection. First, a voice activity detection (VAD) method was constructed to detect pause/non-pause feature and encode it to binary pause sequences that are easier to calculate. Then, an ensemble machine-learning-based approach was proposed for the classification of AD from the participants’ spontaneous speech, based on the VAD Pause feature sequence and common acoustic feature sets (ComParE and eGeMAPS). The proposed pause feature sequence was verified in five machine-learning models. The validation data included two public challenge datasets (ADReSS and ADReSSo, English voice) and a local dataset (10 audio recordings containing five patients and five controls, Chinese voice). Results showed that the VAD Pause feature was more effective than common feature sets (ComParE: 6373 features and eGeMAPS: 88 features) for AD classification, and that the ensemble method improved the accuracy by more than 5% compared to several baseline methods (8% on the ADReSS dataset; 5.9% on the ADReSSo dataset). Moreover, the pause-sequence-based AD detection method could achieve 80% accuracy on the local dataset. Our study further demonstrated the potential of pause information in speech-based AD detection, and also contributed to a more accessible and general pause feature extraction and encoding method for AD detection. MDPI 2023-03-11 /pmc/articles/PMC10046767/ /pubmed/36979287 http://dx.doi.org/10.3390/brainsci13030477 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Jiamin
Fu, Fan
Li, Liang
Yu, Junxiao
Zhong, Dacheng
Zhu, Songsheng
Zhou, Yuxuan
Liu, Bin
Li, Jianqing
Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_full Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_fullStr Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_full_unstemmed Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_short Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
title_sort efficient pause extraction and encode strategy for alzheimer’s disease detection using only acoustic features from spontaneous speech
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046767/
https://www.ncbi.nlm.nih.gov/pubmed/36979287
http://dx.doi.org/10.3390/brainsci13030477
work_keys_str_mv AT liujiamin efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech
AT fufan efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech
AT liliang efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech
AT yujunxiao efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech
AT zhongdacheng efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech
AT zhusongsheng efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech
AT zhouyuxuan efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech
AT liubin efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech
AT lijianqing efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech