Cargando…
Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech
Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer’s Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding stra...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046767/ https://www.ncbi.nlm.nih.gov/pubmed/36979287 http://dx.doi.org/10.3390/brainsci13030477 |
_version_ | 1785013755462025216 |
---|---|
author | Liu, Jiamin Fu, Fan Li, Liang Yu, Junxiao Zhong, Dacheng Zhu, Songsheng Zhou, Yuxuan Liu, Bin Li, Jianqing |
author_facet | Liu, Jiamin Fu, Fan Li, Liang Yu, Junxiao Zhong, Dacheng Zhu, Songsheng Zhou, Yuxuan Liu, Bin Li, Jianqing |
author_sort | Liu, Jiamin |
collection | PubMed |
description | Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer’s Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding strategy for only acoustic-signal-based AD detection. First, a voice activity detection (VAD) method was constructed to detect pause/non-pause feature and encode it to binary pause sequences that are easier to calculate. Then, an ensemble machine-learning-based approach was proposed for the classification of AD from the participants’ spontaneous speech, based on the VAD Pause feature sequence and common acoustic feature sets (ComParE and eGeMAPS). The proposed pause feature sequence was verified in five machine-learning models. The validation data included two public challenge datasets (ADReSS and ADReSSo, English voice) and a local dataset (10 audio recordings containing five patients and five controls, Chinese voice). Results showed that the VAD Pause feature was more effective than common feature sets (ComParE: 6373 features and eGeMAPS: 88 features) for AD classification, and that the ensemble method improved the accuracy by more than 5% compared to several baseline methods (8% on the ADReSS dataset; 5.9% on the ADReSSo dataset). Moreover, the pause-sequence-based AD detection method could achieve 80% accuracy on the local dataset. Our study further demonstrated the potential of pause information in speech-based AD detection, and also contributed to a more accessible and general pause feature extraction and encoding method for AD detection. |
format | Online Article Text |
id | pubmed-10046767 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100467672023-03-29 Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech Liu, Jiamin Fu, Fan Li, Liang Yu, Junxiao Zhong, Dacheng Zhu, Songsheng Zhou, Yuxuan Liu, Bin Li, Jianqing Brain Sci Article Clinical studies have shown that speech pauses can reflect the cognitive function differences between Alzheimer’s Disease (AD) and non-AD patients, while the value of pause information in AD detection has not been fully explored. Herein, we propose a speech pause feature extraction and encoding strategy for only acoustic-signal-based AD detection. First, a voice activity detection (VAD) method was constructed to detect pause/non-pause feature and encode it to binary pause sequences that are easier to calculate. Then, an ensemble machine-learning-based approach was proposed for the classification of AD from the participants’ spontaneous speech, based on the VAD Pause feature sequence and common acoustic feature sets (ComParE and eGeMAPS). The proposed pause feature sequence was verified in five machine-learning models. The validation data included two public challenge datasets (ADReSS and ADReSSo, English voice) and a local dataset (10 audio recordings containing five patients and five controls, Chinese voice). Results showed that the VAD Pause feature was more effective than common feature sets (ComParE: 6373 features and eGeMAPS: 88 features) for AD classification, and that the ensemble method improved the accuracy by more than 5% compared to several baseline methods (8% on the ADReSS dataset; 5.9% on the ADReSSo dataset). Moreover, the pause-sequence-based AD detection method could achieve 80% accuracy on the local dataset. Our study further demonstrated the potential of pause information in speech-based AD detection, and also contributed to a more accessible and general pause feature extraction and encoding method for AD detection. MDPI 2023-03-11 /pmc/articles/PMC10046767/ /pubmed/36979287 http://dx.doi.org/10.3390/brainsci13030477 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Liu, Jiamin Fu, Fan Li, Liang Yu, Junxiao Zhong, Dacheng Zhu, Songsheng Zhou, Yuxuan Liu, Bin Li, Jianqing Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech |
title | Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech |
title_full | Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech |
title_fullStr | Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech |
title_full_unstemmed | Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech |
title_short | Efficient Pause Extraction and Encode Strategy for Alzheimer’s Disease Detection Using Only Acoustic Features from Spontaneous Speech |
title_sort | efficient pause extraction and encode strategy for alzheimer’s disease detection using only acoustic features from spontaneous speech |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046767/ https://www.ncbi.nlm.nih.gov/pubmed/36979287 http://dx.doi.org/10.3390/brainsci13030477 |
work_keys_str_mv | AT liujiamin efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT fufan efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT liliang efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT yujunxiao efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT zhongdacheng efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT zhusongsheng efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT zhouyuxuan efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT liubin efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech AT lijianqing efficientpauseextractionandencodestrategyforalzheimersdiseasedetectionusingonlyacousticfeaturesfromspontaneousspeech |