Cargando…

Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach

BACKGROUND: Automatic diagnosis of depression based on speech can complement mental health treatment methods in the future. Previous studies have reported that acoustic properties can be used to identify depression. However, few studies have attempted a large-scale differential diagnosis of patients...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kim, Ah Young, Jang, Eun Hye, Lee, Seung-Hwan, Choi, Kwang-Yeon, Park, Jeon Gue, Shin, Hyun-Chool
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2023
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9909514/ https://www.ncbi.nlm.nih.gov/pubmed/36696160 http://dx.doi.org/10.2196/34474

_version_	1784884592032874496
author	Kim, Ah Young Jang, Eun Hye Lee, Seung-Hwan Choi, Kwang-Yeon Park, Jeon Gue Shin, Hyun-Chool
author_facet	Kim, Ah Young Jang, Eun Hye Lee, Seung-Hwan Choi, Kwang-Yeon Park, Jeon Gue Shin, Hyun-Chool
author_sort	Kim, Ah Young
collection	PubMed
description	BACKGROUND: Automatic diagnosis of depression based on speech can complement mental health treatment methods in the future. Previous studies have reported that acoustic properties can be used to identify depression. However, few studies have attempted a large-scale differential diagnosis of patients with depressive disorders using acoustic characteristics of non-English speakers. OBJECTIVE: This study proposes a framework for automatic depression detection using large-scale acoustic characteristics based on the Korean language. METHODS: We recruited 153 patients who met the criteria for major depressive disorder and 165 healthy controls without current or past mental illness. Participants' voices were recorded on a smartphone while performing the task of reading predefined text-based sentences. Three approaches were evaluated and compared to detect depression using data sets with text-dependent read speech tasks: conventional machine learning models based on acoustic features, a proposed model that trains and classifies log-Mel spectrograms by applying a deep convolutional neural network (CNN) with a relatively small number of parameters, and models that train and classify log-Mel spectrograms by applying well-known pretrained networks. RESULTS: The acoustic characteristics of the predefined text-based sentence reading automatically detected depression using the proposed CNN model. The highest accuracy achieved with the proposed CNN on the speech data was 78.14%. Our results show that the deep-learned acoustic characteristics lead to better performance than those obtained using the conventional approach and pretrained models. CONCLUSIONS: Checking the mood of patients with major depressive disorder and detecting the consistency of objective descriptions are very important research topics. This study suggests that the analysis of speech data recorded while reading text-dependent sentences could help predict depression status automatically by capturing the characteristics of depression. Our method is smartphone based, is easily accessible, and can contribute to the automatic identification of depressive states.
format	Online Article Text
id	pubmed-9909514
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-99095142023-02-10 Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach Kim, Ah Young Jang, Eun Hye Lee, Seung-Hwan Choi, Kwang-Yeon Park, Jeon Gue Shin, Hyun-Chool J Med Internet Res Original Paper BACKGROUND: Automatic diagnosis of depression based on speech can complement mental health treatment methods in the future. Previous studies have reported that acoustic properties can be used to identify depression. However, few studies have attempted a large-scale differential diagnosis of patients with depressive disorders using acoustic characteristics of non-English speakers. OBJECTIVE: This study proposes a framework for automatic depression detection using large-scale acoustic characteristics based on the Korean language. METHODS: We recruited 153 patients who met the criteria for major depressive disorder and 165 healthy controls without current or past mental illness. Participants' voices were recorded on a smartphone while performing the task of reading predefined text-based sentences. Three approaches were evaluated and compared to detect depression using data sets with text-dependent read speech tasks: conventional machine learning models based on acoustic features, a proposed model that trains and classifies log-Mel spectrograms by applying a deep convolutional neural network (CNN) with a relatively small number of parameters, and models that train and classify log-Mel spectrograms by applying well-known pretrained networks. RESULTS: The acoustic characteristics of the predefined text-based sentence reading automatically detected depression using the proposed CNN model. The highest accuracy achieved with the proposed CNN on the speech data was 78.14%. Our results show that the deep-learned acoustic characteristics lead to better performance than those obtained using the conventional approach and pretrained models. CONCLUSIONS: Checking the mood of patients with major depressive disorder and detecting the consistency of objective descriptions are very important research topics. This study suggests that the analysis of speech data recorded while reading text-dependent sentences could help predict depression status automatically by capturing the characteristics of depression. Our method is smartphone based, is easily accessible, and can contribute to the automatic identification of depressive states. JMIR Publications 2023-01-25 /pmc/articles/PMC9909514/ /pubmed/36696160 http://dx.doi.org/10.2196/34474 Text en ©Ah Young Kim, Eun Hye Jang, Seung-Hwan Lee, Kwang-Yeon Choi, Jeon Gue Park, Hyun-Chool Shin. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 25.01.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Kim, Ah Young Jang, Eun Hye Lee, Seung-Hwan Choi, Kwang-Yeon Park, Jeon Gue Shin, Hyun-Chool Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach
title	Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach
title_full	Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach
title_fullStr	Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach
title_full_unstemmed	Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach
title_short	Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach
title_sort	automatic depression detection using smartphone-based text-dependent speech signals: deep convolutional neural network approach
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9909514/ https://www.ncbi.nlm.nih.gov/pubmed/36696160 http://dx.doi.org/10.2196/34474
work_keys_str_mv	AT kimahyoung automaticdepressiondetectionusingsmartphonebasedtextdependentspeechsignalsdeepconvolutionalneuralnetworkapproach AT jangeunhye automaticdepressiondetectionusingsmartphonebasedtextdependentspeechsignalsdeepconvolutionalneuralnetworkapproach AT leeseunghwan automaticdepressiondetectionusingsmartphonebasedtextdependentspeechsignalsdeepconvolutionalneuralnetworkapproach AT choikwangyeon automaticdepressiondetectionusingsmartphonebasedtextdependentspeechsignalsdeepconvolutionalneuralnetworkapproach AT parkjeongue automaticdepressiondetectionusingsmartphonebasedtextdependentspeechsignalsdeepconvolutionalneuralnetworkapproach AT shinhyunchool automaticdepressiondetectionusingsmartphonebasedtextdependentspeechsignalsdeepconvolutionalneuralnetworkapproach

Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach

Ejemplares similares