Cargando…

A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network

Recent studies have revealed mutually correlated audio features in the voices of depressed patients. Thus, the voices of these patients can be characterized based on the combinatorial relationships among the audio features. To date, many deep learning–based methods have been proposed to predict the...

Descripción completa

Detalles Bibliográficos
Autores principales: Ishimaru, Momoko, Okada, Yoshifumi, Uchiyama, Ryunosuke, Horiguchi, Ryo, Toyoshima, Itsuki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955540/
https://www.ncbi.nlm.nih.gov/pubmed/36832211
http://dx.doi.org/10.3390/diagnostics13040727
_version_ 1784894371710107648
author Ishimaru, Momoko
Okada, Yoshifumi
Uchiyama, Ryunosuke
Horiguchi, Ryo
Toyoshima, Itsuki
author_facet Ishimaru, Momoko
Okada, Yoshifumi
Uchiyama, Ryunosuke
Horiguchi, Ryo
Toyoshima, Itsuki
author_sort Ishimaru, Momoko
collection PubMed
description Recent studies have revealed mutually correlated audio features in the voices of depressed patients. Thus, the voices of these patients can be characterized based on the combinatorial relationships among the audio features. To date, many deep learning–based methods have been proposed to predict the depression severity using audio data. However, existing methods have assumed that the individual audio features are independent. Hence, in this paper, we propose a new deep learning–based regression model that allows for the prediction of depression severity on the basis of the correlation among audio features. The proposed model was developed using a graph convolutional neural network. This model trains the voice characteristics using graph-structured data generated to express the correlation among audio features. We conducted prediction experiments on depression severity using the DAIC-WOZ dataset employed in several previous studies. The experimental results showed that the proposed model achieved a root mean square error (RMSE) of 2.15, a mean absolute error (MAE) of 1.25, and a symmetric mean absolute percentage error of 50.96%. Notably, RMSE and MAE significantly outperformed the existing state-of-the-art prediction methods. From these results, we conclude that the proposed model can be a promising tool for depression diagnosis.
format Online
Article
Text
id pubmed-9955540
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99555402023-02-25 A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network Ishimaru, Momoko Okada, Yoshifumi Uchiyama, Ryunosuke Horiguchi, Ryo Toyoshima, Itsuki Diagnostics (Basel) Brief Report Recent studies have revealed mutually correlated audio features in the voices of depressed patients. Thus, the voices of these patients can be characterized based on the combinatorial relationships among the audio features. To date, many deep learning–based methods have been proposed to predict the depression severity using audio data. However, existing methods have assumed that the individual audio features are independent. Hence, in this paper, we propose a new deep learning–based regression model that allows for the prediction of depression severity on the basis of the correlation among audio features. The proposed model was developed using a graph convolutional neural network. This model trains the voice characteristics using graph-structured data generated to express the correlation among audio features. We conducted prediction experiments on depression severity using the DAIC-WOZ dataset employed in several previous studies. The experimental results showed that the proposed model achieved a root mean square error (RMSE) of 2.15, a mean absolute error (MAE) of 1.25, and a symmetric mean absolute percentage error of 50.96%. Notably, RMSE and MAE significantly outperformed the existing state-of-the-art prediction methods. From these results, we conclude that the proposed model can be a promising tool for depression diagnosis. MDPI 2023-02-14 /pmc/articles/PMC9955540/ /pubmed/36832211 http://dx.doi.org/10.3390/diagnostics13040727 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Brief Report
Ishimaru, Momoko
Okada, Yoshifumi
Uchiyama, Ryunosuke
Horiguchi, Ryo
Toyoshima, Itsuki
A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_full A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_fullStr A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_full_unstemmed A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_short A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_sort new regression model for depression severity prediction based on correlation among audio features using a graph convolutional neural network
topic Brief Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955540/
https://www.ncbi.nlm.nih.gov/pubmed/36832211
http://dx.doi.org/10.3390/diagnostics13040727
work_keys_str_mv AT ishimarumomoko anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT okadayoshifumi anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT uchiyamaryunosuke anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT horiguchiryo anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT toyoshimaitsuki anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT ishimarumomoko newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT okadayoshifumi newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT uchiyamaryunosuke newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT horiguchiryo newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT toyoshimaitsuki newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork