Cargando…

Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks

This paper proposes a speech-based method for automatic depression classification. The system is based on ensemble learning for Convolutional Neural Networks (CNNs) and is evaluated using the data and the experimental protocol provided in the Depression Classification Sub-Challenge (DCC) at the 2016...

Descripción completa

Detalles Bibliográficos
Autores principales: Vázquez-Romero, Adrián, Gallardo-Antolín, Ascensión
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517226/
https://www.ncbi.nlm.nih.gov/pubmed/33286460
http://dx.doi.org/10.3390/e22060688
_version_ 1783587181443416064
author Vázquez-Romero, Adrián
Gallardo-Antolín, Ascensión
author_facet Vázquez-Romero, Adrián
Gallardo-Antolín, Ascensión
author_sort Vázquez-Romero, Adrián
collection PubMed
description This paper proposes a speech-based method for automatic depression classification. The system is based on ensemble learning for Convolutional Neural Networks (CNNs) and is evaluated using the data and the experimental protocol provided in the Depression Classification Sub-Challenge (DCC) at the 2016 Audio–Visual Emotion Challenge (AVEC-2016). In the pre-processing phase, speech files are represented as a sequence of log-spectrograms and randomly sampled to balance positive and negative samples. For the classification task itself, first, a more suitable architecture for this task, based on One-Dimensional Convolutional Neural Networks, is built. Secondly, several of these CNN-based models are trained with different initializations and then the corresponding individual predictions are fused by using an Ensemble Averaging algorithm and combined per speaker to get an appropriate final decision. The proposed ensemble system achieves satisfactory results on the DCC at the AVEC-2016 in comparison with a reference system based on Support Vector Machines and hand-crafted features, with a CNN+LSTM-based system called DepAudionet, and with the case of a single CNN-based classifier.
format Online
Article
Text
id pubmed-7517226
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75172262020-11-09 Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks Vázquez-Romero, Adrián Gallardo-Antolín, Ascensión Entropy (Basel) Article This paper proposes a speech-based method for automatic depression classification. The system is based on ensemble learning for Convolutional Neural Networks (CNNs) and is evaluated using the data and the experimental protocol provided in the Depression Classification Sub-Challenge (DCC) at the 2016 Audio–Visual Emotion Challenge (AVEC-2016). In the pre-processing phase, speech files are represented as a sequence of log-spectrograms and randomly sampled to balance positive and negative samples. For the classification task itself, first, a more suitable architecture for this task, based on One-Dimensional Convolutional Neural Networks, is built. Secondly, several of these CNN-based models are trained with different initializations and then the corresponding individual predictions are fused by using an Ensemble Averaging algorithm and combined per speaker to get an appropriate final decision. The proposed ensemble system achieves satisfactory results on the DCC at the AVEC-2016 in comparison with a reference system based on Support Vector Machines and hand-crafted features, with a CNN+LSTM-based system called DepAudionet, and with the case of a single CNN-based classifier. MDPI 2020-06-20 /pmc/articles/PMC7517226/ /pubmed/33286460 http://dx.doi.org/10.3390/e22060688 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Vázquez-Romero, Adrián
Gallardo-Antolín, Ascensión
Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
title Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
title_full Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
title_fullStr Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
title_full_unstemmed Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
title_short Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
title_sort automatic detection of depression in speech using ensemble convolutional neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517226/
https://www.ncbi.nlm.nih.gov/pubmed/33286460
http://dx.doi.org/10.3390/e22060688
work_keys_str_mv AT vazquezromeroadrian automaticdetectionofdepressioninspeechusingensembleconvolutionalneuralnetworks
AT gallardoantolinascension automaticdetectionofdepressioninspeechusingensembleconvolutionalneuralnetworks