Cargando…
Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
This paper proposes a speech-based method for automatic depression classification. The system is based on ensemble learning for Convolutional Neural Networks (CNNs) and is evaluated using the data and the experimental protocol provided in the Depression Classification Sub-Challenge (DCC) at the 2016...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517226/ https://www.ncbi.nlm.nih.gov/pubmed/33286460 http://dx.doi.org/10.3390/e22060688 |
_version_ | 1783587181443416064 |
---|---|
author | Vázquez-Romero, Adrián Gallardo-Antolín, Ascensión |
author_facet | Vázquez-Romero, Adrián Gallardo-Antolín, Ascensión |
author_sort | Vázquez-Romero, Adrián |
collection | PubMed |
description | This paper proposes a speech-based method for automatic depression classification. The system is based on ensemble learning for Convolutional Neural Networks (CNNs) and is evaluated using the data and the experimental protocol provided in the Depression Classification Sub-Challenge (DCC) at the 2016 Audio–Visual Emotion Challenge (AVEC-2016). In the pre-processing phase, speech files are represented as a sequence of log-spectrograms and randomly sampled to balance positive and negative samples. For the classification task itself, first, a more suitable architecture for this task, based on One-Dimensional Convolutional Neural Networks, is built. Secondly, several of these CNN-based models are trained with different initializations and then the corresponding individual predictions are fused by using an Ensemble Averaging algorithm and combined per speaker to get an appropriate final decision. The proposed ensemble system achieves satisfactory results on the DCC at the AVEC-2016 in comparison with a reference system based on Support Vector Machines and hand-crafted features, with a CNN+LSTM-based system called DepAudionet, and with the case of a single CNN-based classifier. |
format | Online Article Text |
id | pubmed-7517226 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75172262020-11-09 Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks Vázquez-Romero, Adrián Gallardo-Antolín, Ascensión Entropy (Basel) Article This paper proposes a speech-based method for automatic depression classification. The system is based on ensemble learning for Convolutional Neural Networks (CNNs) and is evaluated using the data and the experimental protocol provided in the Depression Classification Sub-Challenge (DCC) at the 2016 Audio–Visual Emotion Challenge (AVEC-2016). In the pre-processing phase, speech files are represented as a sequence of log-spectrograms and randomly sampled to balance positive and negative samples. For the classification task itself, first, a more suitable architecture for this task, based on One-Dimensional Convolutional Neural Networks, is built. Secondly, several of these CNN-based models are trained with different initializations and then the corresponding individual predictions are fused by using an Ensemble Averaging algorithm and combined per speaker to get an appropriate final decision. The proposed ensemble system achieves satisfactory results on the DCC at the AVEC-2016 in comparison with a reference system based on Support Vector Machines and hand-crafted features, with a CNN+LSTM-based system called DepAudionet, and with the case of a single CNN-based classifier. MDPI 2020-06-20 /pmc/articles/PMC7517226/ /pubmed/33286460 http://dx.doi.org/10.3390/e22060688 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Vázquez-Romero, Adrián Gallardo-Antolín, Ascensión Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks |
title | Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks |
title_full | Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks |
title_fullStr | Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks |
title_full_unstemmed | Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks |
title_short | Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks |
title_sort | automatic detection of depression in speech using ensemble convolutional neural networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517226/ https://www.ncbi.nlm.nih.gov/pubmed/33286460 http://dx.doi.org/10.3390/e22060688 |
work_keys_str_mv | AT vazquezromeroadrian automaticdetectionofdepressioninspeechusingensembleconvolutionalneuralnetworks AT gallardoantolinascension automaticdetectionofdepressioninspeechusingensembleconvolutionalneuralnetworks |