Cargando…

Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection

In the emerging field of acoustic novelty detection, most research efforts are devoted to probabilistic approaches such as mixture models or state-space models. Only recent studies introduced (pseudo-)generative models for acoustic novelty detection with recurrent neural networks in the form of an a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Marchi, Erik, Vesperini, Fabio, Squartini, Stefano, Schuller, Björn
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi Publishing Corporation 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5274684/ https://www.ncbi.nlm.nih.gov/pubmed/28182121 http://dx.doi.org/10.1155/2017/4694860

_version_	1782501946253377536
author	Marchi, Erik Vesperini, Fabio Squartini, Stefano Schuller, Björn
author_facet	Marchi, Erik Vesperini, Fabio Squartini, Stefano Schuller, Björn
author_sort	Marchi, Erik
collection	PubMed
description	In the emerging field of acoustic novelty detection, most research efforts are devoted to probabilistic approaches such as mixture models or state-space models. Only recent studies introduced (pseudo-)generative models for acoustic novelty detection with recurrent neural networks in the form of an autoencoder. In these approaches, auditory spectral features of the next short term frame are predicted from the previous frames by means of Long-Short Term Memory recurrent denoising autoencoders. The reconstruction error between the input and the output of the autoencoder is used as activation signal to detect novel events. There is no evidence of studies focused on comparing previous efforts to automatically recognize novel events from audio signals and giving a broad and in depth evaluation of recurrent neural network-based autoencoders. The present contribution aims to consistently evaluate our recent novel approaches to fill this white spot in the literature and provide insight by extensive evaluations carried out on three databases: A3Novelty, PASCAL CHiME, and PROMETHEUS. Besides providing an extensive analysis of novel and state-of-the-art methods, the article shows how RNN-based autoencoders outperform statistical approaches up to an absolute improvement of 16.4% average F-measure over the three databases.
format	Online Article Text
id	pubmed-5274684
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Hindawi Publishing Corporation
record_format	MEDLINE/PubMed
spelling	pubmed-52746842017-02-08 Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection Marchi, Erik Vesperini, Fabio Squartini, Stefano Schuller, Björn Comput Intell Neurosci Research Article In the emerging field of acoustic novelty detection, most research efforts are devoted to probabilistic approaches such as mixture models or state-space models. Only recent studies introduced (pseudo-)generative models for acoustic novelty detection with recurrent neural networks in the form of an autoencoder. In these approaches, auditory spectral features of the next short term frame are predicted from the previous frames by means of Long-Short Term Memory recurrent denoising autoencoders. The reconstruction error between the input and the output of the autoencoder is used as activation signal to detect novel events. There is no evidence of studies focused on comparing previous efforts to automatically recognize novel events from audio signals and giving a broad and in depth evaluation of recurrent neural network-based autoencoders. The present contribution aims to consistently evaluate our recent novel approaches to fill this white spot in the literature and provide insight by extensive evaluations carried out on three databases: A3Novelty, PASCAL CHiME, and PROMETHEUS. Besides providing an extensive analysis of novel and state-of-the-art methods, the article shows how RNN-based autoencoders outperform statistical approaches up to an absolute improvement of 16.4% average F-measure over the three databases. Hindawi Publishing Corporation 2017 2017-01-15 /pmc/articles/PMC5274684/ /pubmed/28182121 http://dx.doi.org/10.1155/2017/4694860 Text en Copyright © 2017 Erik Marchi et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Marchi, Erik Vesperini, Fabio Squartini, Stefano Schuller, Björn Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection
title	Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection
title_full	Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection
title_fullStr	Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection
title_full_unstemmed	Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection
title_short	Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection
title_sort	deep recurrent neural network-based autoencoders for acoustic novelty detection
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5274684/ https://www.ncbi.nlm.nih.gov/pubmed/28182121 http://dx.doi.org/10.1155/2017/4694860
work_keys_str_mv	AT marchierik deeprecurrentneuralnetworkbasedautoencodersforacousticnoveltydetection AT vesperinifabio deeprecurrentneuralnetworkbasedautoencodersforacousticnoveltydetection AT squartinistefano deeprecurrentneuralnetworkbasedautoencodersforacousticnoveltydetection AT schullerbjorn deeprecurrentneuralnetworkbasedautoencodersforacousticnoveltydetection

Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection

Ejemplares similares