Cargando…

Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation

This paper proposes an audio data augmentation method based on deep learning in order to improve the performance of dereverberation. Conventionally, audio data are augmented using a room impulse response, which is artificially generated by some methods, such as the image method. The proposed method...

Descripción completa

Detalles Bibliográficos
Autores principales: Yun, Deokgyu, Choi, Seung Ho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8777689/
https://www.ncbi.nlm.nih.gov/pubmed/35062553
http://dx.doi.org/10.3390/s22020592
_version_ 1784637127170981888
author Yun, Deokgyu
Choi, Seung Ho
author_facet Yun, Deokgyu
Choi, Seung Ho
author_sort Yun, Deokgyu
collection PubMed
description This paper proposes an audio data augmentation method based on deep learning in order to improve the performance of dereverberation. Conventionally, audio data are augmented using a room impulse response, which is artificially generated by some methods, such as the image method. The proposed method estimates a reverberation environment model based on a deep neural network that is trained by using clean and recorded audio data as inputs and outputs, respectively. Then, a large amount of a real augmented database is constructed by using the trained reverberation model, and the dereverberation model is trained with the augmented database. The performance of the augmentation model was verified by a log spectral distance and mean square error between the real augmented data and the recorded data. In addition, according to dereverberation experiments, the proposed method showed improved performance compared with the conventional method.
format Online
Article
Text
id pubmed-8777689
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87776892022-01-22 Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation Yun, Deokgyu Choi, Seung Ho Sensors (Basel) Communication This paper proposes an audio data augmentation method based on deep learning in order to improve the performance of dereverberation. Conventionally, audio data are augmented using a room impulse response, which is artificially generated by some methods, such as the image method. The proposed method estimates a reverberation environment model based on a deep neural network that is trained by using clean and recorded audio data as inputs and outputs, respectively. Then, a large amount of a real augmented database is constructed by using the trained reverberation model, and the dereverberation model is trained with the augmented database. The performance of the augmentation model was verified by a log spectral distance and mean square error between the real augmented data and the recorded data. In addition, according to dereverberation experiments, the proposed method showed improved performance compared with the conventional method. MDPI 2022-01-13 /pmc/articles/PMC8777689/ /pubmed/35062553 http://dx.doi.org/10.3390/s22020592 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Yun, Deokgyu
Choi, Seung Ho
Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation
title Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation
title_full Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation
title_fullStr Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation
title_full_unstemmed Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation
title_short Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation
title_sort deep learning-based estimation of reverberant environment for audio data augmentation
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8777689/
https://www.ncbi.nlm.nih.gov/pubmed/35062553
http://dx.doi.org/10.3390/s22020592
work_keys_str_mv AT yundeokgyu deeplearningbasedestimationofreverberantenvironmentforaudiodataaugmentation
AT choiseungho deeplearningbasedestimationofreverberantenvironmentforaudiodataaugmentation