Cargando…

Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks

Audio signal classification finds various applications in detecting and monitoring health conditions in healthcare. Convolutional neural networks (CNN) have produced state-of-the-art results in image classification and are being increasingly used in other tasks, including signal classification. Howe...

Descripción completa

Detalles Bibliográficos
Autores principales: Sharan, Roneel V., Xiong, Hao, Berkovsky, Shlomo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8156023/
https://www.ncbi.nlm.nih.gov/pubmed/34069189
http://dx.doi.org/10.3390/s21103434
_version_ 1783699340843286528
author Sharan, Roneel V.
Xiong, Hao
Berkovsky, Shlomo
author_facet Sharan, Roneel V.
Xiong, Hao
Berkovsky, Shlomo
author_sort Sharan, Roneel V.
collection PubMed
description Audio signal classification finds various applications in detecting and monitoring health conditions in healthcare. Convolutional neural networks (CNN) have produced state-of-the-art results in image classification and are being increasingly used in other tasks, including signal classification. However, audio signal classification using CNN presents various challenges. In image classification tasks, raw images of equal dimensions can be used as a direct input to CNN. Raw time-domain signals, on the other hand, can be of varying dimensions. In addition, the temporal signal often has to be transformed to frequency-domain to reveal unique spectral characteristics, therefore requiring signal transformation. In this work, we overview and benchmark various audio signal representation techniques for classification using CNN, including approaches that deal with signals of different lengths and combine multiple representations to improve the classification accuracy. Hence, this work surfaces important empirical evidence that may guide future works deploying CNN for audio signal classification purposes.
format Online
Article
Text
id pubmed-8156023
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81560232021-05-28 Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks Sharan, Roneel V. Xiong, Hao Berkovsky, Shlomo Sensors (Basel) Article Audio signal classification finds various applications in detecting and monitoring health conditions in healthcare. Convolutional neural networks (CNN) have produced state-of-the-art results in image classification and are being increasingly used in other tasks, including signal classification. However, audio signal classification using CNN presents various challenges. In image classification tasks, raw images of equal dimensions can be used as a direct input to CNN. Raw time-domain signals, on the other hand, can be of varying dimensions. In addition, the temporal signal often has to be transformed to frequency-domain to reveal unique spectral characteristics, therefore requiring signal transformation. In this work, we overview and benchmark various audio signal representation techniques for classification using CNN, including approaches that deal with signals of different lengths and combine multiple representations to improve the classification accuracy. Hence, this work surfaces important empirical evidence that may guide future works deploying CNN for audio signal classification purposes. MDPI 2021-05-14 /pmc/articles/PMC8156023/ /pubmed/34069189 http://dx.doi.org/10.3390/s21103434 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sharan, Roneel V.
Xiong, Hao
Berkovsky, Shlomo
Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks
title Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks
title_full Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks
title_fullStr Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks
title_full_unstemmed Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks
title_short Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks
title_sort benchmarking audio signal representation techniques for classification with convolutional neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8156023/
https://www.ncbi.nlm.nih.gov/pubmed/34069189
http://dx.doi.org/10.3390/s21103434
work_keys_str_mv AT sharanroneelv benchmarkingaudiosignalrepresentationtechniquesforclassificationwithconvolutionalneuralnetworks
AT xionghao benchmarkingaudiosignalrepresentationtechniquesforclassificationwithconvolutionalneuralnetworks
AT berkovskyshlomo benchmarkingaudiosignalrepresentationtechniquesforclassificationwithconvolutionalneuralnetworks