Cargando…

Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture

The use of autonomous recordings of animal sounds to detect species is a popular conservation tool, constantly improving in fidelity as audio hardware and software evolves. Current classification algorithms utilise sound features extracted from the recording rather than the sound itself, with varyin...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bravo Sanchez, Francisco J., Hossain, Md Rahat, English, Nathan B., Moore, Steven T.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8333097/ https://www.ncbi.nlm.nih.gov/pubmed/34344970 http://dx.doi.org/10.1038/s41598-021-95076-6

_version_	1783732966988447744
author	Bravo Sanchez, Francisco J. Hossain, Md Rahat English, Nathan B. Moore, Steven T.
author_facet	Bravo Sanchez, Francisco J. Hossain, Md Rahat English, Nathan B. Moore, Steven T.
author_sort	Bravo Sanchez, Francisco J.
collection	PubMed
description	The use of autonomous recordings of animal sounds to detect species is a popular conservation tool, constantly improving in fidelity as audio hardware and software evolves. Current classification algorithms utilise sound features extracted from the recording rather than the sound itself, with varying degrees of success. Neural networks that learn directly from the raw sound waveforms have been implemented in human speech recognition but the requirements of detailed labelled data have limited their use in bioacoustics. Here we test SincNet, an efficient neural network architecture that learns from the raw waveform using sinc-based filters. Results using an off-the-shelf implementation of SincNet on a publicly available bird sound dataset (NIPS4Bplus) show that the neural network rapidly converged reaching accuracies of over 65% with limited data. Their performance is comparable with traditional methods after hyperparameter tuning but they are more efficient. Learning directly from the raw waveform allows the algorithm to select automatically those elements of the sound that are best suited for the task, bypassing the onerous task of selecting feature extraction techniques and reducing possible biases. We use publicly released code and datasets to encourage others to replicate our results and to apply SincNet to their own datasets; and we review possible enhancements in the hope that algorithms that learn from the raw waveform will become useful bioacoustic tools.
format	Online Article Text
id	pubmed-8333097
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-83330972021-08-04 Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture Bravo Sanchez, Francisco J. Hossain, Md Rahat English, Nathan B. Moore, Steven T. Sci Rep Article The use of autonomous recordings of animal sounds to detect species is a popular conservation tool, constantly improving in fidelity as audio hardware and software evolves. Current classification algorithms utilise sound features extracted from the recording rather than the sound itself, with varying degrees of success. Neural networks that learn directly from the raw sound waveforms have been implemented in human speech recognition but the requirements of detailed labelled data have limited their use in bioacoustics. Here we test SincNet, an efficient neural network architecture that learns from the raw waveform using sinc-based filters. Results using an off-the-shelf implementation of SincNet on a publicly available bird sound dataset (NIPS4Bplus) show that the neural network rapidly converged reaching accuracies of over 65% with limited data. Their performance is comparable with traditional methods after hyperparameter tuning but they are more efficient. Learning directly from the raw waveform allows the algorithm to select automatically those elements of the sound that are best suited for the task, bypassing the onerous task of selecting feature extraction techniques and reducing possible biases. We use publicly released code and datasets to encourage others to replicate our results and to apply SincNet to their own datasets; and we review possible enhancements in the hope that algorithms that learn from the raw waveform will become useful bioacoustic tools. Nature Publishing Group UK 2021-08-03 /pmc/articles/PMC8333097/ /pubmed/34344970 http://dx.doi.org/10.1038/s41598-021-95076-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Bravo Sanchez, Francisco J. Hossain, Md Rahat English, Nathan B. Moore, Steven T. Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture
title	Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture
title_full	Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture
title_fullStr	Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture
title_full_unstemmed	Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture
title_short	Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture
title_sort	bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8333097/ https://www.ncbi.nlm.nih.gov/pubmed/34344970 http://dx.doi.org/10.1038/s41598-021-95076-6
work_keys_str_mv	AT bravosanchezfranciscoj bioacousticclassificationofaviancallsfromrawsoundwaveformswithanopensourcedeeplearningarchitecture AT hossainmdrahat bioacousticclassificationofaviancallsfromrawsoundwaveformswithanopensourcedeeplearningarchitecture AT englishnathanb bioacousticclassificationofaviancallsfromrawsoundwaveformswithanopensourcedeeplearningarchitecture AT moorestevent bioacousticclassificationofaviancallsfromrawsoundwaveformswithanopensourcedeeplearningarchitecture

Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture

Ejemplares similares