Cargando…

Speeding up training of automated bird recognizers by data reduction of audio features

Automated acoustic recognition of birds is considered an important technology in support of biodiversity monitoring and biodiversity conservation activities. These activities require processing large amounts of soundscape recordings. Typically, recordings are transformed to a number of acoustic feat...

Descripción completa

Detalles Bibliográficos
Autores principales: de Oliveira, Allan G., Ventura, Thiago M., Ganchev, Todor D., Silva, Lucas N.S., Marques, Marinêz I., Schuchmann, Karl-L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6991130/
https://www.ncbi.nlm.nih.gov/pubmed/32025373
http://dx.doi.org/10.7717/peerj.8407
_version_ 1783492597048672256
author de Oliveira, Allan G.
Ventura, Thiago M.
Ganchev, Todor D.
Silva, Lucas N.S.
Marques, Marinêz I.
Schuchmann, Karl-L.
author_facet de Oliveira, Allan G.
Ventura, Thiago M.
Ganchev, Todor D.
Silva, Lucas N.S.
Marques, Marinêz I.
Schuchmann, Karl-L.
author_sort de Oliveira, Allan G.
collection PubMed
description Automated acoustic recognition of birds is considered an important technology in support of biodiversity monitoring and biodiversity conservation activities. These activities require processing large amounts of soundscape recordings. Typically, recordings are transformed to a number of acoustic features, and a machine learning method is used to build models and recognize the sound events of interest. The main problem is the scalability of data processing, either for developing models or for processing recordings made over long time periods. In those cases, the processing time and resources required might become prohibitive for the average user. To address this problem, we evaluated the applicability of three data reduction methods. These methods were applied to a series of acoustic feature vectors as an additional postprocessing step, which aims to reduce the computational demand during training. The experimental results obtained using Mel-frequency cepstral coefficients (MFCCs) and hidden Markov models (HMMs) support the finding that a reduction in training data by a factor of 10 does not significantly affect the recognition performance.
format Online
Article
Text
id pubmed-6991130
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-69911302020-02-05 Speeding up training of automated bird recognizers by data reduction of audio features de Oliveira, Allan G. Ventura, Thiago M. Ganchev, Todor D. Silva, Lucas N.S. Marques, Marinêz I. Schuchmann, Karl-L. PeerJ Ecosystem Science Automated acoustic recognition of birds is considered an important technology in support of biodiversity monitoring and biodiversity conservation activities. These activities require processing large amounts of soundscape recordings. Typically, recordings are transformed to a number of acoustic features, and a machine learning method is used to build models and recognize the sound events of interest. The main problem is the scalability of data processing, either for developing models or for processing recordings made over long time periods. In those cases, the processing time and resources required might become prohibitive for the average user. To address this problem, we evaluated the applicability of three data reduction methods. These methods were applied to a series of acoustic feature vectors as an additional postprocessing step, which aims to reduce the computational demand during training. The experimental results obtained using Mel-frequency cepstral coefficients (MFCCs) and hidden Markov models (HMMs) support the finding that a reduction in training data by a factor of 10 does not significantly affect the recognition performance. PeerJ Inc. 2020-01-27 /pmc/articles/PMC6991130/ /pubmed/32025373 http://dx.doi.org/10.7717/peerj.8407 Text en ©2020 de Oliveira et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Ecosystem Science
de Oliveira, Allan G.
Ventura, Thiago M.
Ganchev, Todor D.
Silva, Lucas N.S.
Marques, Marinêz I.
Schuchmann, Karl-L.
Speeding up training of automated bird recognizers by data reduction of audio features
title Speeding up training of automated bird recognizers by data reduction of audio features
title_full Speeding up training of automated bird recognizers by data reduction of audio features
title_fullStr Speeding up training of automated bird recognizers by data reduction of audio features
title_full_unstemmed Speeding up training of automated bird recognizers by data reduction of audio features
title_short Speeding up training of automated bird recognizers by data reduction of audio features
title_sort speeding up training of automated bird recognizers by data reduction of audio features
topic Ecosystem Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6991130/
https://www.ncbi.nlm.nih.gov/pubmed/32025373
http://dx.doi.org/10.7717/peerj.8407
work_keys_str_mv AT deoliveiraallang speedinguptrainingofautomatedbirdrecognizersbydatareductionofaudiofeatures
AT venturathiagom speedinguptrainingofautomatedbirdrecognizersbydatareductionofaudiofeatures
AT ganchevtodord speedinguptrainingofautomatedbirdrecognizersbydatareductionofaudiofeatures
AT silvalucasns speedinguptrainingofautomatedbirdrecognizersbydatareductionofaudiofeatures
AT marquesmarinezi speedinguptrainingofautomatedbirdrecognizersbydatareductionofaudiofeatures
AT schuchmannkarll speedinguptrainingofautomatedbirdrecognizersbydatareductionofaudiofeatures