Cargando…

Scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring

In this work, we examine the problem of efficiently preprocessing and denoising high volume environmental acoustic data, which is a necessary step in many bird monitoring tasks. Preprocessing is typically made up of multiple steps which are considered separately from each other. These are often reso...

Descripción completa

Detalles Bibliográficos
Autores principales:	Brown, Alexander, Garg, Saurabh, Montgomery, James
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6075764/ https://www.ncbi.nlm.nih.gov/pubmed/30075012 http://dx.doi.org/10.1371/journal.pone.0201542

_version_	1783344611373088768
author	Brown, Alexander Garg, Saurabh Montgomery, James
author_facet	Brown, Alexander Garg, Saurabh Montgomery, James
author_sort	Brown, Alexander
collection	PubMed
description	In this work, we examine the problem of efficiently preprocessing and denoising high volume environmental acoustic data, which is a necessary step in many bird monitoring tasks. Preprocessing is typically made up of multiple steps which are considered separately from each other. These are often resource intensive, particularly because the volume of data involved is high. We focus on addressing two challenges within this problem: how to combine existing preprocessing tasks while maximising the effectiveness of each step, and how to process this pipeline quickly and efficiently, so that it can be used to process high volumes of acoustic data. We describe a distributed system designed specifically for this problem, utilising a master-slave model with data parallelisation. By investigating the impact of individual preprocessing tasks on each other, and their execution times, we determine an efficient and accurate order for preprocessing tasks within the distributed system. We find that, using a single core, our pipeline executes 1.40 times faster compared to manually executing all preprocessing tasks. We then apply our pipeline in the distributed system and evaluate its performance. We find that our system is capable of preprocessing bird acoustic recordings at a rate of 174.8 seconds of audio per second of real time with 32 cores over 8 virtual machines, which is 21.76 times faster than a serial process.
format	Online Article Text
id	pubmed-6075764
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-60757642018-08-28 Scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring Brown, Alexander Garg, Saurabh Montgomery, James PLoS One Research Article In this work, we examine the problem of efficiently preprocessing and denoising high volume environmental acoustic data, which is a necessary step in many bird monitoring tasks. Preprocessing is typically made up of multiple steps which are considered separately from each other. These are often resource intensive, particularly because the volume of data involved is high. We focus on addressing two challenges within this problem: how to combine existing preprocessing tasks while maximising the effectiveness of each step, and how to process this pipeline quickly and efficiently, so that it can be used to process high volumes of acoustic data. We describe a distributed system designed specifically for this problem, utilising a master-slave model with data parallelisation. By investigating the impact of individual preprocessing tasks on each other, and their execution times, we determine an efficient and accurate order for preprocessing tasks within the distributed system. We find that, using a single core, our pipeline executes 1.40 times faster compared to manually executing all preprocessing tasks. We then apply our pipeline in the distributed system and evaluate its performance. We find that our system is capable of preprocessing bird acoustic recordings at a rate of 174.8 seconds of audio per second of real time with 32 cores over 8 virtual machines, which is 21.76 times faster than a serial process. Public Library of Science 2018-08-03 /pmc/articles/PMC6075764/ /pubmed/30075012 http://dx.doi.org/10.1371/journal.pone.0201542 Text en © 2018 Brown et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Brown, Alexander Garg, Saurabh Montgomery, James Scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring
title	Scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring
title_full	Scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring
title_fullStr	Scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring
title_full_unstemmed	Scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring
title_short	Scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring
title_sort	scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6075764/ https://www.ncbi.nlm.nih.gov/pubmed/30075012 http://dx.doi.org/10.1371/journal.pone.0201542
work_keys_str_mv	AT brownalexander scalablepreprocessingofhighvolumeenvironmentalacousticdataforbioacousticmonitoring AT gargsaurabh scalablepreprocessingofhighvolumeenvironmentalacousticdataforbioacousticmonitoring AT montgomeryjames scalablepreprocessingofhighvolumeenvironmentalacousticdataforbioacousticmonitoring

Scalable preprocessing of high volume environmental acoustic data for bioacoustic monitoring

Ejemplares similares