Cargando…

Few-Shot Emergency Siren Detection

It is a well-established practice to build a robust system for sound event detection by training supervised deep learning models on large datasets, but audio data collection and labeling are often challenging and require large amounts of effort. This paper proposes a workflow based on few-shot metri...

Descripción completa

Detalles Bibliográficos
Autores principales: Cantarini, Michela, Gabrielli, Leonardo, Squartini, Stefano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9227471/
https://www.ncbi.nlm.nih.gov/pubmed/35746120
http://dx.doi.org/10.3390/s22124338
_version_ 1784734185974398976
author Cantarini, Michela
Gabrielli, Leonardo
Squartini, Stefano
author_facet Cantarini, Michela
Gabrielli, Leonardo
Squartini, Stefano
author_sort Cantarini, Michela
collection PubMed
description It is a well-established practice to build a robust system for sound event detection by training supervised deep learning models on large datasets, but audio data collection and labeling are often challenging and require large amounts of effort. This paper proposes a workflow based on few-shot metric learning for emergency siren detection performed in steps: prototypical networks are trained on publicly available sources or synthetic data in multiple combinations, and at inference time, the best knowledge learned in associating a sound with its class representation is transferred to identify ambulance sirens, given only a few instances for the prototype computation. Performance is evaluated on siren recordings acquired by sensors inside and outside the cabin of an equipped car, investigating the contribution of filtering techniques for background noise reduction. The results show the effectiveness of the proposed approach, achieving AUPRC scores equal to 0.86 and 0.91 in unfiltered and filtered conditions, respectively, outperforming a convolutional baseline model with and without fine-tuning for domain adaptation. Extensive experiments conducted on several recording sensor placements prove that few-shot learning is a reliable technique even in real-world scenarios and gives valuable insights for developing an in-car emergency vehicle detection system.
format Online
Article
Text
id pubmed-9227471
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-92274712022-06-25 Few-Shot Emergency Siren Detection Cantarini, Michela Gabrielli, Leonardo Squartini, Stefano Sensors (Basel) Article It is a well-established practice to build a robust system for sound event detection by training supervised deep learning models on large datasets, but audio data collection and labeling are often challenging and require large amounts of effort. This paper proposes a workflow based on few-shot metric learning for emergency siren detection performed in steps: prototypical networks are trained on publicly available sources or synthetic data in multiple combinations, and at inference time, the best knowledge learned in associating a sound with its class representation is transferred to identify ambulance sirens, given only a few instances for the prototype computation. Performance is evaluated on siren recordings acquired by sensors inside and outside the cabin of an equipped car, investigating the contribution of filtering techniques for background noise reduction. The results show the effectiveness of the proposed approach, achieving AUPRC scores equal to 0.86 and 0.91 in unfiltered and filtered conditions, respectively, outperforming a convolutional baseline model with and without fine-tuning for domain adaptation. Extensive experiments conducted on several recording sensor placements prove that few-shot learning is a reliable technique even in real-world scenarios and gives valuable insights for developing an in-car emergency vehicle detection system. MDPI 2022-06-08 /pmc/articles/PMC9227471/ /pubmed/35746120 http://dx.doi.org/10.3390/s22124338 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Cantarini, Michela
Gabrielli, Leonardo
Squartini, Stefano
Few-Shot Emergency Siren Detection
title Few-Shot Emergency Siren Detection
title_full Few-Shot Emergency Siren Detection
title_fullStr Few-Shot Emergency Siren Detection
title_full_unstemmed Few-Shot Emergency Siren Detection
title_short Few-Shot Emergency Siren Detection
title_sort few-shot emergency siren detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9227471/
https://www.ncbi.nlm.nih.gov/pubmed/35746120
http://dx.doi.org/10.3390/s22124338
work_keys_str_mv AT cantarinimichela fewshotemergencysirendetection
AT gabriellileonardo fewshotemergencysirendetection
AT squartinistefano fewshotemergencysirendetection