Cargando…

A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets

Machine learning algorithms, including recent advances in deep learning, are promising for tools for detection and classification of broadband high frequency signals in passive acoustic recordings. However, these methods are generally data-hungry and progress has been limited by challenges related t...

Descripción completa

Detalles Bibliográficos
Autor principal: Frasier, Kaitlin E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8673644/
https://www.ncbi.nlm.nih.gov/pubmed/34860825
http://dx.doi.org/10.1371/journal.pcbi.1009613
_version_ 1784615492198072320
author Frasier, Kaitlin E.
author_facet Frasier, Kaitlin E.
author_sort Frasier, Kaitlin E.
collection PubMed
description Machine learning algorithms, including recent advances in deep learning, are promising for tools for detection and classification of broadband high frequency signals in passive acoustic recordings. However, these methods are generally data-hungry and progress has been limited by challenges related to the lack of labeled datasets adequate for training and testing. Large quantities of known and as yet unidentified broadband signal types mingle in marine recordings, with variability introduced by acoustic propagation, source depths and orientations, and interacting signals. Manual classification of these datasets is unmanageable without an in-depth knowledge of the acoustic context of each recording location. A signal classification pipeline is presented which combines unsupervised and supervised learning phases with opportunities for expert oversight to label signals of interest. The method is illustrated with a case study using unsupervised clustering to identify five toothed whale echolocation click types and two anthropogenic signal categories. These categories are used to train a deep network to classify detected signals in either averaged time bins or as individual detections, in two independent datasets. Bin-level classification achieved higher overall precision (>99%) than click-level classification. However, click-level classification had the advantage of providing a label for every signal, and achieved higher overall recall, with overall precision from 92 to 94%. The results suggest that unsupervised learning is a viable solution for efficiently generating the large, representative training sets needed for applications of deep learning in passive acoustics.
format Online
Article
Text
id pubmed-8673644
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-86736442021-12-16 A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets Frasier, Kaitlin E. PLoS Comput Biol Research Article Machine learning algorithms, including recent advances in deep learning, are promising for tools for detection and classification of broadband high frequency signals in passive acoustic recordings. However, these methods are generally data-hungry and progress has been limited by challenges related to the lack of labeled datasets adequate for training and testing. Large quantities of known and as yet unidentified broadband signal types mingle in marine recordings, with variability introduced by acoustic propagation, source depths and orientations, and interacting signals. Manual classification of these datasets is unmanageable without an in-depth knowledge of the acoustic context of each recording location. A signal classification pipeline is presented which combines unsupervised and supervised learning phases with opportunities for expert oversight to label signals of interest. The method is illustrated with a case study using unsupervised clustering to identify five toothed whale echolocation click types and two anthropogenic signal categories. These categories are used to train a deep network to classify detected signals in either averaged time bins or as individual detections, in two independent datasets. Bin-level classification achieved higher overall precision (>99%) than click-level classification. However, click-level classification had the advantage of providing a label for every signal, and achieved higher overall recall, with overall precision from 92 to 94%. The results suggest that unsupervised learning is a viable solution for efficiently generating the large, representative training sets needed for applications of deep learning in passive acoustics. Public Library of Science 2021-12-03 /pmc/articles/PMC8673644/ /pubmed/34860825 http://dx.doi.org/10.1371/journal.pcbi.1009613 Text en © 2021 Kaitlin E. Frasier https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Frasier, Kaitlin E.
A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets
title A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets
title_full A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets
title_fullStr A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets
title_full_unstemmed A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets
title_short A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets
title_sort machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8673644/
https://www.ncbi.nlm.nih.gov/pubmed/34860825
http://dx.doi.org/10.1371/journal.pcbi.1009613
work_keys_str_mv AT frasierkaitline amachinelearningpipelineforclassificationofcetaceanecholocationclicksinlargeunderwateracousticdatasets
AT frasierkaitline machinelearningpipelineforclassificationofcetaceanecholocationclicksinlargeunderwateracousticdatasets