Cargando…

A Parallel Classification Model for Marine Mammal Sounds Based on Multi-Dimensional Feature Extraction and Data Augmentation

Due to the poor visibility of the deep-sea environment, acoustic signals are often collected and analyzed to explore the behavior of marine species. With the progress of underwater signal-acquisition technology, the amount of acoustic data obtained from the ocean has exceeded the limit that human ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Wenyu, Zhu, Jifeng, Zhang, Meiyan, Yang, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572586/
https://www.ncbi.nlm.nih.gov/pubmed/36236544
http://dx.doi.org/10.3390/s22197443
_version_ 1784810652481617920
author Cai, Wenyu
Zhu, Jifeng
Zhang, Meiyan
Yang, Yong
author_facet Cai, Wenyu
Zhu, Jifeng
Zhang, Meiyan
Yang, Yong
author_sort Cai, Wenyu
collection PubMed
description Due to the poor visibility of the deep-sea environment, acoustic signals are often collected and analyzed to explore the behavior of marine species. With the progress of underwater signal-acquisition technology, the amount of acoustic data obtained from the ocean has exceeded the limit that human can process manually, so designing efficient marine-mammal classification algorithms has become a research hotspot. In this paper, we design a classification model based on a multi-channel parallel structure, which can process multi-dimensional acoustic features extracted from audio samples, and fuse the prediction results of different channels through a trainable full connection layer. It uses transfer learning to obtain faster convergence speed, and introduces data augmentation to improve the classification accuracy. The k-fold cross-validation method was used to segment the data set to comprehensively evaluate the prediction accuracy and robustness of the model. The evaluation results showed that the model can achieve a mean accuracy of 95.21% while maintaining a standard deviation of 0.65%. There was excellent consistency in performance over multiple tests.
format Online
Article
Text
id pubmed-9572586
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95725862022-10-17 A Parallel Classification Model for Marine Mammal Sounds Based on Multi-Dimensional Feature Extraction and Data Augmentation Cai, Wenyu Zhu, Jifeng Zhang, Meiyan Yang, Yong Sensors (Basel) Article Due to the poor visibility of the deep-sea environment, acoustic signals are often collected and analyzed to explore the behavior of marine species. With the progress of underwater signal-acquisition technology, the amount of acoustic data obtained from the ocean has exceeded the limit that human can process manually, so designing efficient marine-mammal classification algorithms has become a research hotspot. In this paper, we design a classification model based on a multi-channel parallel structure, which can process multi-dimensional acoustic features extracted from audio samples, and fuse the prediction results of different channels through a trainable full connection layer. It uses transfer learning to obtain faster convergence speed, and introduces data augmentation to improve the classification accuracy. The k-fold cross-validation method was used to segment the data set to comprehensively evaluate the prediction accuracy and robustness of the model. The evaluation results showed that the model can achieve a mean accuracy of 95.21% while maintaining a standard deviation of 0.65%. There was excellent consistency in performance over multiple tests. MDPI 2022-09-30 /pmc/articles/PMC9572586/ /pubmed/36236544 http://dx.doi.org/10.3390/s22197443 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Cai, Wenyu
Zhu, Jifeng
Zhang, Meiyan
Yang, Yong
A Parallel Classification Model for Marine Mammal Sounds Based on Multi-Dimensional Feature Extraction and Data Augmentation
title A Parallel Classification Model for Marine Mammal Sounds Based on Multi-Dimensional Feature Extraction and Data Augmentation
title_full A Parallel Classification Model for Marine Mammal Sounds Based on Multi-Dimensional Feature Extraction and Data Augmentation
title_fullStr A Parallel Classification Model for Marine Mammal Sounds Based on Multi-Dimensional Feature Extraction and Data Augmentation
title_full_unstemmed A Parallel Classification Model for Marine Mammal Sounds Based on Multi-Dimensional Feature Extraction and Data Augmentation
title_short A Parallel Classification Model for Marine Mammal Sounds Based on Multi-Dimensional Feature Extraction and Data Augmentation
title_sort parallel classification model for marine mammal sounds based on multi-dimensional feature extraction and data augmentation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572586/
https://www.ncbi.nlm.nih.gov/pubmed/36236544
http://dx.doi.org/10.3390/s22197443
work_keys_str_mv AT caiwenyu aparallelclassificationmodelformarinemammalsoundsbasedonmultidimensionalfeatureextractionanddataaugmentation
AT zhujifeng aparallelclassificationmodelformarinemammalsoundsbasedonmultidimensionalfeatureextractionanddataaugmentation
AT zhangmeiyan aparallelclassificationmodelformarinemammalsoundsbasedonmultidimensionalfeatureextractionanddataaugmentation
AT yangyong aparallelclassificationmodelformarinemammalsoundsbasedonmultidimensionalfeatureextractionanddataaugmentation
AT caiwenyu parallelclassificationmodelformarinemammalsoundsbasedonmultidimensionalfeatureextractionanddataaugmentation
AT zhujifeng parallelclassificationmodelformarinemammalsoundsbasedonmultidimensionalfeatureextractionanddataaugmentation
AT zhangmeiyan parallelclassificationmodelformarinemammalsoundsbasedonmultidimensionalfeatureextractionanddataaugmentation
AT yangyong parallelclassificationmodelformarinemammalsoundsbasedonmultidimensionalfeatureextractionanddataaugmentation