Cargando…

Automatic Recognition of Giant Panda Attributes from Their Vocalizations Based on Squeeze-and-Excitation Network

The giant panda (Ailuropoda melanoleuca) has long attracted the attention of conservationists as a flagship and umbrella species. Collecting attribute information on the age structure and sex ratio of the wild giant panda populations can support our understanding of their status and the design of mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Qijun, Zhang, Yanqiu, Hou, Rong, He, Mengnan, Liu, Peng, Xu, Ping, Zhang, Zhihe, Chen, Peng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610399/
https://www.ncbi.nlm.nih.gov/pubmed/36298365
http://dx.doi.org/10.3390/s22208015
_version_ 1784819260093104128
author Zhao, Qijun
Zhang, Yanqiu
Hou, Rong
He, Mengnan
Liu, Peng
Xu, Ping
Zhang, Zhihe
Chen, Peng
author_facet Zhao, Qijun
Zhang, Yanqiu
Hou, Rong
He, Mengnan
Liu, Peng
Xu, Ping
Zhang, Zhihe
Chen, Peng
author_sort Zhao, Qijun
collection PubMed
description The giant panda (Ailuropoda melanoleuca) has long attracted the attention of conservationists as a flagship and umbrella species. Collecting attribute information on the age structure and sex ratio of the wild giant panda populations can support our understanding of their status and the design of more effective conservation schemes. In view of the shortcomings of traditional methods, which cannot automatically recognize the age and sex of giant pandas, we designed a SENet (Squeeze-and-Excitation Network)-based model to automatically recognize the attributes of giant pandas from their vocalizations. We focused on the recognition of age groups (juvenile and adult) and sex of giant pandas. The reason for using vocalizations is that among the modes of animal communication, sound has the advantages of long transmission distances, strong penetrating power, and rich information. We collected a dataset of calls from 28 captive giant panda individuals, with a total duration of 1298.02 s of recordings. We used MFCC (Mel-frequency Cepstral Coefficients), which is an acoustic feature, as inputs for the SENet. Considering that small datasets are not conducive to convergence in the training process, we increased the size of the training data via SpecAugment. In addition, we used focal loss to reduce the impact of data imbalance. Our results showed that the F1 scores of our method for recognizing age group and sex reached 96.46% ± 5.71% and 85.85% ± 7.99%, respectively, demonstrating that the automatic recognition of giant panda attributes based on their vocalizations is feasible and effective. This more convenient, quick, timesaving, and laborsaving attribute recognition method can be used in the investigation of wild giant pandas in the future.
format Online
Article
Text
id pubmed-9610399
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96103992022-10-28 Automatic Recognition of Giant Panda Attributes from Their Vocalizations Based on Squeeze-and-Excitation Network Zhao, Qijun Zhang, Yanqiu Hou, Rong He, Mengnan Liu, Peng Xu, Ping Zhang, Zhihe Chen, Peng Sensors (Basel) Article The giant panda (Ailuropoda melanoleuca) has long attracted the attention of conservationists as a flagship and umbrella species. Collecting attribute information on the age structure and sex ratio of the wild giant panda populations can support our understanding of their status and the design of more effective conservation schemes. In view of the shortcomings of traditional methods, which cannot automatically recognize the age and sex of giant pandas, we designed a SENet (Squeeze-and-Excitation Network)-based model to automatically recognize the attributes of giant pandas from their vocalizations. We focused on the recognition of age groups (juvenile and adult) and sex of giant pandas. The reason for using vocalizations is that among the modes of animal communication, sound has the advantages of long transmission distances, strong penetrating power, and rich information. We collected a dataset of calls from 28 captive giant panda individuals, with a total duration of 1298.02 s of recordings. We used MFCC (Mel-frequency Cepstral Coefficients), which is an acoustic feature, as inputs for the SENet. Considering that small datasets are not conducive to convergence in the training process, we increased the size of the training data via SpecAugment. In addition, we used focal loss to reduce the impact of data imbalance. Our results showed that the F1 scores of our method for recognizing age group and sex reached 96.46% ± 5.71% and 85.85% ± 7.99%, respectively, demonstrating that the automatic recognition of giant panda attributes based on their vocalizations is feasible and effective. This more convenient, quick, timesaving, and laborsaving attribute recognition method can be used in the investigation of wild giant pandas in the future. MDPI 2022-10-20 /pmc/articles/PMC9610399/ /pubmed/36298365 http://dx.doi.org/10.3390/s22208015 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhao, Qijun
Zhang, Yanqiu
Hou, Rong
He, Mengnan
Liu, Peng
Xu, Ping
Zhang, Zhihe
Chen, Peng
Automatic Recognition of Giant Panda Attributes from Their Vocalizations Based on Squeeze-and-Excitation Network
title Automatic Recognition of Giant Panda Attributes from Their Vocalizations Based on Squeeze-and-Excitation Network
title_full Automatic Recognition of Giant Panda Attributes from Their Vocalizations Based on Squeeze-and-Excitation Network
title_fullStr Automatic Recognition of Giant Panda Attributes from Their Vocalizations Based on Squeeze-and-Excitation Network
title_full_unstemmed Automatic Recognition of Giant Panda Attributes from Their Vocalizations Based on Squeeze-and-Excitation Network
title_short Automatic Recognition of Giant Panda Attributes from Their Vocalizations Based on Squeeze-and-Excitation Network
title_sort automatic recognition of giant panda attributes from their vocalizations based on squeeze-and-excitation network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610399/
https://www.ncbi.nlm.nih.gov/pubmed/36298365
http://dx.doi.org/10.3390/s22208015
work_keys_str_mv AT zhaoqijun automaticrecognitionofgiantpandaattributesfromtheirvocalizationsbasedonsqueezeandexcitationnetwork
AT zhangyanqiu automaticrecognitionofgiantpandaattributesfromtheirvocalizationsbasedonsqueezeandexcitationnetwork
AT hourong automaticrecognitionofgiantpandaattributesfromtheirvocalizationsbasedonsqueezeandexcitationnetwork
AT hemengnan automaticrecognitionofgiantpandaattributesfromtheirvocalizationsbasedonsqueezeandexcitationnetwork
AT liupeng automaticrecognitionofgiantpandaattributesfromtheirvocalizationsbasedonsqueezeandexcitationnetwork
AT xuping automaticrecognitionofgiantpandaattributesfromtheirvocalizationsbasedonsqueezeandexcitationnetwork
AT zhangzhihe automaticrecognitionofgiantpandaattributesfromtheirvocalizationsbasedonsqueezeandexcitationnetwork
AT chenpeng automaticrecognitionofgiantpandaattributesfromtheirvocalizationsbasedonsqueezeandexcitationnetwork