Cargando…

A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning

SIMPLE SUMMARY: Due to the complexity of the wild environment, wildlife recognition based on camera trap images is challenging. Indeed, as the backgrounds of images captured from the same infrared camera trap are rather similar, shortcut learning of recognition models are produced, resulting in redu...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhong, Yujie, Li, Xiao, Xie, Jiangjian, Zhang, Junguo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10000094/
https://www.ncbi.nlm.nih.gov/pubmed/36899695
http://dx.doi.org/10.3390/ani13050838
_version_ 1784903790475870208
author Zhong, Yujie
Li, Xiao
Xie, Jiangjian
Zhang, Junguo
author_facet Zhong, Yujie
Li, Xiao
Xie, Jiangjian
Zhang, Junguo
author_sort Zhong, Yujie
collection PubMed
description SIMPLE SUMMARY: Due to the complexity of the wild environment, wildlife recognition based on camera trap images is challenging. Indeed, as the backgrounds of images captured from the same infrared camera trap are rather similar, shortcut learning of recognition models are produced, resulting in reduced generality and poor recognition model performance. Therefore, we propose a data augmentation strategy that integrates image synthesis (IS) and regional background suppression (RBS). This strategy alleviates a model’s focus on the background, guiding it to focus on the wildlife in order to improve the model’s generality, resulting in better recognition performance. Furthermore, in order to offer the lightweight recognition model for deep learning-based real-time wildlife monitoring on edge devices, we developed a model compression strategy that combines adaptive pruning and knowledge distillation. The produced lightweight model can reduce the computational effort of wildlife recognition with less loss of accuracy and is beneficial for real-time wildlife monitoring with the use of edge intelligence. ABSTRACT: Recognizing wildlife based on camera trap images is challenging due to the complexity of the wild environment. Deep learning is an optional approach to solve this problem. However, the backgrounds of images captured from the same infrared camera trap are rather similar, and shortcut learning of recognition models occurs, resulting in reduced generality and poor recognition model performance. Therefore, this paper proposes a data augmentation strategy that integrates image synthesis (IS) and regional background suppression (RBS) to enrich the background scene and suppress the existing background information. This strategy alleviates the model’s focus on the background, guiding it to focus on the wildlife in order to improve the model’s generality, resulting in better recognition performance. Furthermore, to offer a lightweight recognition model for deep learning-based real-time wildlife monitoring on edge devices, we develop a model compression strategy that combines adaptive pruning and knowledge distillation. Specifically, a student model is built using a genetic algorithm-based pruning technique and adaptive batch normalization (GA-ABN). A mean square error (MSE) loss-based knowledge distillation method is then used to fine-tune the student model so as to generate a lightweight recognition model. The produced lightweight model can reduce the computational effort of wildlife recognition with only a 4.73% loss in accuracy. Extensive experiments have demonstrated the advantages of our method, which is beneficial for real-time wildlife monitoring with edge intelligence.
format Online
Article
Text
id pubmed-10000094
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100000942023-03-11 A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning Zhong, Yujie Li, Xiao Xie, Jiangjian Zhang, Junguo Animals (Basel) Article SIMPLE SUMMARY: Due to the complexity of the wild environment, wildlife recognition based on camera trap images is challenging. Indeed, as the backgrounds of images captured from the same infrared camera trap are rather similar, shortcut learning of recognition models are produced, resulting in reduced generality and poor recognition model performance. Therefore, we propose a data augmentation strategy that integrates image synthesis (IS) and regional background suppression (RBS). This strategy alleviates a model’s focus on the background, guiding it to focus on the wildlife in order to improve the model’s generality, resulting in better recognition performance. Furthermore, in order to offer the lightweight recognition model for deep learning-based real-time wildlife monitoring on edge devices, we developed a model compression strategy that combines adaptive pruning and knowledge distillation. The produced lightweight model can reduce the computational effort of wildlife recognition with less loss of accuracy and is beneficial for real-time wildlife monitoring with the use of edge intelligence. ABSTRACT: Recognizing wildlife based on camera trap images is challenging due to the complexity of the wild environment. Deep learning is an optional approach to solve this problem. However, the backgrounds of images captured from the same infrared camera trap are rather similar, and shortcut learning of recognition models occurs, resulting in reduced generality and poor recognition model performance. Therefore, this paper proposes a data augmentation strategy that integrates image synthesis (IS) and regional background suppression (RBS) to enrich the background scene and suppress the existing background information. This strategy alleviates the model’s focus on the background, guiding it to focus on the wildlife in order to improve the model’s generality, resulting in better recognition performance. Furthermore, to offer a lightweight recognition model for deep learning-based real-time wildlife monitoring on edge devices, we develop a model compression strategy that combines adaptive pruning and knowledge distillation. Specifically, a student model is built using a genetic algorithm-based pruning technique and adaptive batch normalization (GA-ABN). A mean square error (MSE) loss-based knowledge distillation method is then used to fine-tune the student model so as to generate a lightweight recognition model. The produced lightweight model can reduce the computational effort of wildlife recognition with only a 4.73% loss in accuracy. Extensive experiments have demonstrated the advantages of our method, which is beneficial for real-time wildlife monitoring with edge intelligence. MDPI 2023-02-25 /pmc/articles/PMC10000094/ /pubmed/36899695 http://dx.doi.org/10.3390/ani13050838 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhong, Yujie
Li, Xiao
Xie, Jiangjian
Zhang, Junguo
A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning
title A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning
title_full A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning
title_fullStr A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning
title_full_unstemmed A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning
title_short A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning
title_sort lightweight automatic wildlife recognition model design method mitigating shortcut learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10000094/
https://www.ncbi.nlm.nih.gov/pubmed/36899695
http://dx.doi.org/10.3390/ani13050838
work_keys_str_mv AT zhongyujie alightweightautomaticwildliferecognitionmodeldesignmethodmitigatingshortcutlearning
AT lixiao alightweightautomaticwildliferecognitionmodeldesignmethodmitigatingshortcutlearning
AT xiejiangjian alightweightautomaticwildliferecognitionmodeldesignmethodmitigatingshortcutlearning
AT zhangjunguo alightweightautomaticwildliferecognitionmodeldesignmethodmitigatingshortcutlearning
AT zhongyujie lightweightautomaticwildliferecognitionmodeldesignmethodmitigatingshortcutlearning
AT lixiao lightweightautomaticwildliferecognitionmodeldesignmethodmitigatingshortcutlearning
AT xiejiangjian lightweightautomaticwildliferecognitionmodeldesignmethodmitigatingshortcutlearning
AT zhangjunguo lightweightautomaticwildliferecognitionmodeldesignmethodmitigatingshortcutlearning