Cargando…

Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images

The field of computer vision has shown great potential for the identification of crops at large scales based on multispectral images. However, the challenge in designing crop identification networks lies in striking a balance between accuracy and a lightweight framework. Furthermore, there is a lack...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Yimin, Meng, Ao, Wu, Yanjun, Zou, Le, Jin, Zhou, Xu, Taosheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327894/
https://www.ncbi.nlm.nih.gov/pubmed/37426958
http://dx.doi.org/10.3389/fpls.2023.1124939
_version_ 1785069694012620800
author Hu, Yimin
Meng, Ao
Wu, Yanjun
Zou, Le
Jin, Zhou
Xu, Taosheng
author_facet Hu, Yimin
Meng, Ao
Wu, Yanjun
Zou, Le
Jin, Zhou
Xu, Taosheng
author_sort Hu, Yimin
collection PubMed
description The field of computer vision has shown great potential for the identification of crops at large scales based on multispectral images. However, the challenge in designing crop identification networks lies in striking a balance between accuracy and a lightweight framework. Furthermore, there is a lack of accurate recognition methods for non-large-scale crops. In this paper, we propose an improved encoder-decoder framework based on DeepLab v3+ to accurately identify crops with different planting patterns. The network employs ShuffleNet v2 as the backbone to extract features at multiple levels. The decoder module integrates a convolutional block attention mechanism that combines both channel and spatial attention mechanisms to fuse attention features across the channel and spatial dimensions. We establish two datasets, DS1 and DS2, where DS1 is obtained from areas with large-scale crop planting, and DS2 is obtained from areas with scattered crop planting. On DS1, the improved network achieves a mean intersection over union (mIoU) of 0.972, overall accuracy (OA) of 0.981, and recall of 0.980, indicating a significant improvement of 7.0%, 5.0%, and 5.7%, respectively, compared to the original DeepLab v3+. On DS2, the improved network improves the mIoU, OA, and recall by 5.4%, 3.9%, and 4.4%, respectively. Notably, the number of parameters and giga floating-point operations (GFLOPs) required by the proposed Deep-agriNet is significantly smaller than that of DeepLab v3+ and other classic networks. Our findings demonstrate that Deep-agriNet performs better in identifying crops with different planting scales, and can serve as an effective tool for crop identification in various regions and countries.
format Online
Article
Text
id pubmed-10327894
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103278942023-07-08 Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images Hu, Yimin Meng, Ao Wu, Yanjun Zou, Le Jin, Zhou Xu, Taosheng Front Plant Sci Plant Science The field of computer vision has shown great potential for the identification of crops at large scales based on multispectral images. However, the challenge in designing crop identification networks lies in striking a balance between accuracy and a lightweight framework. Furthermore, there is a lack of accurate recognition methods for non-large-scale crops. In this paper, we propose an improved encoder-decoder framework based on DeepLab v3+ to accurately identify crops with different planting patterns. The network employs ShuffleNet v2 as the backbone to extract features at multiple levels. The decoder module integrates a convolutional block attention mechanism that combines both channel and spatial attention mechanisms to fuse attention features across the channel and spatial dimensions. We establish two datasets, DS1 and DS2, where DS1 is obtained from areas with large-scale crop planting, and DS2 is obtained from areas with scattered crop planting. On DS1, the improved network achieves a mean intersection over union (mIoU) of 0.972, overall accuracy (OA) of 0.981, and recall of 0.980, indicating a significant improvement of 7.0%, 5.0%, and 5.7%, respectively, compared to the original DeepLab v3+. On DS2, the improved network improves the mIoU, OA, and recall by 5.4%, 3.9%, and 4.4%, respectively. Notably, the number of parameters and giga floating-point operations (GFLOPs) required by the proposed Deep-agriNet is significantly smaller than that of DeepLab v3+ and other classic networks. Our findings demonstrate that Deep-agriNet performs better in identifying crops with different planting scales, and can serve as an effective tool for crop identification in various regions and countries. Frontiers Media S.A. 2023-04-18 /pmc/articles/PMC10327894/ /pubmed/37426958 http://dx.doi.org/10.3389/fpls.2023.1124939 Text en Copyright © 2023 Hu, Meng, Wu, Zou, Jin and Xu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Hu, Yimin
Meng, Ao
Wu, Yanjun
Zou, Le
Jin, Zhou
Xu, Taosheng
Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_full Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_fullStr Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_full_unstemmed Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_short Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_sort deep-agrinet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327894/
https://www.ncbi.nlm.nih.gov/pubmed/37426958
http://dx.doi.org/10.3389/fpls.2023.1124939
work_keys_str_mv AT huyimin deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages
AT mengao deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages
AT wuyanjun deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages
AT zoule deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages
AT jinzhou deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages
AT xutaosheng deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages