Cargando…

Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images

The field of computer vision has shown great potential for the identification of crops at large scales based on multispectral images. However, the challenge in designing crop identification networks lies in striking a balance between accuracy and a lightweight framework. Furthermore, there is a lack...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hu, Yimin, Meng, Ao, Wu, Yanjun, Zou, Le, Jin, Zhou, Xu, Taosheng
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Plant Science
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327894/ https://www.ncbi.nlm.nih.gov/pubmed/37426958 http://dx.doi.org/10.3389/fpls.2023.1124939

_version_	1785069694012620800
author	Hu, Yimin Meng, Ao Wu, Yanjun Zou, Le Jin, Zhou Xu, Taosheng
author_facet	Hu, Yimin Meng, Ao Wu, Yanjun Zou, Le Jin, Zhou Xu, Taosheng
author_sort	Hu, Yimin
collection	PubMed
description	The field of computer vision has shown great potential for the identification of crops at large scales based on multispectral images. However, the challenge in designing crop identification networks lies in striking a balance between accuracy and a lightweight framework. Furthermore, there is a lack of accurate recognition methods for non-large-scale crops. In this paper, we propose an improved encoder-decoder framework based on DeepLab v3+ to accurately identify crops with different planting patterns. The network employs ShuffleNet v2 as the backbone to extract features at multiple levels. The decoder module integrates a convolutional block attention mechanism that combines both channel and spatial attention mechanisms to fuse attention features across the channel and spatial dimensions. We establish two datasets, DS1 and DS2, where DS1 is obtained from areas with large-scale crop planting, and DS2 is obtained from areas with scattered crop planting. On DS1, the improved network achieves a mean intersection over union (mIoU) of 0.972, overall accuracy (OA) of 0.981, and recall of 0.980, indicating a significant improvement of 7.0%, 5.0%, and 5.7%, respectively, compared to the original DeepLab v3+. On DS2, the improved network improves the mIoU, OA, and recall by 5.4%, 3.9%, and 4.4%, respectively. Notably, the number of parameters and giga floating-point operations (GFLOPs) required by the proposed Deep-agriNet is significantly smaller than that of DeepLab v3+ and other classic networks. Our findings demonstrate that Deep-agriNet performs better in identifying crops with different planting scales, and can serve as an effective tool for crop identification in various regions and countries.
format	Online Article Text
id	pubmed-10327894
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-103278942023-07-08 Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images Hu, Yimin Meng, Ao Wu, Yanjun Zou, Le Jin, Zhou Xu, Taosheng Front Plant Sci Plant Science The field of computer vision has shown great potential for the identification of crops at large scales based on multispectral images. However, the challenge in designing crop identification networks lies in striking a balance between accuracy and a lightweight framework. Furthermore, there is a lack of accurate recognition methods for non-large-scale crops. In this paper, we propose an improved encoder-decoder framework based on DeepLab v3+ to accurately identify crops with different planting patterns. The network employs ShuffleNet v2 as the backbone to extract features at multiple levels. The decoder module integrates a convolutional block attention mechanism that combines both channel and spatial attention mechanisms to fuse attention features across the channel and spatial dimensions. We establish two datasets, DS1 and DS2, where DS1 is obtained from areas with large-scale crop planting, and DS2 is obtained from areas with scattered crop planting. On DS1, the improved network achieves a mean intersection over union (mIoU) of 0.972, overall accuracy (OA) of 0.981, and recall of 0.980, indicating a significant improvement of 7.0%, 5.0%, and 5.7%, respectively, compared to the original DeepLab v3+. On DS2, the improved network improves the mIoU, OA, and recall by 5.4%, 3.9%, and 4.4%, respectively. Notably, the number of parameters and giga floating-point operations (GFLOPs) required by the proposed Deep-agriNet is significantly smaller than that of DeepLab v3+ and other classic networks. Our findings demonstrate that Deep-agriNet performs better in identifying crops with different planting scales, and can serve as an effective tool for crop identification in various regions and countries. Frontiers Media S.A. 2023-04-18 /pmc/articles/PMC10327894/ /pubmed/37426958 http://dx.doi.org/10.3389/fpls.2023.1124939 Text en Copyright © 2023 Hu, Meng, Wu, Zou, Jin and Xu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Plant Science Hu, Yimin Meng, Ao Wu, Yanjun Zou, Le Jin, Zhou Xu, Taosheng Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title	Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_full	Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_fullStr	Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_full_unstemmed	Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_short	Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
title_sort	deep-agrinet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
topic	Plant Science
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327894/ https://www.ncbi.nlm.nih.gov/pubmed/37426958 http://dx.doi.org/10.3389/fpls.2023.1124939
work_keys_str_mv	AT huyimin deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT mengao deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT wuyanjun deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT zoule deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT jinzhou deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT xutaosheng deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages

Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images

Ejemplares similares