Cargando…
Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images
The field of computer vision has shown great potential for the identification of crops at large scales based on multispectral images. However, the challenge in designing crop identification networks lies in striking a balance between accuracy and a lightweight framework. Furthermore, there is a lack...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327894/ https://www.ncbi.nlm.nih.gov/pubmed/37426958 http://dx.doi.org/10.3389/fpls.2023.1124939 |
_version_ | 1785069694012620800 |
---|---|
author | Hu, Yimin Meng, Ao Wu, Yanjun Zou, Le Jin, Zhou Xu, Taosheng |
author_facet | Hu, Yimin Meng, Ao Wu, Yanjun Zou, Le Jin, Zhou Xu, Taosheng |
author_sort | Hu, Yimin |
collection | PubMed |
description | The field of computer vision has shown great potential for the identification of crops at large scales based on multispectral images. However, the challenge in designing crop identification networks lies in striking a balance between accuracy and a lightweight framework. Furthermore, there is a lack of accurate recognition methods for non-large-scale crops. In this paper, we propose an improved encoder-decoder framework based on DeepLab v3+ to accurately identify crops with different planting patterns. The network employs ShuffleNet v2 as the backbone to extract features at multiple levels. The decoder module integrates a convolutional block attention mechanism that combines both channel and spatial attention mechanisms to fuse attention features across the channel and spatial dimensions. We establish two datasets, DS1 and DS2, where DS1 is obtained from areas with large-scale crop planting, and DS2 is obtained from areas with scattered crop planting. On DS1, the improved network achieves a mean intersection over union (mIoU) of 0.972, overall accuracy (OA) of 0.981, and recall of 0.980, indicating a significant improvement of 7.0%, 5.0%, and 5.7%, respectively, compared to the original DeepLab v3+. On DS2, the improved network improves the mIoU, OA, and recall by 5.4%, 3.9%, and 4.4%, respectively. Notably, the number of parameters and giga floating-point operations (GFLOPs) required by the proposed Deep-agriNet is significantly smaller than that of DeepLab v3+ and other classic networks. Our findings demonstrate that Deep-agriNet performs better in identifying crops with different planting scales, and can serve as an effective tool for crop identification in various regions and countries. |
format | Online Article Text |
id | pubmed-10327894 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-103278942023-07-08 Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images Hu, Yimin Meng, Ao Wu, Yanjun Zou, Le Jin, Zhou Xu, Taosheng Front Plant Sci Plant Science The field of computer vision has shown great potential for the identification of crops at large scales based on multispectral images. However, the challenge in designing crop identification networks lies in striking a balance between accuracy and a lightweight framework. Furthermore, there is a lack of accurate recognition methods for non-large-scale crops. In this paper, we propose an improved encoder-decoder framework based on DeepLab v3+ to accurately identify crops with different planting patterns. The network employs ShuffleNet v2 as the backbone to extract features at multiple levels. The decoder module integrates a convolutional block attention mechanism that combines both channel and spatial attention mechanisms to fuse attention features across the channel and spatial dimensions. We establish two datasets, DS1 and DS2, where DS1 is obtained from areas with large-scale crop planting, and DS2 is obtained from areas with scattered crop planting. On DS1, the improved network achieves a mean intersection over union (mIoU) of 0.972, overall accuracy (OA) of 0.981, and recall of 0.980, indicating a significant improvement of 7.0%, 5.0%, and 5.7%, respectively, compared to the original DeepLab v3+. On DS2, the improved network improves the mIoU, OA, and recall by 5.4%, 3.9%, and 4.4%, respectively. Notably, the number of parameters and giga floating-point operations (GFLOPs) required by the proposed Deep-agriNet is significantly smaller than that of DeepLab v3+ and other classic networks. Our findings demonstrate that Deep-agriNet performs better in identifying crops with different planting scales, and can serve as an effective tool for crop identification in various regions and countries. Frontiers Media S.A. 2023-04-18 /pmc/articles/PMC10327894/ /pubmed/37426958 http://dx.doi.org/10.3389/fpls.2023.1124939 Text en Copyright © 2023 Hu, Meng, Wu, Zou, Jin and Xu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Hu, Yimin Meng, Ao Wu, Yanjun Zou, Le Jin, Zhou Xu, Taosheng Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images |
title | Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images |
title_full | Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images |
title_fullStr | Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images |
title_full_unstemmed | Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images |
title_short | Deep-agriNet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images |
title_sort | deep-agrinet: a lightweight attention-based encoder-decoder framework for crop identification using multispectral images |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327894/ https://www.ncbi.nlm.nih.gov/pubmed/37426958 http://dx.doi.org/10.3389/fpls.2023.1124939 |
work_keys_str_mv | AT huyimin deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT mengao deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT wuyanjun deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT zoule deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT jinzhou deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages AT xutaosheng deepagrinetalightweightattentionbasedencoderdecoderframeworkforcropidentificationusingmultispectralimages |