Cargando…
A wheat spike detection method based on Transformer
Wheat spike detection has important research significance for production estimation and crop field management. With the development of deep learning-based algorithms, researchers tend to solve the detection task by convolutional neural networks (CNNs). However, traditional CNNs equip with the induct...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9630921/ https://www.ncbi.nlm.nih.gov/pubmed/36340370 http://dx.doi.org/10.3389/fpls.2022.1023924 |
_version_ | 1784823713526448128 |
---|---|
author | Zhou, Qiong Huang, Ziliang Zheng, Shijian Jiao, Lin Wang, Liusan Wang, Rujing |
author_facet | Zhou, Qiong Huang, Ziliang Zheng, Shijian Jiao, Lin Wang, Liusan Wang, Rujing |
author_sort | Zhou, Qiong |
collection | PubMed |
description | Wheat spike detection has important research significance for production estimation and crop field management. With the development of deep learning-based algorithms, researchers tend to solve the detection task by convolutional neural networks (CNNs). However, traditional CNNs equip with the inductive bias of locality and scale-invariance, which makes it hard to extract global and long-range dependency. In this paper, we propose a Transformer-based network named Multi-Window Swin Transformer (MW-Swin Transformer). Technically, MW-Swin Transformer introduces the ability of feature pyramid network to extract multi-scale features and inherits the characteristic of Swin Transformer that performs self-attention mechanism by window strategy. Moreover, bounding box regression is a crucial step in detection. We propose a Wheat Intersection over Union loss by incorporating the Euclidean distance, area overlapping, and aspect ratio, thereby leading to better detection accuracy. We merge the proposed network and regression loss into a popular detection architecture, fully convolutional one-stage object detection, and name the unified model WheatFormer. Finally, we construct a wheat spike detection dataset (WSD-2022) to evaluate the performance of the proposed methods. The experimental results show that the proposed network outperforms those state-of-the-art algorithms with 0.459 mAP (mean average precision) and 0.918 AP(50). It has been proved that our Transformer-based method is effective to handle wheat spike detection under complex field conditions. |
format | Online Article Text |
id | pubmed-9630921 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-96309212022-11-04 A wheat spike detection method based on Transformer Zhou, Qiong Huang, Ziliang Zheng, Shijian Jiao, Lin Wang, Liusan Wang, Rujing Front Plant Sci Plant Science Wheat spike detection has important research significance for production estimation and crop field management. With the development of deep learning-based algorithms, researchers tend to solve the detection task by convolutional neural networks (CNNs). However, traditional CNNs equip with the inductive bias of locality and scale-invariance, which makes it hard to extract global and long-range dependency. In this paper, we propose a Transformer-based network named Multi-Window Swin Transformer (MW-Swin Transformer). Technically, MW-Swin Transformer introduces the ability of feature pyramid network to extract multi-scale features and inherits the characteristic of Swin Transformer that performs self-attention mechanism by window strategy. Moreover, bounding box regression is a crucial step in detection. We propose a Wheat Intersection over Union loss by incorporating the Euclidean distance, area overlapping, and aspect ratio, thereby leading to better detection accuracy. We merge the proposed network and regression loss into a popular detection architecture, fully convolutional one-stage object detection, and name the unified model WheatFormer. Finally, we construct a wheat spike detection dataset (WSD-2022) to evaluate the performance of the proposed methods. The experimental results show that the proposed network outperforms those state-of-the-art algorithms with 0.459 mAP (mean average precision) and 0.918 AP(50). It has been proved that our Transformer-based method is effective to handle wheat spike detection under complex field conditions. Frontiers Media S.A. 2022-10-20 /pmc/articles/PMC9630921/ /pubmed/36340370 http://dx.doi.org/10.3389/fpls.2022.1023924 Text en Copyright © 2022 Zhou, Huang, Zheng, Jiao, Wang and Wang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Zhou, Qiong Huang, Ziliang Zheng, Shijian Jiao, Lin Wang, Liusan Wang, Rujing A wheat spike detection method based on Transformer |
title | A wheat spike detection method based on Transformer |
title_full | A wheat spike detection method based on Transformer |
title_fullStr | A wheat spike detection method based on Transformer |
title_full_unstemmed | A wheat spike detection method based on Transformer |
title_short | A wheat spike detection method based on Transformer |
title_sort | wheat spike detection method based on transformer |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9630921/ https://www.ncbi.nlm.nih.gov/pubmed/36340370 http://dx.doi.org/10.3389/fpls.2022.1023924 |
work_keys_str_mv | AT zhouqiong awheatspikedetectionmethodbasedontransformer AT huangziliang awheatspikedetectionmethodbasedontransformer AT zhengshijian awheatspikedetectionmethodbasedontransformer AT jiaolin awheatspikedetectionmethodbasedontransformer AT wangliusan awheatspikedetectionmethodbasedontransformer AT wangrujing awheatspikedetectionmethodbasedontransformer AT zhouqiong wheatspikedetectionmethodbasedontransformer AT huangziliang wheatspikedetectionmethodbasedontransformer AT zhengshijian wheatspikedetectionmethodbasedontransformer AT jiaolin wheatspikedetectionmethodbasedontransformer AT wangliusan wheatspikedetectionmethodbasedontransformer AT wangrujing wheatspikedetectionmethodbasedontransformer |