Cargando…

A wheat spike detection method based on Transformer

Wheat spike detection has important research significance for production estimation and crop field management. With the development of deep learning-based algorithms, researchers tend to solve the detection task by convolutional neural networks (CNNs). However, traditional CNNs equip with the induct...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Qiong, Huang, Ziliang, Zheng, Shijian, Jiao, Lin, Wang, Liusan, Wang, Rujing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9630921/
https://www.ncbi.nlm.nih.gov/pubmed/36340370
http://dx.doi.org/10.3389/fpls.2022.1023924
_version_ 1784823713526448128
author Zhou, Qiong
Huang, Ziliang
Zheng, Shijian
Jiao, Lin
Wang, Liusan
Wang, Rujing
author_facet Zhou, Qiong
Huang, Ziliang
Zheng, Shijian
Jiao, Lin
Wang, Liusan
Wang, Rujing
author_sort Zhou, Qiong
collection PubMed
description Wheat spike detection has important research significance for production estimation and crop field management. With the development of deep learning-based algorithms, researchers tend to solve the detection task by convolutional neural networks (CNNs). However, traditional CNNs equip with the inductive bias of locality and scale-invariance, which makes it hard to extract global and long-range dependency. In this paper, we propose a Transformer-based network named Multi-Window Swin Transformer (MW-Swin Transformer). Technically, MW-Swin Transformer introduces the ability of feature pyramid network to extract multi-scale features and inherits the characteristic of Swin Transformer that performs self-attention mechanism by window strategy. Moreover, bounding box regression is a crucial step in detection. We propose a Wheat Intersection over Union loss by incorporating the Euclidean distance, area overlapping, and aspect ratio, thereby leading to better detection accuracy. We merge the proposed network and regression loss into a popular detection architecture, fully convolutional one-stage object detection, and name the unified model WheatFormer. Finally, we construct a wheat spike detection dataset (WSD-2022) to evaluate the performance of the proposed methods. The experimental results show that the proposed network outperforms those state-of-the-art algorithms with 0.459 mAP (mean average precision) and 0.918 AP(50). It has been proved that our Transformer-based method is effective to handle wheat spike detection under complex field conditions.
format Online
Article
Text
id pubmed-9630921
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96309212022-11-04 A wheat spike detection method based on Transformer Zhou, Qiong Huang, Ziliang Zheng, Shijian Jiao, Lin Wang, Liusan Wang, Rujing Front Plant Sci Plant Science Wheat spike detection has important research significance for production estimation and crop field management. With the development of deep learning-based algorithms, researchers tend to solve the detection task by convolutional neural networks (CNNs). However, traditional CNNs equip with the inductive bias of locality and scale-invariance, which makes it hard to extract global and long-range dependency. In this paper, we propose a Transformer-based network named Multi-Window Swin Transformer (MW-Swin Transformer). Technically, MW-Swin Transformer introduces the ability of feature pyramid network to extract multi-scale features and inherits the characteristic of Swin Transformer that performs self-attention mechanism by window strategy. Moreover, bounding box regression is a crucial step in detection. We propose a Wheat Intersection over Union loss by incorporating the Euclidean distance, area overlapping, and aspect ratio, thereby leading to better detection accuracy. We merge the proposed network and regression loss into a popular detection architecture, fully convolutional one-stage object detection, and name the unified model WheatFormer. Finally, we construct a wheat spike detection dataset (WSD-2022) to evaluate the performance of the proposed methods. The experimental results show that the proposed network outperforms those state-of-the-art algorithms with 0.459 mAP (mean average precision) and 0.918 AP(50). It has been proved that our Transformer-based method is effective to handle wheat spike detection under complex field conditions. Frontiers Media S.A. 2022-10-20 /pmc/articles/PMC9630921/ /pubmed/36340370 http://dx.doi.org/10.3389/fpls.2022.1023924 Text en Copyright © 2022 Zhou, Huang, Zheng, Jiao, Wang and Wang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Zhou, Qiong
Huang, Ziliang
Zheng, Shijian
Jiao, Lin
Wang, Liusan
Wang, Rujing
A wheat spike detection method based on Transformer
title A wheat spike detection method based on Transformer
title_full A wheat spike detection method based on Transformer
title_fullStr A wheat spike detection method based on Transformer
title_full_unstemmed A wheat spike detection method based on Transformer
title_short A wheat spike detection method based on Transformer
title_sort wheat spike detection method based on transformer
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9630921/
https://www.ncbi.nlm.nih.gov/pubmed/36340370
http://dx.doi.org/10.3389/fpls.2022.1023924
work_keys_str_mv AT zhouqiong awheatspikedetectionmethodbasedontransformer
AT huangziliang awheatspikedetectionmethodbasedontransformer
AT zhengshijian awheatspikedetectionmethodbasedontransformer
AT jiaolin awheatspikedetectionmethodbasedontransformer
AT wangliusan awheatspikedetectionmethodbasedontransformer
AT wangrujing awheatspikedetectionmethodbasedontransformer
AT zhouqiong wheatspikedetectionmethodbasedontransformer
AT huangziliang wheatspikedetectionmethodbasedontransformer
AT zhengshijian wheatspikedetectionmethodbasedontransformer
AT jiaolin wheatspikedetectionmethodbasedontransformer
AT wangliusan wheatspikedetectionmethodbasedontransformer
AT wangrujing wheatspikedetectionmethodbasedontransformer