Cargando…

Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images

Single-modal images carry limited information for features representation, and RGB images fail to detect grass weeds in wheat fields because of their similarity to wheat in shape. We propose a framework based on multi-modal information fusion for accurate detection of weeds in wheat fields in a natu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Xu, Ke, Zhu, Yan, Cao, Weixing, Jiang, Xiaoping, Jiang, Zhijian, Li, Shuailong, Ni, Jun
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Plant Science
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8604282/ https://www.ncbi.nlm.nih.gov/pubmed/34804085 http://dx.doi.org/10.3389/fpls.2021.732968

_version_	1784601923510337536
author	Xu, Ke Zhu, Yan Cao, Weixing Jiang, Xiaoping Jiang, Zhijian Li, Shuailong Ni, Jun
author_facet	Xu, Ke Zhu, Yan Cao, Weixing Jiang, Xiaoping Jiang, Zhijian Li, Shuailong Ni, Jun
author_sort	Xu, Ke
collection	PubMed
description	Single-modal images carry limited information for features representation, and RGB images fail to detect grass weeds in wheat fields because of their similarity to wheat in shape. We propose a framework based on multi-modal information fusion for accurate detection of weeds in wheat fields in a natural environment, overcoming the limitation of single modality in weeds detection. Firstly, we recode the single-channel depth image into a new three-channel image like the structure of RGB image, which is suitable for feature extraction of convolutional neural network (CNN). Secondly, the multi-scale object detection is realized by fusing the feature maps output by different convolutional layers. The three-channel network structure is designed to take into account the independence of RGB and depth information, respectively, and the complementarity of multi-modal information, and the integrated learning is carried out by weight allocation at the decision level to realize the effective fusion of multi-modal information. The experimental results show that compared with the weed detection method based on RGB image, the accuracy of our method is significantly improved. Experiments with integrated learning shows that mean average precision (mAP) of 36.1% for grass weeds and 42.9% for broad-leaf weeds, and the overall detection precision, as indicated by intersection over ground truth (IoG), is 89.3%, with weights of RGB and depth images at α = 0.4 and β = 0.3. The results suggest that our methods can accurately detect the dominant species of weeds in wheat fields, and that multi-modal fusion can effectively improve object detection performance.
format	Online Article Text
id	pubmed-8604282
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-86042822021-11-20 Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images Xu, Ke Zhu, Yan Cao, Weixing Jiang, Xiaoping Jiang, Zhijian Li, Shuailong Ni, Jun Front Plant Sci Plant Science Single-modal images carry limited information for features representation, and RGB images fail to detect grass weeds in wheat fields because of their similarity to wheat in shape. We propose a framework based on multi-modal information fusion for accurate detection of weeds in wheat fields in a natural environment, overcoming the limitation of single modality in weeds detection. Firstly, we recode the single-channel depth image into a new three-channel image like the structure of RGB image, which is suitable for feature extraction of convolutional neural network (CNN). Secondly, the multi-scale object detection is realized by fusing the feature maps output by different convolutional layers. The three-channel network structure is designed to take into account the independence of RGB and depth information, respectively, and the complementarity of multi-modal information, and the integrated learning is carried out by weight allocation at the decision level to realize the effective fusion of multi-modal information. The experimental results show that compared with the weed detection method based on RGB image, the accuracy of our method is significantly improved. Experiments with integrated learning shows that mean average precision (mAP) of 36.1% for grass weeds and 42.9% for broad-leaf weeds, and the overall detection precision, as indicated by intersection over ground truth (IoG), is 89.3%, with weights of RGB and depth images at α = 0.4 and β = 0.3. The results suggest that our methods can accurately detect the dominant species of weeds in wheat fields, and that multi-modal fusion can effectively improve object detection performance. Frontiers Media S.A. 2021-11-05 /pmc/articles/PMC8604282/ /pubmed/34804085 http://dx.doi.org/10.3389/fpls.2021.732968 Text en Copyright © 2021 Xu, Zhu, Cao, Jiang, Jiang, Li and Ni. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Plant Science Xu, Ke Zhu, Yan Cao, Weixing Jiang, Xiaoping Jiang, Zhijian Li, Shuailong Ni, Jun Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images
title	Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images
title_full	Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images
title_fullStr	Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images
title_full_unstemmed	Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images
title_short	Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images
title_sort	multi-modal deep learning for weeds detection in wheat field based on rgb-d images
topic	Plant Science
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8604282/ https://www.ncbi.nlm.nih.gov/pubmed/34804085 http://dx.doi.org/10.3389/fpls.2021.732968
work_keys_str_mv	AT xuke multimodaldeeplearningforweedsdetectioninwheatfieldbasedonrgbdimages AT zhuyan multimodaldeeplearningforweedsdetectioninwheatfieldbasedonrgbdimages AT caoweixing multimodaldeeplearningforweedsdetectioninwheatfieldbasedonrgbdimages AT jiangxiaoping multimodaldeeplearningforweedsdetectioninwheatfieldbasedonrgbdimages AT jiangzhijian multimodaldeeplearningforweedsdetectioninwheatfieldbasedonrgbdimages AT lishuailong multimodaldeeplearningforweedsdetectioninwheatfieldbasedonrgbdimages AT nijun multimodaldeeplearningforweedsdetectioninwheatfieldbasedonrgbdimages

Multi-Modal Deep Learning for Weeds Detection in Wheat Field Based on RGB-D Images

Ejemplares similares