Cargando…

A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion

In view of the differences in appearance and the complex backgrounds of crop diseases, automatic identification of field diseases is an extremely challenging topic in smart agriculture. To address this challenge, a popular approach is to design a Deep Convolutional Neural Network (DCNN) model that e...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Xuguang, Zhao, Chunjiang, Wang, Chunshan, Wu, Huarui, Miao, Yisheng, Zhang, Jingjian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263697/
https://www.ncbi.nlm.nih.gov/pubmed/35812910
http://dx.doi.org/10.3389/fpls.2022.918940
_version_ 1784742796546015232
author Feng, Xuguang
Zhao, Chunjiang
Wang, Chunshan
Wu, Huarui
Miao, Yisheng
Zhang, Jingjian
author_facet Feng, Xuguang
Zhao, Chunjiang
Wang, Chunshan
Wu, Huarui
Miao, Yisheng
Zhang, Jingjian
author_sort Feng, Xuguang
collection PubMed
description In view of the differences in appearance and the complex backgrounds of crop diseases, automatic identification of field diseases is an extremely challenging topic in smart agriculture. To address this challenge, a popular approach is to design a Deep Convolutional Neural Network (DCNN) model that extracts visual disease features in the images and then identifies the diseases based on the extracted features. This approach performs well under simple background conditions, but has low accuracy and poor robustness under complex backgrounds. In this paper, an end-to-end disease identification model composed of a disease-spot region detector and a disease classifier (YOLOv5s + BiCMT) was proposed. Specifically, the YOLOv5s network was used to detect the disease-spot regions so as to provide a regional attention mechanism to facilitate the disease identification task of the classifier. For the classifier, a Bidirectional Cross-Modal Transformer (BiCMT) model combining the image and text modal information was constructed, which utilizes the correlation and complementarity between the features of the two modalities to achieve the fusion and recognition of disease features. Meanwhile, the problem of inconsistent lengths among different modal data sequences was solved. Eventually, the YOLOv5s + BiCMT model achieved the optimal results on a small dataset. Its Accuracy, Precision, Sensitivity, and Specificity reached 99.23, 97.37, 97.54, and 99.54%, respectively. This paper proves that the bidirectional cross-modal feature fusion by combining disease images and texts is an effective method to identify vegetable diseases in field environments.
format Online
Article
Text
id pubmed-9263697
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92636972022-07-09 A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion Feng, Xuguang Zhao, Chunjiang Wang, Chunshan Wu, Huarui Miao, Yisheng Zhang, Jingjian Front Plant Sci Plant Science In view of the differences in appearance and the complex backgrounds of crop diseases, automatic identification of field diseases is an extremely challenging topic in smart agriculture. To address this challenge, a popular approach is to design a Deep Convolutional Neural Network (DCNN) model that extracts visual disease features in the images and then identifies the diseases based on the extracted features. This approach performs well under simple background conditions, but has low accuracy and poor robustness under complex backgrounds. In this paper, an end-to-end disease identification model composed of a disease-spot region detector and a disease classifier (YOLOv5s + BiCMT) was proposed. Specifically, the YOLOv5s network was used to detect the disease-spot regions so as to provide a regional attention mechanism to facilitate the disease identification task of the classifier. For the classifier, a Bidirectional Cross-Modal Transformer (BiCMT) model combining the image and text modal information was constructed, which utilizes the correlation and complementarity between the features of the two modalities to achieve the fusion and recognition of disease features. Meanwhile, the problem of inconsistent lengths among different modal data sequences was solved. Eventually, the YOLOv5s + BiCMT model achieved the optimal results on a small dataset. Its Accuracy, Precision, Sensitivity, and Specificity reached 99.23, 97.37, 97.54, and 99.54%, respectively. This paper proves that the bidirectional cross-modal feature fusion by combining disease images and texts is an effective method to identify vegetable diseases in field environments. Frontiers Media S.A. 2022-06-24 /pmc/articles/PMC9263697/ /pubmed/35812910 http://dx.doi.org/10.3389/fpls.2022.918940 Text en Copyright © 2022 Feng, Zhao, Wang, Wu, Miao and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Feng, Xuguang
Zhao, Chunjiang
Wang, Chunshan
Wu, Huarui
Miao, Yisheng
Zhang, Jingjian
A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion
title A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion
title_full A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion
title_fullStr A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion
title_full_unstemmed A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion
title_short A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion
title_sort vegetable leaf disease identification model based on image-text cross-modal feature fusion
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263697/
https://www.ncbi.nlm.nih.gov/pubmed/35812910
http://dx.doi.org/10.3389/fpls.2022.918940
work_keys_str_mv AT fengxuguang avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT zhaochunjiang avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT wangchunshan avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT wuhuarui avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT miaoyisheng avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT zhangjingjian avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT fengxuguang vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT zhaochunjiang vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT wangchunshan vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT wuhuarui vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT miaoyisheng vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion
AT zhangjingjian vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion