Cargando…
A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion
In view of the differences in appearance and the complex backgrounds of crop diseases, automatic identification of field diseases is an extremely challenging topic in smart agriculture. To address this challenge, a popular approach is to design a Deep Convolutional Neural Network (DCNN) model that e...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263697/ https://www.ncbi.nlm.nih.gov/pubmed/35812910 http://dx.doi.org/10.3389/fpls.2022.918940 |
_version_ | 1784742796546015232 |
---|---|
author | Feng, Xuguang Zhao, Chunjiang Wang, Chunshan Wu, Huarui Miao, Yisheng Zhang, Jingjian |
author_facet | Feng, Xuguang Zhao, Chunjiang Wang, Chunshan Wu, Huarui Miao, Yisheng Zhang, Jingjian |
author_sort | Feng, Xuguang |
collection | PubMed |
description | In view of the differences in appearance and the complex backgrounds of crop diseases, automatic identification of field diseases is an extremely challenging topic in smart agriculture. To address this challenge, a popular approach is to design a Deep Convolutional Neural Network (DCNN) model that extracts visual disease features in the images and then identifies the diseases based on the extracted features. This approach performs well under simple background conditions, but has low accuracy and poor robustness under complex backgrounds. In this paper, an end-to-end disease identification model composed of a disease-spot region detector and a disease classifier (YOLOv5s + BiCMT) was proposed. Specifically, the YOLOv5s network was used to detect the disease-spot regions so as to provide a regional attention mechanism to facilitate the disease identification task of the classifier. For the classifier, a Bidirectional Cross-Modal Transformer (BiCMT) model combining the image and text modal information was constructed, which utilizes the correlation and complementarity between the features of the two modalities to achieve the fusion and recognition of disease features. Meanwhile, the problem of inconsistent lengths among different modal data sequences was solved. Eventually, the YOLOv5s + BiCMT model achieved the optimal results on a small dataset. Its Accuracy, Precision, Sensitivity, and Specificity reached 99.23, 97.37, 97.54, and 99.54%, respectively. This paper proves that the bidirectional cross-modal feature fusion by combining disease images and texts is an effective method to identify vegetable diseases in field environments. |
format | Online Article Text |
id | pubmed-9263697 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92636972022-07-09 A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion Feng, Xuguang Zhao, Chunjiang Wang, Chunshan Wu, Huarui Miao, Yisheng Zhang, Jingjian Front Plant Sci Plant Science In view of the differences in appearance and the complex backgrounds of crop diseases, automatic identification of field diseases is an extremely challenging topic in smart agriculture. To address this challenge, a popular approach is to design a Deep Convolutional Neural Network (DCNN) model that extracts visual disease features in the images and then identifies the diseases based on the extracted features. This approach performs well under simple background conditions, but has low accuracy and poor robustness under complex backgrounds. In this paper, an end-to-end disease identification model composed of a disease-spot region detector and a disease classifier (YOLOv5s + BiCMT) was proposed. Specifically, the YOLOv5s network was used to detect the disease-spot regions so as to provide a regional attention mechanism to facilitate the disease identification task of the classifier. For the classifier, a Bidirectional Cross-Modal Transformer (BiCMT) model combining the image and text modal information was constructed, which utilizes the correlation and complementarity between the features of the two modalities to achieve the fusion and recognition of disease features. Meanwhile, the problem of inconsistent lengths among different modal data sequences was solved. Eventually, the YOLOv5s + BiCMT model achieved the optimal results on a small dataset. Its Accuracy, Precision, Sensitivity, and Specificity reached 99.23, 97.37, 97.54, and 99.54%, respectively. This paper proves that the bidirectional cross-modal feature fusion by combining disease images and texts is an effective method to identify vegetable diseases in field environments. Frontiers Media S.A. 2022-06-24 /pmc/articles/PMC9263697/ /pubmed/35812910 http://dx.doi.org/10.3389/fpls.2022.918940 Text en Copyright © 2022 Feng, Zhao, Wang, Wu, Miao and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Feng, Xuguang Zhao, Chunjiang Wang, Chunshan Wu, Huarui Miao, Yisheng Zhang, Jingjian A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion |
title | A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion |
title_full | A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion |
title_fullStr | A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion |
title_full_unstemmed | A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion |
title_short | A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion |
title_sort | vegetable leaf disease identification model based on image-text cross-modal feature fusion |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263697/ https://www.ncbi.nlm.nih.gov/pubmed/35812910 http://dx.doi.org/10.3389/fpls.2022.918940 |
work_keys_str_mv | AT fengxuguang avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT zhaochunjiang avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT wangchunshan avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT wuhuarui avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT miaoyisheng avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT zhangjingjian avegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT fengxuguang vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT zhaochunjiang vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT wangchunshan vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT wuhuarui vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT miaoyisheng vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion AT zhangjingjian vegetableleafdiseaseidentificationmodelbasedonimagetextcrossmodalfeaturefusion |