Cargando…

PMVT: a lightweight vision transformer for plant disease identification on mobile devices

Due to the constraints of agricultural computing resources and the diversity of plant diseases, it is challenging to achieve the desired accuracy rate while keeping the network lightweight. In this paper, we proposed a computationally efficient deep learning architecture based on the mobile vision t...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Guoqiang, Wang, Yuchao, Zhao, Qing, Yuan, Peiyan, Chang, Baofang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562605/
https://www.ncbi.nlm.nih.gov/pubmed/37822342
http://dx.doi.org/10.3389/fpls.2023.1256773
_version_ 1785118165223604224
author Li, Guoqiang
Wang, Yuchao
Zhao, Qing
Yuan, Peiyan
Chang, Baofang
author_facet Li, Guoqiang
Wang, Yuchao
Zhao, Qing
Yuan, Peiyan
Chang, Baofang
author_sort Li, Guoqiang
collection PubMed
description Due to the constraints of agricultural computing resources and the diversity of plant diseases, it is challenging to achieve the desired accuracy rate while keeping the network lightweight. In this paper, we proposed a computationally efficient deep learning architecture based on the mobile vision transformer (MobileViT) for real-time detection of plant diseases, which we called plant-based MobileViT (PMVT). Our proposed model was designed to be highly accurate and low-cost, making it suitable for deployment on mobile devices with limited resources. Specifically, we replaced the convolution block in MobileViT with an inverted residual structure that employs a 7×7 convolution kernel to effectively model long-distance dependencies between different leaves in plant disease images. Furthermore, inspired by the concept of multi-level attention in computer vision tasks, we integrated a convolutional block attention module (CBAM) into the standard ViT encoder. This integration allows the network to effectively avoid irrelevant information and focus on essential features. The PMVT network achieves reduced parameter counts compared to alternative networks on various mobile devices while maintaining high accuracy across different vision tasks. Extensive experiments on multiple agricultural datasets, including wheat, coffee, and rice, demonstrate that the proposed method outperforms the current best lightweight and heavyweight models. On the wheat dataset, PMVT achieves the highest accuracy of 93.6% using approximately 0.98 million (M) parameters. This accuracy is 1.6% higher than that of MobileNetV3. Under the same parameters, PMVT achieved an accuracy of 85.4% on the coffee dataset, surpassing SqueezeNet by 2.3%. Furthermore, out method achieved an accuracy of 93.1% on the rice dataset, surpassing MobileNetV3 by 3.4%. Additionally, we developed a plant disease diagnosis app and successfully used the trained PMVT model to identify plant disease in different scenarios.
format Online
Article
Text
id pubmed-10562605
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-105626052023-10-11 PMVT: a lightweight vision transformer for plant disease identification on mobile devices Li, Guoqiang Wang, Yuchao Zhao, Qing Yuan, Peiyan Chang, Baofang Front Plant Sci Plant Science Due to the constraints of agricultural computing resources and the diversity of plant diseases, it is challenging to achieve the desired accuracy rate while keeping the network lightweight. In this paper, we proposed a computationally efficient deep learning architecture based on the mobile vision transformer (MobileViT) for real-time detection of plant diseases, which we called plant-based MobileViT (PMVT). Our proposed model was designed to be highly accurate and low-cost, making it suitable for deployment on mobile devices with limited resources. Specifically, we replaced the convolution block in MobileViT with an inverted residual structure that employs a 7×7 convolution kernel to effectively model long-distance dependencies between different leaves in plant disease images. Furthermore, inspired by the concept of multi-level attention in computer vision tasks, we integrated a convolutional block attention module (CBAM) into the standard ViT encoder. This integration allows the network to effectively avoid irrelevant information and focus on essential features. The PMVT network achieves reduced parameter counts compared to alternative networks on various mobile devices while maintaining high accuracy across different vision tasks. Extensive experiments on multiple agricultural datasets, including wheat, coffee, and rice, demonstrate that the proposed method outperforms the current best lightweight and heavyweight models. On the wheat dataset, PMVT achieves the highest accuracy of 93.6% using approximately 0.98 million (M) parameters. This accuracy is 1.6% higher than that of MobileNetV3. Under the same parameters, PMVT achieved an accuracy of 85.4% on the coffee dataset, surpassing SqueezeNet by 2.3%. Furthermore, out method achieved an accuracy of 93.1% on the rice dataset, surpassing MobileNetV3 by 3.4%. Additionally, we developed a plant disease diagnosis app and successfully used the trained PMVT model to identify plant disease in different scenarios. Frontiers Media S.A. 2023-09-26 /pmc/articles/PMC10562605/ /pubmed/37822342 http://dx.doi.org/10.3389/fpls.2023.1256773 Text en Copyright © 2023 Li, Wang, Zhao, Yuan and Chang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Li, Guoqiang
Wang, Yuchao
Zhao, Qing
Yuan, Peiyan
Chang, Baofang
PMVT: a lightweight vision transformer for plant disease identification on mobile devices
title PMVT: a lightweight vision transformer for plant disease identification on mobile devices
title_full PMVT: a lightweight vision transformer for plant disease identification on mobile devices
title_fullStr PMVT: a lightweight vision transformer for plant disease identification on mobile devices
title_full_unstemmed PMVT: a lightweight vision transformer for plant disease identification on mobile devices
title_short PMVT: a lightweight vision transformer for plant disease identification on mobile devices
title_sort pmvt: a lightweight vision transformer for plant disease identification on mobile devices
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562605/
https://www.ncbi.nlm.nih.gov/pubmed/37822342
http://dx.doi.org/10.3389/fpls.2023.1256773
work_keys_str_mv AT liguoqiang pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices
AT wangyuchao pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices
AT zhaoqing pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices
AT yuanpeiyan pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices
AT changbaofang pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices