Cargando…
PMVT: a lightweight vision transformer for plant disease identification on mobile devices
Due to the constraints of agricultural computing resources and the diversity of plant diseases, it is challenging to achieve the desired accuracy rate while keeping the network lightweight. In this paper, we proposed a computationally efficient deep learning architecture based on the mobile vision t...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562605/ https://www.ncbi.nlm.nih.gov/pubmed/37822342 http://dx.doi.org/10.3389/fpls.2023.1256773 |
_version_ | 1785118165223604224 |
---|---|
author | Li, Guoqiang Wang, Yuchao Zhao, Qing Yuan, Peiyan Chang, Baofang |
author_facet | Li, Guoqiang Wang, Yuchao Zhao, Qing Yuan, Peiyan Chang, Baofang |
author_sort | Li, Guoqiang |
collection | PubMed |
description | Due to the constraints of agricultural computing resources and the diversity of plant diseases, it is challenging to achieve the desired accuracy rate while keeping the network lightweight. In this paper, we proposed a computationally efficient deep learning architecture based on the mobile vision transformer (MobileViT) for real-time detection of plant diseases, which we called plant-based MobileViT (PMVT). Our proposed model was designed to be highly accurate and low-cost, making it suitable for deployment on mobile devices with limited resources. Specifically, we replaced the convolution block in MobileViT with an inverted residual structure that employs a 7×7 convolution kernel to effectively model long-distance dependencies between different leaves in plant disease images. Furthermore, inspired by the concept of multi-level attention in computer vision tasks, we integrated a convolutional block attention module (CBAM) into the standard ViT encoder. This integration allows the network to effectively avoid irrelevant information and focus on essential features. The PMVT network achieves reduced parameter counts compared to alternative networks on various mobile devices while maintaining high accuracy across different vision tasks. Extensive experiments on multiple agricultural datasets, including wheat, coffee, and rice, demonstrate that the proposed method outperforms the current best lightweight and heavyweight models. On the wheat dataset, PMVT achieves the highest accuracy of 93.6% using approximately 0.98 million (M) parameters. This accuracy is 1.6% higher than that of MobileNetV3. Under the same parameters, PMVT achieved an accuracy of 85.4% on the coffee dataset, surpassing SqueezeNet by 2.3%. Furthermore, out method achieved an accuracy of 93.1% on the rice dataset, surpassing MobileNetV3 by 3.4%. Additionally, we developed a plant disease diagnosis app and successfully used the trained PMVT model to identify plant disease in different scenarios. |
format | Online Article Text |
id | pubmed-10562605 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105626052023-10-11 PMVT: a lightweight vision transformer for plant disease identification on mobile devices Li, Guoqiang Wang, Yuchao Zhao, Qing Yuan, Peiyan Chang, Baofang Front Plant Sci Plant Science Due to the constraints of agricultural computing resources and the diversity of plant diseases, it is challenging to achieve the desired accuracy rate while keeping the network lightweight. In this paper, we proposed a computationally efficient deep learning architecture based on the mobile vision transformer (MobileViT) for real-time detection of plant diseases, which we called plant-based MobileViT (PMVT). Our proposed model was designed to be highly accurate and low-cost, making it suitable for deployment on mobile devices with limited resources. Specifically, we replaced the convolution block in MobileViT with an inverted residual structure that employs a 7×7 convolution kernel to effectively model long-distance dependencies between different leaves in plant disease images. Furthermore, inspired by the concept of multi-level attention in computer vision tasks, we integrated a convolutional block attention module (CBAM) into the standard ViT encoder. This integration allows the network to effectively avoid irrelevant information and focus on essential features. The PMVT network achieves reduced parameter counts compared to alternative networks on various mobile devices while maintaining high accuracy across different vision tasks. Extensive experiments on multiple agricultural datasets, including wheat, coffee, and rice, demonstrate that the proposed method outperforms the current best lightweight and heavyweight models. On the wheat dataset, PMVT achieves the highest accuracy of 93.6% using approximately 0.98 million (M) parameters. This accuracy is 1.6% higher than that of MobileNetV3. Under the same parameters, PMVT achieved an accuracy of 85.4% on the coffee dataset, surpassing SqueezeNet by 2.3%. Furthermore, out method achieved an accuracy of 93.1% on the rice dataset, surpassing MobileNetV3 by 3.4%. Additionally, we developed a plant disease diagnosis app and successfully used the trained PMVT model to identify plant disease in different scenarios. Frontiers Media S.A. 2023-09-26 /pmc/articles/PMC10562605/ /pubmed/37822342 http://dx.doi.org/10.3389/fpls.2023.1256773 Text en Copyright © 2023 Li, Wang, Zhao, Yuan and Chang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Li, Guoqiang Wang, Yuchao Zhao, Qing Yuan, Peiyan Chang, Baofang PMVT: a lightweight vision transformer for plant disease identification on mobile devices |
title | PMVT: a lightweight vision transformer for plant disease identification on mobile devices |
title_full | PMVT: a lightweight vision transformer for plant disease identification on mobile devices |
title_fullStr | PMVT: a lightweight vision transformer for plant disease identification on mobile devices |
title_full_unstemmed | PMVT: a lightweight vision transformer for plant disease identification on mobile devices |
title_short | PMVT: a lightweight vision transformer for plant disease identification on mobile devices |
title_sort | pmvt: a lightweight vision transformer for plant disease identification on mobile devices |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562605/ https://www.ncbi.nlm.nih.gov/pubmed/37822342 http://dx.doi.org/10.3389/fpls.2023.1256773 |
work_keys_str_mv | AT liguoqiang pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices AT wangyuchao pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices AT zhaoqing pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices AT yuanpeiyan pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices AT changbaofang pmvtalightweightvisiontransformerforplantdiseaseidentificationonmobiledevices |