Cargando…
Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer
Plant leaf classification involves identifying and categorizing plant species based on leaf characteristics, such as patterns, shapes, textures, and veins. In recent years, research has been conducted to improve the accuracy of plant classification using machine learning techniques. This involves tr...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10383964/ https://www.ncbi.nlm.nih.gov/pubmed/37514256 http://dx.doi.org/10.3390/plants12142642 |
_version_ | 1785081040680779776 |
---|---|
author | Lee, Chin Poo Lim, Kian Ming Song, Yu Xuan Alqahtani, Ali |
author_facet | Lee, Chin Poo Lim, Kian Ming Song, Yu Xuan Alqahtani, Ali |
author_sort | Lee, Chin Poo |
collection | PubMed |
description | Plant leaf classification involves identifying and categorizing plant species based on leaf characteristics, such as patterns, shapes, textures, and veins. In recent years, research has been conducted to improve the accuracy of plant classification using machine learning techniques. This involves training models on large datasets of plant images and using them to identify different plant species. However, these models are limited by their reliance on large amounts of training data, which can be difficult to obtain for many plant species. To overcome this challenge, this paper proposes a Plant-CNN-ViT ensemble model that combines the strengths of four pre-trained models: Vision Transformer, ResNet-50, DenseNet-201, and Xception. Vision Transformer utilizes self-attention to capture dependencies and focus on important leaf features. ResNet-50 introduces residual connections, aiding in efficient training and hierarchical feature extraction. DenseNet-201 employs dense connections, facilitating information flow and capturing intricate leaf patterns. Xception uses separable convolutions, reducing the computational cost while capturing fine-grained details in leaf images. The proposed Plant-CNN-ViT was evaluated on four plant leaf datasets and achieved remarkable accuracy of 100.00%, 100.00%, 100.00%, and 99.83% on the Flavia dataset, Folio Leaf dataset, Swedish Leaf dataset, and MalayaKew Leaf dataset, respectively. |
format | Online Article Text |
id | pubmed-10383964 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-103839642023-07-30 Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer Lee, Chin Poo Lim, Kian Ming Song, Yu Xuan Alqahtani, Ali Plants (Basel) Article Plant leaf classification involves identifying and categorizing plant species based on leaf characteristics, such as patterns, shapes, textures, and veins. In recent years, research has been conducted to improve the accuracy of plant classification using machine learning techniques. This involves training models on large datasets of plant images and using them to identify different plant species. However, these models are limited by their reliance on large amounts of training data, which can be difficult to obtain for many plant species. To overcome this challenge, this paper proposes a Plant-CNN-ViT ensemble model that combines the strengths of four pre-trained models: Vision Transformer, ResNet-50, DenseNet-201, and Xception. Vision Transformer utilizes self-attention to capture dependencies and focus on important leaf features. ResNet-50 introduces residual connections, aiding in efficient training and hierarchical feature extraction. DenseNet-201 employs dense connections, facilitating information flow and capturing intricate leaf patterns. Xception uses separable convolutions, reducing the computational cost while capturing fine-grained details in leaf images. The proposed Plant-CNN-ViT was evaluated on four plant leaf datasets and achieved remarkable accuracy of 100.00%, 100.00%, 100.00%, and 99.83% on the Flavia dataset, Folio Leaf dataset, Swedish Leaf dataset, and MalayaKew Leaf dataset, respectively. MDPI 2023-07-14 /pmc/articles/PMC10383964/ /pubmed/37514256 http://dx.doi.org/10.3390/plants12142642 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lee, Chin Poo Lim, Kian Ming Song, Yu Xuan Alqahtani, Ali Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer |
title | Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer |
title_full | Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer |
title_fullStr | Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer |
title_full_unstemmed | Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer |
title_short | Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer |
title_sort | plant-cnn-vit: plant classification with ensemble of convolutional neural networks and vision transformer |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10383964/ https://www.ncbi.nlm.nih.gov/pubmed/37514256 http://dx.doi.org/10.3390/plants12142642 |
work_keys_str_mv | AT leechinpoo plantcnnvitplantclassificationwithensembleofconvolutionalneuralnetworksandvisiontransformer AT limkianming plantcnnvitplantclassificationwithensembleofconvolutionalneuralnetworksandvisiontransformer AT songyuxuan plantcnnvitplantclassificationwithensembleofconvolutionalneuralnetworksandvisiontransformer AT alqahtaniali plantcnnvitplantclassificationwithensembleofconvolutionalneuralnetworksandvisiontransformer |