Cargando…

Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer

Plant leaf classification involves identifying and categorizing plant species based on leaf characteristics, such as patterns, shapes, textures, and veins. In recent years, research has been conducted to improve the accuracy of plant classification using machine learning techniques. This involves tr...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Chin Poo, Lim, Kian Ming, Song, Yu Xuan, Alqahtani, Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10383964/
https://www.ncbi.nlm.nih.gov/pubmed/37514256
http://dx.doi.org/10.3390/plants12142642
_version_ 1785081040680779776
author Lee, Chin Poo
Lim, Kian Ming
Song, Yu Xuan
Alqahtani, Ali
author_facet Lee, Chin Poo
Lim, Kian Ming
Song, Yu Xuan
Alqahtani, Ali
author_sort Lee, Chin Poo
collection PubMed
description Plant leaf classification involves identifying and categorizing plant species based on leaf characteristics, such as patterns, shapes, textures, and veins. In recent years, research has been conducted to improve the accuracy of plant classification using machine learning techniques. This involves training models on large datasets of plant images and using them to identify different plant species. However, these models are limited by their reliance on large amounts of training data, which can be difficult to obtain for many plant species. To overcome this challenge, this paper proposes a Plant-CNN-ViT ensemble model that combines the strengths of four pre-trained models: Vision Transformer, ResNet-50, DenseNet-201, and Xception. Vision Transformer utilizes self-attention to capture dependencies and focus on important leaf features. ResNet-50 introduces residual connections, aiding in efficient training and hierarchical feature extraction. DenseNet-201 employs dense connections, facilitating information flow and capturing intricate leaf patterns. Xception uses separable convolutions, reducing the computational cost while capturing fine-grained details in leaf images. The proposed Plant-CNN-ViT was evaluated on four plant leaf datasets and achieved remarkable accuracy of 100.00%, 100.00%, 100.00%, and 99.83% on the Flavia dataset, Folio Leaf dataset, Swedish Leaf dataset, and MalayaKew Leaf dataset, respectively.
format Online
Article
Text
id pubmed-10383964
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103839642023-07-30 Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer Lee, Chin Poo Lim, Kian Ming Song, Yu Xuan Alqahtani, Ali Plants (Basel) Article Plant leaf classification involves identifying and categorizing plant species based on leaf characteristics, such as patterns, shapes, textures, and veins. In recent years, research has been conducted to improve the accuracy of plant classification using machine learning techniques. This involves training models on large datasets of plant images and using them to identify different plant species. However, these models are limited by their reliance on large amounts of training data, which can be difficult to obtain for many plant species. To overcome this challenge, this paper proposes a Plant-CNN-ViT ensemble model that combines the strengths of four pre-trained models: Vision Transformer, ResNet-50, DenseNet-201, and Xception. Vision Transformer utilizes self-attention to capture dependencies and focus on important leaf features. ResNet-50 introduces residual connections, aiding in efficient training and hierarchical feature extraction. DenseNet-201 employs dense connections, facilitating information flow and capturing intricate leaf patterns. Xception uses separable convolutions, reducing the computational cost while capturing fine-grained details in leaf images. The proposed Plant-CNN-ViT was evaluated on four plant leaf datasets and achieved remarkable accuracy of 100.00%, 100.00%, 100.00%, and 99.83% on the Flavia dataset, Folio Leaf dataset, Swedish Leaf dataset, and MalayaKew Leaf dataset, respectively. MDPI 2023-07-14 /pmc/articles/PMC10383964/ /pubmed/37514256 http://dx.doi.org/10.3390/plants12142642 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lee, Chin Poo
Lim, Kian Ming
Song, Yu Xuan
Alqahtani, Ali
Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer
title Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer
title_full Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer
title_fullStr Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer
title_full_unstemmed Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer
title_short Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer
title_sort plant-cnn-vit: plant classification with ensemble of convolutional neural networks and vision transformer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10383964/
https://www.ncbi.nlm.nih.gov/pubmed/37514256
http://dx.doi.org/10.3390/plants12142642
work_keys_str_mv AT leechinpoo plantcnnvitplantclassificationwithensembleofconvolutionalneuralnetworksandvisiontransformer
AT limkianming plantcnnvitplantclassificationwithensembleofconvolutionalneuralnetworksandvisiontransformer
AT songyuxuan plantcnnvitplantclassificationwithensembleofconvolutionalneuralnetworksandvisiontransformer
AT alqahtaniali plantcnnvitplantclassificationwithensembleofconvolutionalneuralnetworksandvisiontransformer