Cargando…

Automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach

Plants play a crucial role in foodstuff, medicine, industry, and environmental protection. The skill of recognising plants is very important in some applications, including conservation of endangered species and rehabilitation of lands after mining activities. However, it is a difficult task to iden...

Descripción completa

Detalles Bibliográficos
Autores principales: Murat, Miraemiliana, Chang, Siow-Wee, Abu, Arpah, Yap, Hwa Jen, Yong, Kien-Thai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5600178/
https://www.ncbi.nlm.nih.gov/pubmed/28924506
http://dx.doi.org/10.7717/peerj.3792
_version_ 1783264199931068416
author Murat, Miraemiliana
Chang, Siow-Wee
Abu, Arpah
Yap, Hwa Jen
Yong, Kien-Thai
author_facet Murat, Miraemiliana
Chang, Siow-Wee
Abu, Arpah
Yap, Hwa Jen
Yong, Kien-Thai
author_sort Murat, Miraemiliana
collection PubMed
description Plants play a crucial role in foodstuff, medicine, industry, and environmental protection. The skill of recognising plants is very important in some applications, including conservation of endangered species and rehabilitation of lands after mining activities. However, it is a difficult task to identify plant species because it requires specialized knowledge. Developing an automated classification system for plant species is necessary and valuable since it can help specialists as well as the public in identifying plant species easily. Shape descriptors were applied on the myDAUN dataset that contains 45 tropical shrub species collected from the University of Malaya (UM), Malaysia. Based on literature review, this is the first study in the development of tropical shrub species image dataset and classification using a hybrid of leaf shape and machine learning approach. Four types of shape descriptors were used in this study namely morphological shape descriptors (MSD), Histogram of Oriented Gradients (HOG), Hu invariant moments (Hu) and Zernike moments (ZM). Single descriptor, as well as the combination of hybrid descriptors were tested and compared. The tropical shrub species are classified using six different classifiers, which are artificial neural network (ANN), random forest (RF), support vector machine (SVM), k-nearest neighbour (k-NN), linear discriminant analysis (LDA) and directed acyclic graph multiclass least squares twin support vector machine (DAG MLSTSVM). In addition, three types of feature selection methods were tested in the myDAUN dataset, Relief, Correlation-based feature selection (CFS) and Pearson’s coefficient correlation (PCC). The well-known Flavia dataset and Swedish Leaf dataset were used as the validation dataset on the proposed methods. The results showed that the hybrid of all descriptors of ANN outperformed the other classifiers with an average classification accuracy of 98.23% for the myDAUN dataset, 95.25% for the Flavia dataset and 99.89% for the Swedish Leaf dataset. In addition, the Relief feature selection method achieved the highest classification accuracy of 98.13% after 80 (or 60%) of the original features were reduced, from 133 to 53 descriptors in the myDAUN dataset with the reduction in computational time. Subsequently, the hybridisation of four descriptors gave the best results compared to others. It is proven that the combination MSD and HOG were good enough for tropical shrubs species classification. Hu and ZM descriptors also improved the accuracy in tropical shrubs species classification in terms of invariant to translation, rotation and scale. ANN outperformed the others for tropical shrub species classification in this study. Feature selection methods can be used in the classification of tropical shrub species, as the comparable results could be obtained with the reduced descriptors and reduced in computational time and cost.
format Online
Article
Text
id pubmed-5600178
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-56001782017-09-18 Automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach Murat, Miraemiliana Chang, Siow-Wee Abu, Arpah Yap, Hwa Jen Yong, Kien-Thai PeerJ Bioinformatics Plants play a crucial role in foodstuff, medicine, industry, and environmental protection. The skill of recognising plants is very important in some applications, including conservation of endangered species and rehabilitation of lands after mining activities. However, it is a difficult task to identify plant species because it requires specialized knowledge. Developing an automated classification system for plant species is necessary and valuable since it can help specialists as well as the public in identifying plant species easily. Shape descriptors were applied on the myDAUN dataset that contains 45 tropical shrub species collected from the University of Malaya (UM), Malaysia. Based on literature review, this is the first study in the development of tropical shrub species image dataset and classification using a hybrid of leaf shape and machine learning approach. Four types of shape descriptors were used in this study namely morphological shape descriptors (MSD), Histogram of Oriented Gradients (HOG), Hu invariant moments (Hu) and Zernike moments (ZM). Single descriptor, as well as the combination of hybrid descriptors were tested and compared. The tropical shrub species are classified using six different classifiers, which are artificial neural network (ANN), random forest (RF), support vector machine (SVM), k-nearest neighbour (k-NN), linear discriminant analysis (LDA) and directed acyclic graph multiclass least squares twin support vector machine (DAG MLSTSVM). In addition, three types of feature selection methods were tested in the myDAUN dataset, Relief, Correlation-based feature selection (CFS) and Pearson’s coefficient correlation (PCC). The well-known Flavia dataset and Swedish Leaf dataset were used as the validation dataset on the proposed methods. The results showed that the hybrid of all descriptors of ANN outperformed the other classifiers with an average classification accuracy of 98.23% for the myDAUN dataset, 95.25% for the Flavia dataset and 99.89% for the Swedish Leaf dataset. In addition, the Relief feature selection method achieved the highest classification accuracy of 98.13% after 80 (or 60%) of the original features were reduced, from 133 to 53 descriptors in the myDAUN dataset with the reduction in computational time. Subsequently, the hybridisation of four descriptors gave the best results compared to others. It is proven that the combination MSD and HOG were good enough for tropical shrubs species classification. Hu and ZM descriptors also improved the accuracy in tropical shrubs species classification in terms of invariant to translation, rotation and scale. ANN outperformed the others for tropical shrub species classification in this study. Feature selection methods can be used in the classification of tropical shrub species, as the comparable results could be obtained with the reduced descriptors and reduced in computational time and cost. PeerJ Inc. 2017-09-12 /pmc/articles/PMC5600178/ /pubmed/28924506 http://dx.doi.org/10.7717/peerj.3792 Text en ©2017 Murat et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Murat, Miraemiliana
Chang, Siow-Wee
Abu, Arpah
Yap, Hwa Jen
Yong, Kien-Thai
Automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach
title Automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach
title_full Automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach
title_fullStr Automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach
title_full_unstemmed Automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach
title_short Automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach
title_sort automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5600178/
https://www.ncbi.nlm.nih.gov/pubmed/28924506
http://dx.doi.org/10.7717/peerj.3792
work_keys_str_mv AT muratmiraemiliana automatedclassificationoftropicalshrubspeciesahybridofleafshapeandmachinelearningapproach
AT changsiowwee automatedclassificationoftropicalshrubspeciesahybridofleafshapeandmachinelearningapproach
AT abuarpah automatedclassificationoftropicalshrubspeciesahybridofleafshapeandmachinelearningapproach
AT yaphwajen automatedclassificationoftropicalshrubspeciesahybridofleafshapeandmachinelearningapproach
AT yongkienthai automatedclassificationoftropicalshrubspeciesahybridofleafshapeandmachinelearningapproach