Cargando…

Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method

Near-infrared (NIR) hyperspectroscopy becomes an emerging nondestructive sensing technology for inspection of crop seeds. A large spectral dataset of more than 140,000 wheat kernels in 30 varieties was prepared for classification. Feature selection is a critical segment in large spectral data analys...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Lei, Zhang, Chu, Taha, Mohamed Farag, Wei, Xinhua, He, Yong, Qiu, Zhengjun, Liu, Yufei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7683420/
https://www.ncbi.nlm.nih.gov/pubmed/33240294
http://dx.doi.org/10.3389/fpls.2020.575810
_version_ 1783612873361063936
author Zhou, Lei
Zhang, Chu
Taha, Mohamed Farag
Wei, Xinhua
He, Yong
Qiu, Zhengjun
Liu, Yufei
author_facet Zhou, Lei
Zhang, Chu
Taha, Mohamed Farag
Wei, Xinhua
He, Yong
Qiu, Zhengjun
Liu, Yufei
author_sort Zhou, Lei
collection PubMed
description Near-infrared (NIR) hyperspectroscopy becomes an emerging nondestructive sensing technology for inspection of crop seeds. A large spectral dataset of more than 140,000 wheat kernels in 30 varieties was prepared for classification. Feature selection is a critical segment in large spectral data analysis. A novel convolutional neural network-based feature selector (CNN-FS) was proposed to screen out deeply target-related spectral channels. A convolutional neural network with attention (CNN-ATT) framework was designed for one-dimension data classification. Popular machine learning models including support vector machine (SVM) and partial least square discrimination analysis were used as the benchmark classifiers. Features selected by conventional feature selection algorithms were considered for comparison. Results showed that the designed CNN-ATT produced a higher performance than the compared classifier. The proposed CNN-FS found a subset of features, which made a better representation of raw dataset than conventional selectors did. The CNN-ATT achieved an accuracy of 93.01% using the full spectra and keep its high precision (90.20%) by training on the 60-channel features obtained via the CNN-FS method. The proposed methods have great potential for handling the analyzing tasks on other large spectral datasets. The proposed feature selection structure can be extended to design other new model-based selectors. The combination of NIR hyperspectroscopic technology and the proposed models has great potential for automatic nondestructive classification of single wheat kernels.
format Online
Article
Text
id pubmed-7683420
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-76834202020-11-24 Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method Zhou, Lei Zhang, Chu Taha, Mohamed Farag Wei, Xinhua He, Yong Qiu, Zhengjun Liu, Yufei Front Plant Sci Plant Science Near-infrared (NIR) hyperspectroscopy becomes an emerging nondestructive sensing technology for inspection of crop seeds. A large spectral dataset of more than 140,000 wheat kernels in 30 varieties was prepared for classification. Feature selection is a critical segment in large spectral data analysis. A novel convolutional neural network-based feature selector (CNN-FS) was proposed to screen out deeply target-related spectral channels. A convolutional neural network with attention (CNN-ATT) framework was designed for one-dimension data classification. Popular machine learning models including support vector machine (SVM) and partial least square discrimination analysis were used as the benchmark classifiers. Features selected by conventional feature selection algorithms were considered for comparison. Results showed that the designed CNN-ATT produced a higher performance than the compared classifier. The proposed CNN-FS found a subset of features, which made a better representation of raw dataset than conventional selectors did. The CNN-ATT achieved an accuracy of 93.01% using the full spectra and keep its high precision (90.20%) by training on the 60-channel features obtained via the CNN-FS method. The proposed methods have great potential for handling the analyzing tasks on other large spectral datasets. The proposed feature selection structure can be extended to design other new model-based selectors. The combination of NIR hyperspectroscopic technology and the proposed models has great potential for automatic nondestructive classification of single wheat kernels. Frontiers Media S.A. 2020-11-10 /pmc/articles/PMC7683420/ /pubmed/33240294 http://dx.doi.org/10.3389/fpls.2020.575810 Text en Copyright © 2020 Zhou, Zhang, Taha, Wei, He, Qiu and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Zhou, Lei
Zhang, Chu
Taha, Mohamed Farag
Wei, Xinhua
He, Yong
Qiu, Zhengjun
Liu, Yufei
Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method
title Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method
title_full Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method
title_fullStr Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method
title_full_unstemmed Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method
title_short Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method
title_sort wheat kernel variety identification based on a large near-infrared spectral dataset and a novel deep learning-based feature selection method
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7683420/
https://www.ncbi.nlm.nih.gov/pubmed/33240294
http://dx.doi.org/10.3389/fpls.2020.575810
work_keys_str_mv AT zhoulei wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod
AT zhangchu wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod
AT tahamohamedfarag wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod
AT weixinhua wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod
AT heyong wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod
AT qiuzhengjun wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod
AT liuyufei wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod