Cargando…
Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method
Near-infrared (NIR) hyperspectroscopy becomes an emerging nondestructive sensing technology for inspection of crop seeds. A large spectral dataset of more than 140,000 wheat kernels in 30 varieties was prepared for classification. Feature selection is a critical segment in large spectral data analys...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7683420/ https://www.ncbi.nlm.nih.gov/pubmed/33240294 http://dx.doi.org/10.3389/fpls.2020.575810 |
_version_ | 1783612873361063936 |
---|---|
author | Zhou, Lei Zhang, Chu Taha, Mohamed Farag Wei, Xinhua He, Yong Qiu, Zhengjun Liu, Yufei |
author_facet | Zhou, Lei Zhang, Chu Taha, Mohamed Farag Wei, Xinhua He, Yong Qiu, Zhengjun Liu, Yufei |
author_sort | Zhou, Lei |
collection | PubMed |
description | Near-infrared (NIR) hyperspectroscopy becomes an emerging nondestructive sensing technology for inspection of crop seeds. A large spectral dataset of more than 140,000 wheat kernels in 30 varieties was prepared for classification. Feature selection is a critical segment in large spectral data analysis. A novel convolutional neural network-based feature selector (CNN-FS) was proposed to screen out deeply target-related spectral channels. A convolutional neural network with attention (CNN-ATT) framework was designed for one-dimension data classification. Popular machine learning models including support vector machine (SVM) and partial least square discrimination analysis were used as the benchmark classifiers. Features selected by conventional feature selection algorithms were considered for comparison. Results showed that the designed CNN-ATT produced a higher performance than the compared classifier. The proposed CNN-FS found a subset of features, which made a better representation of raw dataset than conventional selectors did. The CNN-ATT achieved an accuracy of 93.01% using the full spectra and keep its high precision (90.20%) by training on the 60-channel features obtained via the CNN-FS method. The proposed methods have great potential for handling the analyzing tasks on other large spectral datasets. The proposed feature selection structure can be extended to design other new model-based selectors. The combination of NIR hyperspectroscopic technology and the proposed models has great potential for automatic nondestructive classification of single wheat kernels. |
format | Online Article Text |
id | pubmed-7683420 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-76834202020-11-24 Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method Zhou, Lei Zhang, Chu Taha, Mohamed Farag Wei, Xinhua He, Yong Qiu, Zhengjun Liu, Yufei Front Plant Sci Plant Science Near-infrared (NIR) hyperspectroscopy becomes an emerging nondestructive sensing technology for inspection of crop seeds. A large spectral dataset of more than 140,000 wheat kernels in 30 varieties was prepared for classification. Feature selection is a critical segment in large spectral data analysis. A novel convolutional neural network-based feature selector (CNN-FS) was proposed to screen out deeply target-related spectral channels. A convolutional neural network with attention (CNN-ATT) framework was designed for one-dimension data classification. Popular machine learning models including support vector machine (SVM) and partial least square discrimination analysis were used as the benchmark classifiers. Features selected by conventional feature selection algorithms were considered for comparison. Results showed that the designed CNN-ATT produced a higher performance than the compared classifier. The proposed CNN-FS found a subset of features, which made a better representation of raw dataset than conventional selectors did. The CNN-ATT achieved an accuracy of 93.01% using the full spectra and keep its high precision (90.20%) by training on the 60-channel features obtained via the CNN-FS method. The proposed methods have great potential for handling the analyzing tasks on other large spectral datasets. The proposed feature selection structure can be extended to design other new model-based selectors. The combination of NIR hyperspectroscopic technology and the proposed models has great potential for automatic nondestructive classification of single wheat kernels. Frontiers Media S.A. 2020-11-10 /pmc/articles/PMC7683420/ /pubmed/33240294 http://dx.doi.org/10.3389/fpls.2020.575810 Text en Copyright © 2020 Zhou, Zhang, Taha, Wei, He, Qiu and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Zhou, Lei Zhang, Chu Taha, Mohamed Farag Wei, Xinhua He, Yong Qiu, Zhengjun Liu, Yufei Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method |
title | Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method |
title_full | Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method |
title_fullStr | Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method |
title_full_unstemmed | Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method |
title_short | Wheat Kernel Variety Identification Based on a Large Near-Infrared Spectral Dataset and a Novel Deep Learning-Based Feature Selection Method |
title_sort | wheat kernel variety identification based on a large near-infrared spectral dataset and a novel deep learning-based feature selection method |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7683420/ https://www.ncbi.nlm.nih.gov/pubmed/33240294 http://dx.doi.org/10.3389/fpls.2020.575810 |
work_keys_str_mv | AT zhoulei wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod AT zhangchu wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod AT tahamohamedfarag wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod AT weixinhua wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod AT heyong wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod AT qiuzhengjun wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod AT liuyufei wheatkernelvarietyidentificationbasedonalargenearinfraredspectraldatasetandanoveldeeplearningbasedfeatureselectionmethod |