
Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors

Near-infrared (NIR) spectral sensors deliver the spectral response of the light absorbed by materials for quantification, qualification or identification. Spectral analysis technology based on the NIR sensor has been a useful tool for complex information processing and high precision identification...


Bibliographic Details
Main Authors: Wang, Di; Xie, Lin; Yang, Simon X.; Tian, Fengchun
Format: Online Article Text
Language: English
Published: MDPI 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6210373/
https://www.ncbi.nlm.nih.gov/pubmed/30257420
http://dx.doi.org/10.3390/s18103222
_version_ 1783367099342651392
author Wang, Di
Xie, Lin
Yang, Simon X.
Tian, Fengchun
collection PubMed
description Near-infrared (NIR) spectral sensors measure the spectral response of light absorbed by materials for quantification, qualification or identification. Spectral analysis based on NIR sensors has been a useful tool for complex information processing and high-precision identification in the tobacco industry. In this paper, a novel method based on the support vector machine (SVM) is proposed to discriminate tobacco cultivation regions using NIR sensors, where a genetic algorithm (GA) is employed for input subset selection to identify the effective principal components (PCs) for the SVM model. With the same number of PCs as inputs to the SVM model, comparative experiments were conducted between the effective PCs selected by the GA and the leading PCs taken in order from the first one. Model performance was evaluated in terms of prediction accuracy and four assessment criteria (true positive rate, true negative rate, positive predictive value and F1 score). The results show that some PCs carrying less information may contribute more to discriminating the cultivation regions and are therefore considered more effective, and that the SVM model with the effective PCs selected by the GA has superior discrimination capacity. The proposed GA-SVM model can effectively learn the relationship between tobacco cultivation regions and tobacco NIR sensor data.
format Online
Article
Text
id pubmed-6210373
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-6210373 2018-11-02 Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors Wang, Di Xie, Lin Yang, Simon X. Tian, Fengchun Sensors (Basel) Article MDPI 2018-09-25 /pmc/articles/PMC6210373/ /pubmed/30257420 http://dx.doi.org/10.3390/s18103222 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors
topic Article
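The abstract above names prediction accuracy and four assessment criteria: true positive rate, true negative rate, positive predictive value and F1 score. The following is a minimal sketch of how these are commonly computed from a confusion matrix; it is not taken from the article. The function name `binary_metrics`, the toy labels, and the one-vs-rest treatment of a single region as the "positive" class are all assumptions made for illustration.

```python
from typing import Dict, Sequence


def binary_metrics(
    y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1
) -> Dict[str, float]:
    """Compute accuracy, TPR, TNR, PPV and F1 for one class treated as positive.

    Hypothetical helper: every label other than `positive` counts as negative
    (a one-vs-rest view, assumed here for illustration).
    """
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1

    def ratio(num: float, den: float) -> float:
        # Return 0.0 instead of raising when a denominator is empty.
        return num / den if den else 0.0

    tpr = ratio(tp, tp + fn)  # true positive rate (recall, sensitivity)
    tnr = ratio(tn, tn + fp)  # true negative rate (specificity)
    ppv = ratio(tp, tp + fp)  # positive predictive value (precision)
    f1 = ratio(2 * ppv * tpr, ppv + tpr)  # harmonic mean of PPV and TPR
    accuracy = ratio(tp + tn, tp + tn + fp + fn)
    return {"accuracy": accuracy, "tpr": tpr, "tnr": tnr, "ppv": ppv, "f1": f1}


# Toy labels (made up for this sketch): 1 = target region, 0 = any other region.
example_true = [1, 1, 1, 0, 0, 0, 0, 1]
example_pred = [1, 1, 0, 0, 0, 1, 0, 1]
example_scores = binary_metrics(example_true, example_pred)
```

For a problem with more than two cultivation regions, one option would be to call the helper once per region and average the results; whether the article does this is not stated in the record.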