Cargando…
Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors
Near-infrared (NIR) spectral sensors deliver the spectral response of the light absorbed by materials for quantification, qualification or identification. Spectral analysis technology based on the NIR sensor has been a useful tool for complex information processing and high precision identification...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6210373/ https://www.ncbi.nlm.nih.gov/pubmed/30257420 http://dx.doi.org/10.3390/s18103222 |
_version_ | 1783367099342651392 |
---|---|
author | Wang, Di Xie, Lin Yang, Simon X. Tian, Fengchun |
author_facet | Wang, Di Xie, Lin Yang, Simon X. Tian, Fengchun |
author_sort | Wang, Di |
collection | PubMed |
description | Near-infrared (NIR) spectral sensors deliver the spectral response of the light absorbed by materials for quantification, qualification or identification. Spectral analysis technology based on the NIR sensor has been a useful tool for complex information processing and high precision identification in the tobacco industry. In this paper, a novel method based on the support vector machine (SVM) is proposed to discriminate the tobacco cultivation region using the near-infrared (NIR) sensors, where the genetic algorithm (GA) is employed for input subset selection to identify the effective principal components (PCs) for the SVM model. With the same number of PCs as the inputs to the SVM model, a number of comparative experiments were conducted between the effective PCs selected by GA and the PCs orderly starting from the first one. The model performance was evaluated in terms of prediction accuracy and four parameters of assessment criteria (true positive rate, true negative rate, positive predictive value and F1 score). From the results, it is interesting to find that some PCs with less information may contribute more to the cultivation regions and are considered as more effective PCs, and the SVM model with the effective PCs selected by GA has a superior discrimination capacity. The proposed GA-SVM model can effectively learn the relationship between tobacco cultivation regions and tobacco NIR sensor data. |
format | Online Article Text |
id | pubmed-6210373 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-62103732018-11-02 Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors Wang, Di Xie, Lin Yang, Simon X. Tian, Fengchun Sensors (Basel) Article Near-infrared (NIR) spectral sensors deliver the spectral response of the light absorbed by materials for quantification, qualification or identification. Spectral analysis technology based on the NIR sensor has been a useful tool for complex information processing and high precision identification in the tobacco industry. In this paper, a novel method based on the support vector machine (SVM) is proposed to discriminate the tobacco cultivation region using the near-infrared (NIR) sensors, where the genetic algorithm (GA) is employed for input subset selection to identify the effective principal components (PCs) for the SVM model. With the same number of PCs as the inputs to the SVM model, a number of comparative experiments were conducted between the effective PCs selected by GA and the PCs orderly starting from the first one. The model performance was evaluated in terms of prediction accuracy and four parameters of assessment criteria (true positive rate, true negative rate, positive predictive value and F1 score). From the results, it is interesting to find that some PCs with less information may contribute more to the cultivation regions and are considered as more effective PCs, and the SVM model with the effective PCs selected by GA has a superior discrimination capacity. The proposed GA-SVM model can effectively learn the relationship between tobacco cultivation regions and tobacco NIR sensor data. MDPI 2018-09-25 /pmc/articles/PMC6210373/ /pubmed/30257420 http://dx.doi.org/10.3390/s18103222 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Di Xie, Lin Yang, Simon X. Tian, Fengchun Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors |
title | Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors |
title_full | Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors |
title_fullStr | Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors |
title_full_unstemmed | Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors |
title_short | Support Vector Machine Optimized by Genetic Algorithm for Data Analysis of Near-Infrared Spectroscopy Sensors |
title_sort | support vector machine optimized by genetic algorithm for data analysis of near-infrared spectroscopy sensors |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6210373/ https://www.ncbi.nlm.nih.gov/pubmed/30257420 http://dx.doi.org/10.3390/s18103222 |
work_keys_str_mv | AT wangdi supportvectormachineoptimizedbygeneticalgorithmfordataanalysisofnearinfraredspectroscopysensors AT xielin supportvectormachineoptimizedbygeneticalgorithmfordataanalysisofnearinfraredspectroscopysensors AT yangsimonx supportvectormachineoptimizedbygeneticalgorithmfordataanalysisofnearinfraredspectroscopysensors AT tianfengchun supportvectormachineoptimizedbygeneticalgorithmfordataanalysisofnearinfraredspectroscopysensors |