Cargando…
Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models
Developing chemometric models from near-infrared (NIR) spectra requires the use of a representative calibration set of the entire population. Therefore, generally, the calibration procedure requires a large number of resources. For that reason, there is a great interest in identifying the most spect...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7912666/ https://www.ncbi.nlm.nih.gov/pubmed/33498776 http://dx.doi.org/10.3390/foods10020233 |
_version_ | 1783656628093976576 |
---|---|
author | Nogales-Bueno, Julio Rodríguez-Pulido, Francisco José Baca-Bocanegra, Berta Pérez-Marin, Dolores Heredia, Francisco José Garrido-Varo, Ana Hernández-Hierro, José Miguel |
author_facet | Nogales-Bueno, Julio Rodríguez-Pulido, Francisco José Baca-Bocanegra, Berta Pérez-Marin, Dolores Heredia, Francisco José Garrido-Varo, Ana Hernández-Hierro, José Miguel |
author_sort | Nogales-Bueno, Julio |
collection | PubMed |
description | Developing chemometric models from near-infrared (NIR) spectra requires the use of a representative calibration set of the entire population. Therefore, generally, the calibration procedure requires a large number of resources. For that reason, there is a great interest in identifying the most spectrally representative samples within a large population set. In this study, principal component and hierarchical clustering analyses have been compared for their ability to provide different representative calibration sets. The calibration sets generated have been used to control the technological maturity of grapes and total phenolic compounds of grape skins in red and white cultivars. Finally, the accuracy and precision of the models obtained with these calibration sets resulted from the application of the selection algorithms studied have been compared with each other and with the whole set of samples using an external validation set. Most of the standard errors of prediction (SEP) in external validation obtained from the reduced data sets were not significantly different from those obtained using the whole data set. Moreover, sample subsets resulting from hierarchical clustering analysis appear to produce slightly better results. |
format | Online Article Text |
id | pubmed-7912666 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-79126662021-02-28 Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models Nogales-Bueno, Julio Rodríguez-Pulido, Francisco José Baca-Bocanegra, Berta Pérez-Marin, Dolores Heredia, Francisco José Garrido-Varo, Ana Hernández-Hierro, José Miguel Foods Article Developing chemometric models from near-infrared (NIR) spectra requires the use of a representative calibration set of the entire population. Therefore, generally, the calibration procedure requires a large number of resources. For that reason, there is a great interest in identifying the most spectrally representative samples within a large population set. In this study, principal component and hierarchical clustering analyses have been compared for their ability to provide different representative calibration sets. The calibration sets generated have been used to control the technological maturity of grapes and total phenolic compounds of grape skins in red and white cultivars. Finally, the accuracy and precision of the models obtained with these calibration sets resulted from the application of the selection algorithms studied have been compared with each other and with the whole set of samples using an external validation set. Most of the standard errors of prediction (SEP) in external validation obtained from the reduced data sets were not significantly different from those obtained using the whole data set. Moreover, sample subsets resulting from hierarchical clustering analysis appear to produce slightly better results. MDPI 2021-01-23 /pmc/articles/PMC7912666/ /pubmed/33498776 http://dx.doi.org/10.3390/foods10020233 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Nogales-Bueno, Julio Rodríguez-Pulido, Francisco José Baca-Bocanegra, Berta Pérez-Marin, Dolores Heredia, Francisco José Garrido-Varo, Ana Hernández-Hierro, José Miguel Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models |
title | Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models |
title_full | Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models |
title_fullStr | Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models |
title_full_unstemmed | Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models |
title_short | Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models |
title_sort | reduction of the number of samples for cost-effective hyperspectral grape quality predictive models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7912666/ https://www.ncbi.nlm.nih.gov/pubmed/33498776 http://dx.doi.org/10.3390/foods10020233 |
work_keys_str_mv | AT nogalesbuenojulio reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels AT rodriguezpulidofranciscojose reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels AT bacabocanegraberta reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels AT perezmarindolores reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels AT herediafranciscojose reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels AT garridovaroana reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels AT hernandezhierrojosemiguel reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels |