Cargando…

Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models

Developing chemometric models from near-infrared (NIR) spectra requires the use of a representative calibration set of the entire population. Therefore, generally, the calibration procedure requires a large number of resources. For that reason, there is a great interest in identifying the most spect...

Descripción completa

Detalles Bibliográficos
Autores principales: Nogales-Bueno, Julio, Rodríguez-Pulido, Francisco José, Baca-Bocanegra, Berta, Pérez-Marin, Dolores, Heredia, Francisco José, Garrido-Varo, Ana, Hernández-Hierro, José Miguel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7912666/
https://www.ncbi.nlm.nih.gov/pubmed/33498776
http://dx.doi.org/10.3390/foods10020233
_version_ 1783656628093976576
author Nogales-Bueno, Julio
Rodríguez-Pulido, Francisco José
Baca-Bocanegra, Berta
Pérez-Marin, Dolores
Heredia, Francisco José
Garrido-Varo, Ana
Hernández-Hierro, José Miguel
author_facet Nogales-Bueno, Julio
Rodríguez-Pulido, Francisco José
Baca-Bocanegra, Berta
Pérez-Marin, Dolores
Heredia, Francisco José
Garrido-Varo, Ana
Hernández-Hierro, José Miguel
author_sort Nogales-Bueno, Julio
collection PubMed
description Developing chemometric models from near-infrared (NIR) spectra requires the use of a representative calibration set of the entire population. Therefore, generally, the calibration procedure requires a large number of resources. For that reason, there is a great interest in identifying the most spectrally representative samples within a large population set. In this study, principal component and hierarchical clustering analyses have been compared for their ability to provide different representative calibration sets. The calibration sets generated have been used to control the technological maturity of grapes and total phenolic compounds of grape skins in red and white cultivars. Finally, the accuracy and precision of the models obtained with these calibration sets resulted from the application of the selection algorithms studied have been compared with each other and with the whole set of samples using an external validation set. Most of the standard errors of prediction (SEP) in external validation obtained from the reduced data sets were not significantly different from those obtained using the whole data set. Moreover, sample subsets resulting from hierarchical clustering analysis appear to produce slightly better results.
format Online
Article
Text
id pubmed-7912666
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79126662021-02-28 Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models Nogales-Bueno, Julio Rodríguez-Pulido, Francisco José Baca-Bocanegra, Berta Pérez-Marin, Dolores Heredia, Francisco José Garrido-Varo, Ana Hernández-Hierro, José Miguel Foods Article Developing chemometric models from near-infrared (NIR) spectra requires the use of a representative calibration set of the entire population. Therefore, generally, the calibration procedure requires a large number of resources. For that reason, there is a great interest in identifying the most spectrally representative samples within a large population set. In this study, principal component and hierarchical clustering analyses have been compared for their ability to provide different representative calibration sets. The calibration sets generated have been used to control the technological maturity of grapes and total phenolic compounds of grape skins in red and white cultivars. Finally, the accuracy and precision of the models obtained with these calibration sets resulted from the application of the selection algorithms studied have been compared with each other and with the whole set of samples using an external validation set. Most of the standard errors of prediction (SEP) in external validation obtained from the reduced data sets were not significantly different from those obtained using the whole data set. Moreover, sample subsets resulting from hierarchical clustering analysis appear to produce slightly better results. MDPI 2021-01-23 /pmc/articles/PMC7912666/ /pubmed/33498776 http://dx.doi.org/10.3390/foods10020233 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Nogales-Bueno, Julio
Rodríguez-Pulido, Francisco José
Baca-Bocanegra, Berta
Pérez-Marin, Dolores
Heredia, Francisco José
Garrido-Varo, Ana
Hernández-Hierro, José Miguel
Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models
title Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models
title_full Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models
title_fullStr Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models
title_full_unstemmed Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models
title_short Reduction of the Number of Samples for Cost-Effective Hyperspectral Grape Quality Predictive Models
title_sort reduction of the number of samples for cost-effective hyperspectral grape quality predictive models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7912666/
https://www.ncbi.nlm.nih.gov/pubmed/33498776
http://dx.doi.org/10.3390/foods10020233
work_keys_str_mv AT nogalesbuenojulio reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels
AT rodriguezpulidofranciscojose reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels
AT bacabocanegraberta reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels
AT perezmarindolores reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels
AT herediafranciscojose reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels
AT garridovaroana reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels
AT hernandezhierrojosemiguel reductionofthenumberofsamplesforcosteffectivehyperspectralgrapequalitypredictivemodels