Cargando…

A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines

BACKGROUND: The combination of computer vision devices such as multispectral cameras coupled with artificial intelligence has provided a major leap forward in image-based analysis of biological processes. Supervised artificial intelligence algorithms require large ground truth image datasets for mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Navarro, Pedro J, Miller, Leanne, Díaz-Galián, María Victoria, Gila-Navarro, Alberto, Aguila, Diego J, Egea-Cortines, Marcos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9197681/
https://www.ncbi.nlm.nih.gov/pubmed/35701377
http://dx.doi.org/10.1093/gigascience/giac052
_version_ 1784727473724850176
author Navarro, Pedro J
Miller, Leanne
Díaz-Galián, María Victoria
Gila-Navarro, Alberto
Aguila, Diego J
Egea-Cortines, Marcos
author_facet Navarro, Pedro J
Miller, Leanne
Díaz-Galián, María Victoria
Gila-Navarro, Alberto
Aguila, Diego J
Egea-Cortines, Marcos
author_sort Navarro, Pedro J
collection PubMed
description BACKGROUND: The combination of computer vision devices such as multispectral cameras coupled with artificial intelligence has provided a major leap forward in image-based analysis of biological processes. Supervised artificial intelligence algorithms require large ground truth image datasets for model training, which allows to validate or refute research hypotheses and to carry out comparisons between models. However, public datasets of images are scarce and ground truth images are surprisingly few considering the numbers required for training algorithms. RESULTS: We created a dataset of 1,283 multidimensional arrays, using berries from five different grape varieties. Each array has 37 images of wavelengths between 488.38 and 952.76 nm obtained from single berries. Coupled to each multispectral image, we added a dataset with measurements including, weight, anthocyanin content, and Brix index for each independent grape. Thus, the images have paired measures, creating a ground truth dataset. We tested the dataset with 2 neural network algorithms: multilayer perceptron (MLP) and 3-dimensional convolutional neural network (3D-CNN). A perfect (100% accuracy) classification model was fit with either the MLP or 3D-CNN algorithms. CONCLUSIONS: This is the first public dataset of grape ground truth multispectral images. Associated with each multispectral image, there are measures of the weight, anthocyanins, and Brix index. The dataset should be useful to develop deep learning algorithms for classification, dimensionality reduction, regression, and prediction analysis.
format Online
Article
Text
id pubmed-9197681
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-91976812022-06-15 A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines Navarro, Pedro J Miller, Leanne Díaz-Galián, María Victoria Gila-Navarro, Alberto Aguila, Diego J Egea-Cortines, Marcos Gigascience Data Note BACKGROUND: The combination of computer vision devices such as multispectral cameras coupled with artificial intelligence has provided a major leap forward in image-based analysis of biological processes. Supervised artificial intelligence algorithms require large ground truth image datasets for model training, which allows to validate or refute research hypotheses and to carry out comparisons between models. However, public datasets of images are scarce and ground truth images are surprisingly few considering the numbers required for training algorithms. RESULTS: We created a dataset of 1,283 multidimensional arrays, using berries from five different grape varieties. Each array has 37 images of wavelengths between 488.38 and 952.76 nm obtained from single berries. Coupled to each multispectral image, we added a dataset with measurements including, weight, anthocyanin content, and Brix index for each independent grape. Thus, the images have paired measures, creating a ground truth dataset. We tested the dataset with 2 neural network algorithms: multilayer perceptron (MLP) and 3-dimensional convolutional neural network (3D-CNN). A perfect (100% accuracy) classification model was fit with either the MLP or 3D-CNN algorithms. CONCLUSIONS: This is the first public dataset of grape ground truth multispectral images. Associated with each multispectral image, there are measures of the weight, anthocyanins, and Brix index. The dataset should be useful to develop deep learning algorithms for classification, dimensionality reduction, regression, and prediction analysis. Oxford University Press 2022-06-14 /pmc/articles/PMC9197681/ /pubmed/35701377 http://dx.doi.org/10.1093/gigascience/giac052 Text en © The Author(s) 2022. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Note
Navarro, Pedro J
Miller, Leanne
Díaz-Galián, María Victoria
Gila-Navarro, Alberto
Aguila, Diego J
Egea-Cortines, Marcos
A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines
title A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines
title_full A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines
title_fullStr A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines
title_full_unstemmed A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines
title_short A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines
title_sort novel ground truth multispectral image dataset with weight, anthocyanins, and brix index measures of grape berries tested for its utility in machine learning pipelines
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9197681/
https://www.ncbi.nlm.nih.gov/pubmed/35701377
http://dx.doi.org/10.1093/gigascience/giac052
work_keys_str_mv AT navarropedroj anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT millerleanne anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT diazgalianmariavictoria anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT gilanavarroalberto anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT aguiladiegoj anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT egeacortinesmarcos anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT navarropedroj novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT millerleanne novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT diazgalianmariavictoria novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT gilanavarroalberto novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT aguiladiegoj novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines
AT egeacortinesmarcos novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines