Cargando…
A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines
BACKGROUND: The combination of computer vision devices such as multispectral cameras coupled with artificial intelligence has provided a major leap forward in image-based analysis of biological processes. Supervised artificial intelligence algorithms require large ground truth image datasets for mod...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9197681/ https://www.ncbi.nlm.nih.gov/pubmed/35701377 http://dx.doi.org/10.1093/gigascience/giac052 |
_version_ | 1784727473724850176 |
---|---|
author | Navarro, Pedro J Miller, Leanne Díaz-Galián, María Victoria Gila-Navarro, Alberto Aguila, Diego J Egea-Cortines, Marcos |
author_facet | Navarro, Pedro J Miller, Leanne Díaz-Galián, María Victoria Gila-Navarro, Alberto Aguila, Diego J Egea-Cortines, Marcos |
author_sort | Navarro, Pedro J |
collection | PubMed |
description | BACKGROUND: The combination of computer vision devices such as multispectral cameras coupled with artificial intelligence has provided a major leap forward in image-based analysis of biological processes. Supervised artificial intelligence algorithms require large ground truth image datasets for model training, which allows to validate or refute research hypotheses and to carry out comparisons between models. However, public datasets of images are scarce and ground truth images are surprisingly few considering the numbers required for training algorithms. RESULTS: We created a dataset of 1,283 multidimensional arrays, using berries from five different grape varieties. Each array has 37 images of wavelengths between 488.38 and 952.76 nm obtained from single berries. Coupled to each multispectral image, we added a dataset with measurements including, weight, anthocyanin content, and Brix index for each independent grape. Thus, the images have paired measures, creating a ground truth dataset. We tested the dataset with 2 neural network algorithms: multilayer perceptron (MLP) and 3-dimensional convolutional neural network (3D-CNN). A perfect (100% accuracy) classification model was fit with either the MLP or 3D-CNN algorithms. CONCLUSIONS: This is the first public dataset of grape ground truth multispectral images. Associated with each multispectral image, there are measures of the weight, anthocyanins, and Brix index. The dataset should be useful to develop deep learning algorithms for classification, dimensionality reduction, regression, and prediction analysis. |
format | Online Article Text |
id | pubmed-9197681 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-91976812022-06-15 A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines Navarro, Pedro J Miller, Leanne Díaz-Galián, María Victoria Gila-Navarro, Alberto Aguila, Diego J Egea-Cortines, Marcos Gigascience Data Note BACKGROUND: The combination of computer vision devices such as multispectral cameras coupled with artificial intelligence has provided a major leap forward in image-based analysis of biological processes. Supervised artificial intelligence algorithms require large ground truth image datasets for model training, which allows to validate or refute research hypotheses and to carry out comparisons between models. However, public datasets of images are scarce and ground truth images are surprisingly few considering the numbers required for training algorithms. RESULTS: We created a dataset of 1,283 multidimensional arrays, using berries from five different grape varieties. Each array has 37 images of wavelengths between 488.38 and 952.76 nm obtained from single berries. Coupled to each multispectral image, we added a dataset with measurements including, weight, anthocyanin content, and Brix index for each independent grape. Thus, the images have paired measures, creating a ground truth dataset. We tested the dataset with 2 neural network algorithms: multilayer perceptron (MLP) and 3-dimensional convolutional neural network (3D-CNN). A perfect (100% accuracy) classification model was fit with either the MLP or 3D-CNN algorithms. CONCLUSIONS: This is the first public dataset of grape ground truth multispectral images. Associated with each multispectral image, there are measures of the weight, anthocyanins, and Brix index. The dataset should be useful to develop deep learning algorithms for classification, dimensionality reduction, regression, and prediction analysis. Oxford University Press 2022-06-14 /pmc/articles/PMC9197681/ /pubmed/35701377 http://dx.doi.org/10.1093/gigascience/giac052 Text en © The Author(s) 2022. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Data Note Navarro, Pedro J Miller, Leanne Díaz-Galián, María Victoria Gila-Navarro, Alberto Aguila, Diego J Egea-Cortines, Marcos A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines |
title | A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines |
title_full | A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines |
title_fullStr | A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines |
title_full_unstemmed | A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines |
title_short | A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines |
title_sort | novel ground truth multispectral image dataset with weight, anthocyanins, and brix index measures of grape berries tested for its utility in machine learning pipelines |
topic | Data Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9197681/ https://www.ncbi.nlm.nih.gov/pubmed/35701377 http://dx.doi.org/10.1093/gigascience/giac052 |
work_keys_str_mv | AT navarropedroj anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT millerleanne anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT diazgalianmariavictoria anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT gilanavarroalberto anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT aguiladiegoj anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT egeacortinesmarcos anovelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT navarropedroj novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT millerleanne novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT diazgalianmariavictoria novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT gilanavarroalberto novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT aguiladiegoj novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines AT egeacortinesmarcos novelgroundtruthmultispectralimagedatasetwithweightanthocyaninsandbrixindexmeasuresofgrapeberriestestedforitsutilityinmachinelearningpipelines |