Indian major basmati paddy seed varieties images dataset
The dataset contains images of 10 of the 32 Indian basmati seed varieties notified by the Government of India. The basmati paddy varieties included in the dataset are 1121, 1509, 1637, 1718, 1728, BAS-370, CSR 30, Type-3/Dehraduni Basmati, PB-1 and PB-6. Moreover, several images of other seeds...
Main authors: | Sharma, Arun; Satish, Deepshikha; Sharma, Sushmita; Gupta, Dinesh |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2020 |
Subjects: | Data Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7653079/ https://www.ncbi.nlm.nih.gov/pubmed/33204776 http://dx.doi.org/10.1016/j.dib.2020.106460 |
_version_ | 1783607827512688640 |
---|---|
author | Sharma, Arun Satish, Deepshikha Sharma, Sushmita Gupta, Dinesh |
author_facet | Sharma, Arun Satish, Deepshikha Sharma, Sushmita Gupta, Dinesh |
author_sort | Sharma, Arun |
collection | PubMed |
description | The dataset contains images of 10 of the 32 Indian basmati seed varieties notified by the Government of India. The basmati paddy varieties included are 1121, 1509, 1637, 1718, 1728, BAS-370, CSR 30, Type-3/Dehraduni Basmati, PB-1 and PB-6. Several images of other seeds and related entities available in the household have also been included. The dataset thus contains 11 classes: ten classes hold images of the ten basmati paddy varieties, while the 11th class, named “Unknown”, contains images of a mixture of two morphologically similar paddy varieties (1121 and 1509), different pulses, other grains and related food entities. The Unknown class is useful for discriminating paddy seeds from other types of seeds and related food entities. All images were captured manually under standard conditions using an apparatus developed in-house and a tablet with a five-megapixel (5 MP) camera. The camera was used to capture 3210 RGB colour images in JPG format. Data pre-processing was performed to generate ready-to-use images for training and testing machine learning-based models. AI-based paddy seed variety classification models have been developed using the dataset. The dataset can be used to build different types of AI-based models for adulteration detection and for automated classification (along with independent devices) at the time of rice threshing, and its classification potential can be increased by supplementing it with images of additional basmati varieties. |
format | Online Article Text |
id | pubmed-7653079 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-76530792020-11-16 Indian major basmati paddy seed varieties images dataset Sharma, Arun Satish, Deepshikha Sharma, Sushmita Gupta, Dinesh Data Brief Data Article The dataset contains images of 10 out of 32 notified Indian basmati seeds varieties (by the Government of India). Indian basmati paddy varieties included in the dataset are 1121, 1509, 1637, 1718, 1728, BAS-370, CSR 30, Type-3/Dehraduni Basmati, PB-1 and PB-6. Moreover, several images of other seeds and related entities available in the household have also been included in the dataset. Thus, the dataset contains 11 classes such that ten classes contain images from ten different basmati paddy varieties. In contrast, the 11th class- named “Unknown” contains images from a mixture of two morphologically similar paddy varieties (1121 and 1509), different pulses, other grains and related food entities. The Unknown class is useful in discriminating the paddy seeds from other types of seeds and related food entities. All the images were captured (in standard conditions) manually using an apparatus developed in-house and a tablet with a five-megapixel camera (5MP). The camera was used to capture 3210 RGB coloured images in JPG format. The data pre-processing was performed to generate the ready-to-use images for training and testing machine learning-based models. AI-based paddy seed variety classification models have been developed using the dataset. The dataset can be used to generate different types of AI-based models for adulteration detection, automated classification models (along with independent devices) at the time of rice threshing, and to increase the classification potential (Supplementing images representing additional basmati varieties). 
Elsevier 2020-10-28 /pmc/articles/PMC7653079/ /pubmed/33204776 http://dx.doi.org/10.1016/j.dib.2020.106460 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Data Article Sharma, Arun Satish, Deepshikha Sharma, Sushmita Gupta, Dinesh Indian major basmati paddy seed varieties images dataset |
title | Indian major basmati paddy seed varieties images dataset |
title_full | Indian major basmati paddy seed varieties images dataset |
title_fullStr | Indian major basmati paddy seed varieties images dataset |
title_full_unstemmed | Indian major basmati paddy seed varieties images dataset |
title_short | Indian major basmati paddy seed varieties images dataset |
title_sort | indian major basmati paddy seed varieties images dataset |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7653079/ https://www.ncbi.nlm.nih.gov/pubmed/33204776 http://dx.doi.org/10.1016/j.dib.2020.106460 |
work_keys_str_mv | AT sharmaarun indianmajorbasmatipaddyseedvarietiesimagesdataset AT satishdeepshikha indianmajorbasmatipaddyseedvarietiesimagesdataset AT sharmasushmita indianmajorbasmatipaddyseedvarietiesimagesdataset AT guptadinesh indianmajorbasmatipaddyseedvarietiesimagesdataset |