Indian major basmati paddy seed varieties images dataset

The dataset contains images of 10 of the 32 basmati seed varieties notified by the Government of India. The Indian basmati paddy varieties included are 1121, 1509, 1637, 1718, 1728, BAS-370, CSR 30, Type-3/Dehraduni Basmati, PB-1 and PB-6. Moreover, several images of other seeds...

Bibliographic Details
Main authors: Sharma, Arun; Satish, Deepshikha; Sharma, Sushmita; Gupta, Dinesh
Format: Online Article Text
Language: English
Published: Elsevier 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7653079/
https://www.ncbi.nlm.nih.gov/pubmed/33204776
http://dx.doi.org/10.1016/j.dib.2020.106460
_version_ 1783607827512688640
author Sharma, Arun
Satish, Deepshikha
Sharma, Sushmita
Gupta, Dinesh
collection PubMed
description The dataset contains images of 10 of the 32 basmati seed varieties notified by the Government of India. The Indian basmati paddy varieties included are 1121, 1509, 1637, 1718, 1728, BAS-370, CSR 30, Type-3/Dehraduni Basmati, PB-1 and PB-6. Several images of other seeds and related entities available in the household are also included. The dataset thus contains 11 classes: ten classes hold images of the ten basmati paddy varieties, while the 11th class, named “Unknown”, holds images of a mixture of two morphologically similar paddy varieties (1121 and 1509), different pulses, other grains and related food entities. The Unknown class is useful for discriminating paddy seeds from other types of seeds and related food entities. All images were captured manually under standard conditions using an apparatus developed in-house and a tablet with a five-megapixel (5 MP) camera. The camera was used to capture 3210 RGB colour images in JPG format. Data pre-processing was performed to generate ready-to-use images for training and testing machine learning-based models. AI-based paddy seed variety classification models have been developed using the dataset. The dataset can be used to build AI-based models for adulteration detection and automated classification models (along with independent devices) for use at the time of rice threshing, and its classification potential can be increased by supplementing it with images representing additional basmati varieties.
format Online
Article
Text
id pubmed-7653079
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-7653079 2020-11-16 Indian major basmati paddy seed varieties images dataset Sharma, Arun; Satish, Deepshikha; Sharma, Sushmita; Gupta, Dinesh Data Brief Data Article
Elsevier 2020-10-28 /pmc/articles/PMC7653079/ /pubmed/33204776 http://dx.doi.org/10.1016/j.dib.2020.106460 Text en © 2020 The Author(s). This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
title Indian major basmati paddy seed varieties images dataset
topic Data Article