
BanglaLekha-Isolated: A multi-purpose comprehensive dataset of Handwritten Bangla Isolated characters

BanglaLekha-Isolated, a dataset of handwritten isolated Bangla characters, is presented in this article. The dataset contains 84 different characters: 50 Bangla basic characters, 10 Bangla numerals and 24 selected compound characters. 2000 handwriting samples for each of the 84 characters w...


Bibliographic Details
Main authors: Biswas, Mithun; Islam, Rafiqul; Shom, Gautam Kumar; Shopon, Md.; Mohammed, Nabeel; Momen, Sifat; Abedin, Anowarul
Format: Online Article Text
Language: English
Published: Elsevier 2017
Subjects: Data Article
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5382023/
https://www.ncbi.nlm.nih.gov/pubmed/28409178
http://dx.doi.org/10.1016/j.dib.2017.03.035
collection PubMed
description BanglaLekha-Isolated, a dataset of handwritten isolated Bangla characters, is presented in this article. The dataset contains 84 different characters: 50 Bangla basic characters, 10 Bangla numerals and 24 selected compound characters. 2000 handwriting samples were collected for each of the 84 characters, then digitized and pre-processed. After mistakes and scribbles were discarded, 166,105 handwritten character images were included in the final dataset. The dataset also includes labels indicating the age and gender of the subjects from whom the samples were collected. It can therefore be used not only for optical handwriting recognition research but also to explore the influence of gender and age on handwriting. The dataset is publicly available at https://data.mendeley.com/datasets/hf6sf8zrkc/2.
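The figures in the description can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that exactly 2000 samples were collected for every one of the 84 characters (the description gives this as the nominal figure), so the derived count of discarded images is an estimate, not a number reported in the record. The function and constant names are hypothetical.

```python
# Class counts as stated in the description.
BASIC_CHARACTERS = 50
NUMERALS = 10
COMPOUND_CHARACTERS = 24

# Nominal samples per character (assumed exact for this estimate).
SAMPLES_PER_CHARACTER = 2000

# Final image count; written "1,66,105" in Indian digit grouping.
FINAL_IMAGES = 166_105


def dataset_summary():
    """Derive summary figures from the counts given in the description."""
    classes = BASIC_CHARACTERS + NUMERALS + COMPOUND_CHARACTERS
    collected = classes * SAMPLES_PER_CHARACTER
    discarded = collected - FINAL_IMAGES
    return {
        "classes": classes,
        "collected": collected,
        "discarded": discarded,
        "retained_fraction": FINAL_IMAGES / collected,
        "mean_images_per_class": FINAL_IMAGES / classes,
    }


if __name__ == "__main__":
    for name, value in dataset_summary().items():
        print(f"{name}: {value}")
```

Under that assumption, 168,000 samples were collected and about 1.1% of them were discarded as mistakes or scribbles.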
id pubmed-5382023
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-5382023 2017-04-13. Data Brief, Data Article. Elsevier, 2017-03-29. Text, en. © 2017 The Authors. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
topic Data Article