Cargando…

Generation of simulated data for Bengali text localization in natural images

In the domain of vision-based applications, the importance of text cannot be underestimated due to its natural capacity to provide accurate and comprehensive information. The application of scene text editing systems enables the modification and enhancement of textual material included in natural im...

Descripción completa

Detalles Bibliográficos
Autores principales: Saha, Sourav, Arafat, Md. Easin, Palash, Md Aminul Haque, Farid, Dewan Md, Kaiser, M. Shamim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562664/
https://www.ncbi.nlm.nih.gov/pubmed/37822883
http://dx.doi.org/10.1016/j.dib.2023.109568
Descripción
Sumario:In the domain of vision-based applications, the importance of text cannot be underestimated due to its natural capacity to provide accurate and comprehensive information. The application of scene text editing systems enables the modification and enhancement of textual material included in natural images while maintaining the integrity of the overall visual layout. The complexity of keeping the original background context and font styles when altering, however, is an extremely difficult challenge considering the changed image must perfectly blend with the original without being altered. This article contains significant simulated data on the dynamic features of digital image editing, advertising, content development, and related fields. The system comprises key components such as 2D simulated text on the styled image [Formula: see text] , text image [Formula: see text] , masking of text [Formula: see text] , real background image [Formula: see text] , real sample image [Formula: see text] , text skeleton [Formula: see text] , and text styled image [Formula: see text]. The source dataset contains diverse components such as background images, color variations, fonts, and text content, while the synthetic dataset consists of 49,000 randomly generated images. The dataset provides both researchers and practitioners with a rich resource for identifying and evaluating these dynamic features. The dataset is publicly accessible via the link: https://data.mendeley.com/datasets/h9kry9y46s/3