Cargando…

FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis

With the surge of COVID-19 pandemic, the world is moving towards digitization and automation more than it was presumed. The Internet is becoming one of the popular mediums for communication, and multimedia (image, audio, and video) combined with data compression techniques play a pivotal role in han...

Descripción completa

Detalles Bibliográficos
Autores principales: Rajesh, Bulla, Javed, Mohammed, Nagabhushan, P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8764884/
https://www.ncbi.nlm.nih.gov/pubmed/35068992
http://dx.doi.org/10.1007/s11042-021-11858-0
_version_ 1784634253685817344
author Rajesh, Bulla
Javed, Mohammed
Nagabhushan, P.
author_facet Rajesh, Bulla
Javed, Mohammed
Nagabhushan, P.
author_sort Rajesh, Bulla
collection PubMed
description With the surge of COVID-19 pandemic, the world is moving towards digitization and automation more than it was presumed. The Internet is becoming one of the popular mediums for communication, and multimedia (image, audio, and video) combined with data compression techniques play a pivotal role in handling a huge volume of data that is being generated on a daily basis. Developing novel algorithms for automatic analysis of compressed data without decompression is the need of the present hour. JPEG is a popular compression algorithm supported in the digital electronics world that achieves compression by dividing the whole image into non-overlapping blocks of 8 × 8 pixels, and subsequently transforming each block using Discrete Cosine Transform (DCT). This research paper proposes to carry out Fast and Smooth Segmentation (FastSS) directly in JPEG compressed printed text document images at text-line and word-level using DC and AC signals. From each 8 × 8 block, DC and AC signals are analyzed for accomplishing Fast and Smooth segmentation, and subsequently, two Faster segmentation (MFastSS) algorithms are also devised using low resolution-images generated by mapping the DC signal (DC Reduced Image) and encoded DCT (ECM Image) coefficients separately. Proposed models are tested on various JPEG compressed printed text document images created with varied space and fonts. The experimental results have demonstrated that the direct analysis of compressed streams is computationally efficient, and has achieved speed gain more than 90% when compared to uncompressed domains.
format Online
Article
Text
id pubmed-8764884
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-87648842022-01-18 FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis Rajesh, Bulla Javed, Mohammed Nagabhushan, P. Multimed Tools Appl 1209: Recent Advances on Social Media Analytics and Multimedia Systems: Issues and Challenges With the surge of COVID-19 pandemic, the world is moving towards digitization and automation more than it was presumed. The Internet is becoming one of the popular mediums for communication, and multimedia (image, audio, and video) combined with data compression techniques play a pivotal role in handling a huge volume of data that is being generated on a daily basis. Developing novel algorithms for automatic analysis of compressed data without decompression is the need of the present hour. JPEG is a popular compression algorithm supported in the digital electronics world that achieves compression by dividing the whole image into non-overlapping blocks of 8 × 8 pixels, and subsequently transforming each block using Discrete Cosine Transform (DCT). This research paper proposes to carry out Fast and Smooth Segmentation (FastSS) directly in JPEG compressed printed text document images at text-line and word-level using DC and AC signals. From each 8 × 8 block, DC and AC signals are analyzed for accomplishing Fast and Smooth segmentation, and subsequently, two Faster segmentation (MFastSS) algorithms are also devised using low resolution-images generated by mapping the DC signal (DC Reduced Image) and encoded DCT (ECM Image) coefficients separately. Proposed models are tested on various JPEG compressed printed text document images created with varied space and fonts. The experimental results have demonstrated that the direct analysis of compressed streams is computationally efficient, and has achieved speed gain more than 90% when compared to uncompressed domains. Springer US 2022-01-18 2023 /pmc/articles/PMC8764884/ /pubmed/35068992 http://dx.doi.org/10.1007/s11042-021-11858-0 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle 1209: Recent Advances on Social Media Analytics and Multimedia Systems: Issues and Challenges
Rajesh, Bulla
Javed, Mohammed
Nagabhushan, P.
FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis
title FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis
title_full FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis
title_fullStr FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis
title_full_unstemmed FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis
title_short FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis
title_sort fastss: fast and smooth segmentation of jpeg compressed printed text documents using dc and ac signal analysis
topic 1209: Recent Advances on Social Media Analytics and Multimedia Systems: Issues and Challenges
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8764884/
https://www.ncbi.nlm.nih.gov/pubmed/35068992
http://dx.doi.org/10.1007/s11042-021-11858-0
work_keys_str_mv AT rajeshbulla fastssfastandsmoothsegmentationofjpegcompressedprintedtextdocumentsusingdcandacsignalanalysis
AT javedmohammed fastssfastandsmoothsegmentationofjpegcompressedprintedtextdocumentsusingdcandacsignalanalysis
AT nagabhushanp fastssfastandsmoothsegmentationofjpegcompressedprintedtextdocumentsusingdcandacsignalanalysis