
Compression of Text in Selected Languages—Efficiency, Volume, and Time Comparison

The goal of the research was to study the possibility of using the planned language Esperanto for text compression, and to compare the results of the text compression in Esperanto with the compression in natural languages, represented by Polish and English. The authors performed text compression in...


Bibliographic Details
Main Authors: Stecuła, Beniamin, Stecuła, Kinga, Kapczyński, Adrian
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460191/
https://www.ncbi.nlm.nih.gov/pubmed/36080852
http://dx.doi.org/10.3390/s22176393
_version_ 1784786686500143104
author Stecuła, Beniamin
Stecuła, Kinga
Kapczyński, Adrian
author_facet Stecuła, Beniamin
Stecuła, Kinga
Kapczyński, Adrian
author_sort Stecuła, Beniamin
collection PubMed
description The goal of the research was to study the possibility of using the planned language Esperanto for text compression, and to compare the results of text compression in Esperanto with compression in natural languages, represented by Polish and English. The authors performed text compression in a purpose-built Python program using four compression algorithms (zlib, lzma, bz2, and lz4) on four versions of the text: Polish, English, Esperanto, and Esperanto in x notation (without characters outside ASCII encoding). After creating the compression program and compressing the texts, the authors compared compression time and the volume of the text before and after compression. The results of the study confirmed the hypothesis that the planned language, Esperanto, gives better text compression results than the natural languages represented by Polish and English. The confirmation by scientific methods that Esperanto is better suited to text compression is the scientific added value of the paper.
format Online
Article
Text
id pubmed-9460191
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94601912022-09-10 Compression of Text in Selected Languages—Efficiency, Volume, and Time Comparison Stecuła, Beniamin Stecuła, Kinga Kapczyński, Adrian Sensors (Basel) Article The goal of the research was to study the possibility of using the planned language Esperanto for text compression, and to compare the results of text compression in Esperanto with compression in natural languages, represented by Polish and English. The authors performed text compression in a purpose-built Python program using four compression algorithms (zlib, lzma, bz2, and lz4) on four versions of the text: Polish, English, Esperanto, and Esperanto in x notation (without characters outside ASCII encoding). After creating the compression program and compressing the texts, the authors compared compression time and the volume of the text before and after compression. The results of the study confirmed the hypothesis that the planned language, Esperanto, gives better text compression results than the natural languages represented by Polish and English. The confirmation by scientific methods that Esperanto is better suited to text compression is the scientific added value of the paper. MDPI 2022-08-25 /pmc/articles/PMC9460191/ /pubmed/36080852 http://dx.doi.org/10.3390/s22176393 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Stecuła, Beniamin
Stecuła, Kinga
Kapczyński, Adrian
Compression of Text in Selected Languages—Efficiency, Volume, and Time Comparison
title Compression of Text in Selected Languages—Efficiency, Volume, and Time Comparison
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460191/
https://www.ncbi.nlm.nih.gov/pubmed/36080852
http://dx.doi.org/10.3390/s22176393
work_keys_str_mv AT stecułabeniamin compressionoftextinselectedlanguagesefficiencyvolumeandtimecomparison
AT stecułakinga compressionoftextinselectedlanguagesefficiencyvolumeandtimecomparison
AT kapczynskiadrian compressionoftextinselectedlanguagesefficiencyvolumeandtimecomparison
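
The description above outlines a Python program that compresses the same text with several algorithms and compares output size and compression time, including a variant of the Esperanto text in x notation. The following is a minimal sketch of that kind of measurement, not the authors' program: the names `to_x_notation`, `COMPRESSORS`, and `measure` are hypothetical, UTF-8 input is an assumption, and the sample sentence is only illustrative. The zlib, lzma, and bz2 modules ship with Python; lz4 is assumed to come from the third-party `lz4` package and is skipped if it is not installed.

```python
import bz2
import lzma
import time
import zlib

# Esperanto x notation: each accented letter is written as its
# base letter followed by "x", which keeps the text inside ASCII.
X_NOTATION = str.maketrans({
    "ĉ": "cx", "ĝ": "gx", "ĥ": "hx", "ĵ": "jx", "ŝ": "sx", "ŭ": "ux",
    "Ĉ": "Cx", "Ĝ": "Gx", "Ĥ": "Hx", "Ĵ": "Jx", "Ŝ": "Sx", "Ŭ": "Ux",
})


def to_x_notation(text: str) -> str:
    """Return the text with Esperanto diacritics replaced by x notation."""
    return text.translate(X_NOTATION)


# Hypothetical registry of compressors; all use their default settings.
COMPRESSORS = {
    "zlib": zlib.compress,
    "lzma": lzma.compress,
    "bz2": bz2.compress,
}

try:
    # Assumption: the third-party "lz4" package is installed.
    import lz4.frame
    COMPRESSORS["lz4"] = lz4.frame.compress
except ImportError:
    pass  # lz4 is optional in this sketch


def measure(text: str, encoding: str = "utf-8") -> dict:
    """Compress the text with every registered algorithm.

    Returns, per algorithm, the byte count before and after
    compression and the wall-clock time of the compress call.
    """
    raw = text.encode(encoding)
    results = {}
    for name, compress in COMPRESSORS.items():
        start = time.perf_counter()
        packed = compress(raw)
        elapsed = time.perf_counter() - start
        results[name] = {
            "original_bytes": len(raw),
            "compressed_bytes": len(packed),
            "seconds": elapsed,
        }
    return results


if __name__ == "__main__":
    sample = "Eĥoŝanĝo ĉiuĵaŭde. " * 200  # illustrative text only
    for label, text in (("unicode", sample), ("x notation", to_x_notation(sample))):
        for name, row in measure(text).items():
            print(label, name, row)
```

A single timed call is noisy, so a real comparison would repeat each measurement and use texts long enough for the timing to be meaningful; this sketch shows only the shape of the comparison and makes no claim about the reported results.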