Cargando…

Storage Space Allocation Strategy for Digital Data with Message Importance

This paper mainly focuses on the problem of lossy compression storage based on the data value that represents the subjective assessment of users when the storage size is still not enough after the conventional lossless data compression. To this end, we transform this problem to an optimization, whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Shanyun, She, Rui, Zhu, Zheqi, Fan, Pingyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517127/
https://www.ncbi.nlm.nih.gov/pubmed/33286363
http://dx.doi.org/10.3390/e22050591
_version_ 1783587158471213056
author Liu, Shanyun
She, Rui
Zhu, Zheqi
Fan, Pingyi
author_facet Liu, Shanyun
She, Rui
Zhu, Zheqi
Fan, Pingyi
author_sort Liu, Shanyun
collection PubMed
description This paper mainly focuses on the problem of lossy compression storage based on the data value that represents the subjective assessment of users when the storage size is still not enough after the conventional lossless data compression. To this end, we transform this problem to an optimization, which pursues the least importance-weighted reconstruction error in data reconstruction within limited total storage size, where the importance is adopted to characterize the data value from the viewpoint of users. Based on it, this paper puts forward an optimal allocation strategy in the storage of digital data by the exponential distortion measurement, which can make rational use of all the storage space. In fact, the theoretical results show that it is a kind of restrictive water-filling. It also characterizes the trade-off between the relative weighted reconstruction error and the available storage size. Consequently, if a relatively small part of total data value is allowed to lose, this strategy will improve the performance of data compression. Furthermore, this paper also presents that both the users’ preferences and the special characteristics of data distribution can trigger the small-probability event scenarios where only a fraction of data can cover the vast majority of users’ interests. Whether it is for one of the reasons above, the data with highly clustered message importance is beneficial to compression storage. In contrast, from the perspective of optimal storage space allocation based on data value, the data with a uniform information distribution is incompressible, which is consistent with that in the information theory.
format Online
Article
Text
id pubmed-7517127
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75171272020-11-09 Storage Space Allocation Strategy for Digital Data with Message Importance Liu, Shanyun She, Rui Zhu, Zheqi Fan, Pingyi Entropy (Basel) Article This paper mainly focuses on the problem of lossy compression storage based on the data value that represents the subjective assessment of users when the storage size is still not enough after the conventional lossless data compression. To this end, we transform this problem to an optimization, which pursues the least importance-weighted reconstruction error in data reconstruction within limited total storage size, where the importance is adopted to characterize the data value from the viewpoint of users. Based on it, this paper puts forward an optimal allocation strategy in the storage of digital data by the exponential distortion measurement, which can make rational use of all the storage space. In fact, the theoretical results show that it is a kind of restrictive water-filling. It also characterizes the trade-off between the relative weighted reconstruction error and the available storage size. Consequently, if a relatively small part of total data value is allowed to lose, this strategy will improve the performance of data compression. Furthermore, this paper also presents that both the users’ preferences and the special characteristics of data distribution can trigger the small-probability event scenarios where only a fraction of data can cover the vast majority of users’ interests. Whether it is for one of the reasons above, the data with highly clustered message importance is beneficial to compression storage. In contrast, from the perspective of optimal storage space allocation based on data value, the data with a uniform information distribution is incompressible, which is consistent with that in the information theory. MDPI 2020-05-25 /pmc/articles/PMC7517127/ /pubmed/33286363 http://dx.doi.org/10.3390/e22050591 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Shanyun
She, Rui
Zhu, Zheqi
Fan, Pingyi
Storage Space Allocation Strategy for Digital Data with Message Importance
title Storage Space Allocation Strategy for Digital Data with Message Importance
title_full Storage Space Allocation Strategy for Digital Data with Message Importance
title_fullStr Storage Space Allocation Strategy for Digital Data with Message Importance
title_full_unstemmed Storage Space Allocation Strategy for Digital Data with Message Importance
title_short Storage Space Allocation Strategy for Digital Data with Message Importance
title_sort storage space allocation strategy for digital data with message importance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517127/
https://www.ncbi.nlm.nih.gov/pubmed/33286363
http://dx.doi.org/10.3390/e22050591
work_keys_str_mv AT liushanyun storagespaceallocationstrategyfordigitaldatawithmessageimportance
AT sherui storagespaceallocationstrategyfordigitaldatawithmessageimportance
AT zhuzheqi storagespaceallocationstrategyfordigitaldatawithmessageimportance
AT fanpingyi storagespaceallocationstrategyfordigitaldatawithmessageimportance