Cargando…

Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems

The empirical entropy is a key statistical measure of data frequency vectors, enabling one to estimate how diverse the data are. From the computational point of view, it is important to quickly compute, approximate, or bound the entropy. In a distributed system, the representative (“global”) frequen...

Descripción completa

Detalles Bibliográficos
Autores principales: Shahar, Amit, Alfassi, Yuval, Keren, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689552/
https://www.ncbi.nlm.nih.gov/pubmed/36359705
http://dx.doi.org/10.3390/e24111611
_version_ 1784836563930185728
author Shahar, Amit
Alfassi, Yuval
Keren, Daniel
author_facet Shahar, Amit
Alfassi, Yuval
Keren, Daniel
author_sort Shahar, Amit
collection PubMed
description The empirical entropy is a key statistical measure of data frequency vectors, enabling one to estimate how diverse the data are. From the computational point of view, it is important to quickly compute, approximate, or bound the entropy. In a distributed system, the representative (“global”) frequency vector is the average of the “local” frequency vectors, each residing in a distinct node. Typically, the trivial solution of aggregating the local vectors and computing their average incurs a huge communication overhead. Hence, the challenge is to approximate, or bound, the entropy of the global vector, while reducing communication overhead. In this paper, we develop algorithms which achieve this goal.
format Online
Article
Text
id pubmed-9689552
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96895522022-11-25 Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems Shahar, Amit Alfassi, Yuval Keren, Daniel Entropy (Basel) Article The empirical entropy is a key statistical measure of data frequency vectors, enabling one to estimate how diverse the data are. From the computational point of view, it is important to quickly compute, approximate, or bound the entropy. In a distributed system, the representative (“global”) frequency vector is the average of the “local” frequency vectors, each residing in a distinct node. Typically, the trivial solution of aggregating the local vectors and computing their average incurs a huge communication overhead. Hence, the challenge is to approximate, or bound, the entropy of the global vector, while reducing communication overhead. In this paper, we develop algorithms which achieve this goal. MDPI 2022-11-05 /pmc/articles/PMC9689552/ /pubmed/36359705 http://dx.doi.org/10.3390/e24111611 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shahar, Amit
Alfassi, Yuval
Keren, Daniel
Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems
title Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems
title_full Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems
title_fullStr Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems
title_full_unstemmed Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems
title_short Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems
title_sort communication efficient algorithms for bounding and approximating the empirical entropy in distributed systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689552/
https://www.ncbi.nlm.nih.gov/pubmed/36359705
http://dx.doi.org/10.3390/e24111611
work_keys_str_mv AT shaharamit communicationefficientalgorithmsforboundingandapproximatingtheempiricalentropyindistributedsystems
AT alfassiyuval communicationefficientalgorithmsforboundingandapproximatingtheempiricalentropyindistributedsystems
AT kerendaniel communicationefficientalgorithmsforboundingandapproximatingtheempiricalentropyindistributedsystems