Cargando…
Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems
The empirical entropy is a key statistical measure of data frequency vectors, enabling one to estimate how diverse the data are. From the computational point of view, it is important to quickly compute, approximate, or bound the entropy. In a distributed system, the representative (“global”) frequen...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689552/ https://www.ncbi.nlm.nih.gov/pubmed/36359705 http://dx.doi.org/10.3390/e24111611 |
_version_ | 1784836563930185728 |
---|---|
author | Shahar, Amit Alfassi, Yuval Keren, Daniel |
author_facet | Shahar, Amit Alfassi, Yuval Keren, Daniel |
author_sort | Shahar, Amit |
collection | PubMed |
description | The empirical entropy is a key statistical measure of data frequency vectors, enabling one to estimate how diverse the data are. From the computational point of view, it is important to quickly compute, approximate, or bound the entropy. In a distributed system, the representative (“global”) frequency vector is the average of the “local” frequency vectors, each residing in a distinct node. Typically, the trivial solution of aggregating the local vectors and computing their average incurs a huge communication overhead. Hence, the challenge is to approximate, or bound, the entropy of the global vector, while reducing communication overhead. In this paper, we develop algorithms which achieve this goal. |
format | Online Article Text |
id | pubmed-9689552 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96895522022-11-25 Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems Shahar, Amit Alfassi, Yuval Keren, Daniel Entropy (Basel) Article The empirical entropy is a key statistical measure of data frequency vectors, enabling one to estimate how diverse the data are. From the computational point of view, it is important to quickly compute, approximate, or bound the entropy. In a distributed system, the representative (“global”) frequency vector is the average of the “local” frequency vectors, each residing in a distinct node. Typically, the trivial solution of aggregating the local vectors and computing their average incurs a huge communication overhead. Hence, the challenge is to approximate, or bound, the entropy of the global vector, while reducing communication overhead. In this paper, we develop algorithms which achieve this goal. MDPI 2022-11-05 /pmc/articles/PMC9689552/ /pubmed/36359705 http://dx.doi.org/10.3390/e24111611 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Shahar, Amit Alfassi, Yuval Keren, Daniel Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems |
title | Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems |
title_full | Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems |
title_fullStr | Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems |
title_full_unstemmed | Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems |
title_short | Communication Efficient Algorithms for Bounding and Approximating the Empirical Entropy in Distributed Systems |
title_sort | communication efficient algorithms for bounding and approximating the empirical entropy in distributed systems |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689552/ https://www.ncbi.nlm.nih.gov/pubmed/36359705 http://dx.doi.org/10.3390/e24111611 |
work_keys_str_mv | AT shaharamit communicationefficientalgorithmsforboundingandapproximatingtheempiricalentropyindistributedsystems AT alfassiyuval communicationefficientalgorithmsforboundingandapproximatingtheempiricalentropyindistributedsystems AT kerendaniel communicationefficientalgorithmsforboundingandapproximatingtheempiricalentropyindistributedsystems |