Cargando…

Extreme Compression for Large Scale Data Store

For the last 5 years Accelogic pioneered and perfected a radically new theory of numerical computing codenamed “Compressive Computing”, which has an extremely profound impact on real-world computer science [1]. At the core of this new theory is the discovery of one of its fundamental theorems which...

Descripción completa

Detalles Bibliográficos
Autores principales: Lauret, Jérôme, Gonzalez, Juan, Van Buren, Gene, Nuñez, Rafael, Canal, Philippe, Naumann, Axel
Lenguaje:eng
Publicado: 2020
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/202024506024
http://cds.cern.ch/record/2752305
_version_ 1780969335129374720
author Lauret, Jérôme
Gonzalez, Juan
Van Buren, Gene
Nuñez, Rafael
Canal, Philippe
Naumann, Axel
author_facet Lauret, Jérôme
Gonzalez, Juan
Van Buren, Gene
Nuñez, Rafael
Canal, Philippe
Naumann, Axel
author_sort Lauret, Jérôme
collection CERN
description For the last 5 years Accelogic pioneered and perfected a radically new theory of numerical computing codenamed “Compressive Computing”, which has an extremely profound impact on real-world computer science [1]. At the core of this new theory is the discovery of one of its fundamental theorems which states that, under very general conditions, the vast majority (typically between 70% and 80%) of the bits used in modern large-scale numerical computations are absolutely irrelevant for the accuracy of the end result. This theory of Compressive Computing provides mechanisms able to identify (with high intelligence and surgical accuracy) the number of bits (i.e., the precision) that can be used to represent numbers without affecting the substance of the end results, as they are computed and vary in real time. The bottom line outcome would be to provide a state-of-the-art compression algorithm that surpasses those currently available in the ROOT framework, with the purpose of enabling substantial economic and operational gains (including speedup) for High Energy and Nuclear Physics data storage/analysis. In our initial studies, a factor of nearly x4 (3.9) compression was achieved with RHIC/STAR data where ROOT compression managed only x1.4.In this contribution, we will present our concepts of “functionally lossless compression”, have a glance at examples and achievements in other communities, present the results and outcome of our current, ongoing R&D;, as well as present a high-level view of our plan to move forward with a ROOT implementation that would deliver a basic solution readily integrated into HENP applications. As a collaboration of experimental scientists, private industry, and the ROOT Team, our aim is to capitalize on the substantial success delivered by the initial effort and produce a robust technology properly packaged as an open-source tool that could be used by virtually every experiment around the world as means for improving data management and accessibility.
id oai-inspirehep.net-1832221
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
record_format invenio
spelling oai-inspirehep.net-18322212022-08-17T13:58:23Zdoi:10.1051/epjconf/202024506024http://cds.cern.ch/record/2752305engLauret, JérômeGonzalez, JuanVan Buren, GeneNuñez, RafaelCanal, PhilippeNaumann, AxelExtreme Compression for Large Scale Data StoreComputing and ComputersParticle Physics - ExperimentFor the last 5 years Accelogic pioneered and perfected a radically new theory of numerical computing codenamed “Compressive Computing”, which has an extremely profound impact on real-world computer science [1]. At the core of this new theory is the discovery of one of its fundamental theorems which states that, under very general conditions, the vast majority (typically between 70% and 80%) of the bits used in modern large-scale numerical computations are absolutely irrelevant for the accuracy of the end result. This theory of Compressive Computing provides mechanisms able to identify (with high intelligence and surgical accuracy) the number of bits (i.e., the precision) that can be used to represent numbers without affecting the substance of the end results, as they are computed and vary in real time. The bottom line outcome would be to provide a state-of-the-art compression algorithm that surpasses those currently available in the ROOT framework, with the purpose of enabling substantial economic and operational gains (including speedup) for High Energy and Nuclear Physics data storage/analysis. In our initial studies, a factor of nearly x4 (3.9) compression was achieved with RHIC/STAR data where ROOT compression managed only x1.4.In this contribution, we will present our concepts of “functionally lossless compression”, have a glance at examples and achievements in other communities, present the results and outcome of our current, ongoing R&D;, as well as present a high-level view of our plan to move forward with a ROOT implementation that would deliver a basic solution readily integrated into HENP applications. As a collaboration of experimental scientists, private industry, and the ROOT Team, our aim is to capitalize on the substantial success delivered by the initial effort and produce a robust technology properly packaged as an open-source tool that could be used by virtually every experiment around the world as means for improving data management and accessibility.FERMILAB-CONF-20-634-SCDoai:inspirehep.net:18322212020
spellingShingle Computing and Computers
Particle Physics - Experiment
Lauret, Jérôme
Gonzalez, Juan
Van Buren, Gene
Nuñez, Rafael
Canal, Philippe
Naumann, Axel
Extreme Compression for Large Scale Data Store
title Extreme Compression for Large Scale Data Store
title_full Extreme Compression for Large Scale Data Store
title_fullStr Extreme Compression for Large Scale Data Store
title_full_unstemmed Extreme Compression for Large Scale Data Store
title_short Extreme Compression for Large Scale Data Store
title_sort extreme compression for large scale data store
topic Computing and Computers
Particle Physics - Experiment
url https://dx.doi.org/10.1051/epjconf/202024506024
http://cds.cern.ch/record/2752305
work_keys_str_mv AT lauretjerome extremecompressionforlargescaledatastore
AT gonzalezjuan extremecompressionforlargescaledatastore
AT vanburengene extremecompressionforlargescaledatastore
AT nunezrafael extremecompressionforlargescaledatastore
AT canalphilippe extremecompressionforlargescaledatastore
AT naumannaxel extremecompressionforlargescaledatastore