Cargando…

ROOT — A C++ framework for petabyte data storage, statistical analysis and visualization

ROOT is an object-oriented C++ framework conceived in the high-energy physics (HEP) community, designed for storing and analyzing petabytes of data in an efficient way. Any instance of a C++ class can be stored into a ROOT file in a machine-independent compressed binary format. In ROOT the TTree obj...

Descripción completa

Detalles Bibliográficos
Autores principales: Antcheva, I., Ballintijn, M., Bellenot, B., Biskup, M., Brun, R., Buncic, N., Canal, Ph., Casadei, D., Couet, O., Fine, V., Franco, L., Ganis, G., Gheata, A., Gonzalez Maline, D., Goto, M., Iwaszkiewicz, J., Kreshuk, A., Marcos Segura, D., Maunder, R., Moneta, L., Naumann, A., Offermann, E., Onuchin, V., Panacek, S., Rademakers, F., Russo, P., Tadel, M.
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:https://dx.doi.org/10.1016/j.cpc.2009.08.005
http://cds.cern.ch/record/1262045
_version_ 1780920001389133824
author Antcheva, I.
Ballintijn, M.
Bellenot, B.
Biskup, M.
Brun, R.
Buncic, N.
Canal, Ph.
Casadei, D.
Couet, O.
Fine, V.
Franco, L.
Ganis, G.
Gheata, A.
Gonzalez Maline, D.
Goto, M.
Iwaszkiewicz, J.
Kreshuk, A.
Marcos Segura, D.
Maunder, R.
Moneta, L.
Naumann, A.
Offermann, E.
Onuchin, V.
Panacek, S.
Rademakers, F.
Russo, P.
Tadel, M.
author_facet Antcheva, I.
Ballintijn, M.
Bellenot, B.
Biskup, M.
Brun, R.
Buncic, N.
Canal, Ph.
Casadei, D.
Couet, O.
Fine, V.
Franco, L.
Ganis, G.
Gheata, A.
Gonzalez Maline, D.
Goto, M.
Iwaszkiewicz, J.
Kreshuk, A.
Marcos Segura, D.
Maunder, R.
Moneta, L.
Naumann, A.
Offermann, E.
Onuchin, V.
Panacek, S.
Rademakers, F.
Russo, P.
Tadel, M.
author_sort Antcheva, I.
collection CERN
description ROOT is an object-oriented C++ framework conceived in the high-energy physics (HEP) community, designed for storing and analyzing petabytes of data in an efficient way. Any instance of a C++ class can be stored into a ROOT file in a machine-independent compressed binary format. In ROOT the TTree object container is optimized for statistical data analysis over very large data sets by using vertical data storage techniques. These containers can span a large number of files on local disks, the web, or a number of different shared file systems. In order to analyze this data, the user can chose out of a wide set of mathematical and statistical functions, including linear algebra classes, numerical algorithms such as integration and minimization, and various methods for performing regression analysis (fitting). In particular, the RooFit package allows the user to perform complex data modeling and fitting while the RooStats library provides abstractions and implementations for advanced statistical tools. Multivariate classification methods based on machine learning techniques are available via the TMVA package. A central piece in these analysis tools are the histogram classes which provide binning of one- and multi-dimensional data. Results can be saved in high-quality graphical formats like Postscript and PDF or in bitmap formats like JPG or GIF. The result can also be stored into ROOT macros that allow a full recreation and rework of the graphics. Users typically create their analysis macros step by step, making use of the interactive C++ interpreter CINT, while running over small data samples. Once the development is finished, they can run these macros at full compiled speed over large data sets, using onthe- fly compilation, or by creating a stand-alone batch program. Finally, if processing farms are available, the user can reduce the execution time of intrinsically parallel tasks — e.g. data mining in HEP — by using PROOF, which will take care of optimally distributing the work over the available resources in a transparent way.
id cern-1262045
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling cern-12620452023-01-31T08:19:39Zdoi:10.1016/j.cpc.2009.08.005http://cds.cern.ch/record/1262045engAntcheva, I.Ballintijn, M.Bellenot, B.Biskup, M.Brun, R.Buncic, N.Canal, Ph.Casadei, D.Couet, O.Fine, V.Franco, L.Ganis, G.Gheata, A.Gonzalez Maline, D.Goto, M.Iwaszkiewicz, J.Kreshuk, A.Marcos Segura, D.Maunder, R.Moneta, L.Naumann, A.Offermann, E.Onuchin, V.Panacek, S.Rademakers, F.Russo, P.Tadel, M.ROOT — A C++ framework for petabyte data storage, statistical analysis and visualizationComputing and ComputersOther Fields of PhysicsROOT is an object-oriented C++ framework conceived in the high-energy physics (HEP) community, designed for storing and analyzing petabytes of data in an efficient way. Any instance of a C++ class can be stored into a ROOT file in a machine-independent compressed binary format. In ROOT the TTree object container is optimized for statistical data analysis over very large data sets by using vertical data storage techniques. These containers can span a large number of files on local disks, the web, or a number of different shared file systems. In order to analyze this data, the user can chose out of a wide set of mathematical and statistical functions, including linear algebra classes, numerical algorithms such as integration and minimization, and various methods for performing regression analysis (fitting). In particular, the RooFit package allows the user to perform complex data modeling and fitting while the RooStats library provides abstractions and implementations for advanced statistical tools. Multivariate classification methods based on machine learning techniques are available via the TMVA package. A central piece in these analysis tools are the histogram classes which provide binning of one- and multi-dimensional data. Results can be saved in high-quality graphical formats like Postscript and PDF or in bitmap formats like JPG or GIF. The result can also be stored into ROOT macros that allow a full recreation and rework of the graphics. Users typically create their analysis macros step by step, making use of the interactive C++ interpreter CINT, while running over small data samples. Once the development is finished, they can run these macros at full compiled speed over large data sets, using onthe- fly compilation, or by creating a stand-alone batch program. Finally, if processing farms are available, the user can reduce the execution time of intrinsically parallel tasks — e.g. data mining in HEP — by using PROOF, which will take care of optimally distributing the work over the available resources in a transparent way.ROOT is an object-oriented C++ framework conceived in the high-energy physics (HEP) community, designed for storing and analyzing petabytes of data in an efficient way. Any instance of a C++ class can be stored into a ROOT file in a machine-indepe ndent compressed binary format. In ROOT the TTree object container is optimized for statistical data analysis over very large data sets by using vertical data storage techniques. These containers can span a large number of files on local disks, the web, o r a number of different shared file systems. In order to analyze this data, the user can chose out of a wide set of mathematical and statistical functions, including linear algebra classes, numerical algorithms such as integration and minimization, and va rious methods for performing regression analysis (fitting). In particular, the RooFit package allows the user to perform complex data modeling and fitting while the RooStats library provides abstractions and implementations for advanced statistical tools. Multivariate classification methods based on machine learning techniques are available via the TMVA package. A central piece in these analysis tools are the histogram classes which provide binning of one- and multi-dimensional data. Results can be saved in high-quality graphical formats like Postscript and PDF or in bitmap formats like JPG or GIF. The result can also be stored into ROOT macros that allow a full recreation and rework of the graphics. Users typically create their analysis macros step by st ep, making use of the interactive C++ interpreter CINT, while running over small data samples. Once the development is finished, they can run these macros at full compiled speed over large data sets, using on-the-fly compilation, or by creating a stand-al one batch program. Finally, if processing farms are available, the user can reduce the execution time of intrinsically parallel tasks - e.g. data mining in HEP - by using PROOF, which will take care of optimally distributing the work over the available re sources in a transparent way. Program summary: Program title: ROOT Catalogue identifier: AEFA_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AEFA_v1_0.html Program obtainable from: CPC Program Library, Queen's University, Belfast, N. Ireland L icensing provisions: LGPL No. of lines in distributed program, including test data, etc.: 3@?044@?581 No. of bytes in distributed program, including test data, etc.: 36@?325@?133 Distribution format: tar.gz Programming language: C++ Computer: Intel i386, Intel x86-64, Motorola PPC, Sun Sparc, HP PA-RISC Operating system: GNU/Linux, Windows XP/Vista, Mac OS X, FreeBSD, OpenBSD, Solaris, HP-UX, AIX Has the code been vectorized or parallelized?: Yes RAM:>55 Mbytes Classification: 4, 9, 11.9, 14 Nature of pro blem: Storage, analysis and visualization of scientific data Solution method: Object store, wide range of analysis algorithms and visualization methods Additional comments: For an up-to-date author list see: http://root.cern.ch/drupal/content/root-develop ment-team and http://root.cern.ch/drupal/content/former-root-developers Running time: Depending on the data size and complexity of analysis algorithms References:[1]http://root.cern.ch.ROOT is an object-oriented C++ framework conceived in the high-energy physics (HEP) community, designed for storing and analyzing petabytes of data in an efficient way. Any instance of a C++ class can be stored into a ROOT file in a machine-independent compressed binary format. In ROOT the TTree object container is optimized for statistical data analysis over very large data sets by using vertical data storage techniques. These containers can span a large number of files on local disks, the web, or a number of different shared file systems. In order to analyze this data, the user can chose out of a wide set of mathematical and statistical functions, including linear algebra classes, numerical algorithms such as integration and minimization, and various methods for performing regression analysis (fitting). In particular, the RooFit package allows the user to perform complex data modeling and fitting while the RooStats library provides abstractions and implementations for advanced statistical tools. Multivariate classification methods based on machine learning techniques are available via the TMVA package. A central piece in these analysis tools are the histogram classes which provide binning of one- and multi-dimensional data. Results can be saved in high-quality graphical formats like Postscript and PDF or in bitmap formats like JPG or GIF. The result can also be stored into ROOT macros that allow a full recreation and rework of the graphics. Users typically create their analysis macros step by step, making use of the interactive C++ interpreter CINT, while running over small data samples. Once the development is finished, they can run these macros at full compiled speed over large data sets, using on-the-fly compilation, or by creating a stand-alone batch program. Finally, if processing farms are available, the user can reduce the execution time of intrinsically parallel tasks — e.g. data mining in HEP — by using PROOF, which will take care of optimally distributing the work over the available resources in a transparent way.ROOT is an object-oriented C++ framework conceived in the high-energy physics (HEP) community, designed for storing and analyzing petabytes of data in an efficient way. Any instance of a C++ class can be stored into a ROOT file in a machine-independent compressed binary format. In ROOT the TTree object container is optimized for statistical data analysis over very large data sets by using vertical data storage techniques. These containers can span a large number of files on local disks, the web, or a number of different shared file systems. In order to analyze this data, the user can chose out of a wide set of mathematical and statistical functions, including linear algebra classes, numerical algorithms such as integration and minimization, and various methods for performing regression analysis (fitting). In particular, ROOT offers packages for complex data modeling and fitting, as well as multivariate classification based on machine learning techniques. A central piece in these analysis tools are the histogram classes which provide binning of one- and multi-dimensional data. Results can be saved in high-quality graphical formats like Postscript and PDF or in bitmap formats like JPG or GIF. The result can also be stored into ROOT macros that allow a full recreation and rework of the graphics. Users typically create their analysis macros step by step, making use of the interactive C++ interpreter CINT, while running over small data samples. Once the development is finished, they can run these macros at full compiled speed over large data sets, using on-the-fly compilation, or by creating a stand-alone batch program. Finally, if processing farms are available, the user can reduce the execution time of intrinsically parallel tasks - e.g. data mining in HEP - by using PROOF, which will take care of optimally distributing the work over the available resources in a transparent way.arXiv:1508.07749FERMILAB-PUB-09-661-CDoai:cds.cern.ch:12620452015-08-31
spellingShingle Computing and Computers
Other Fields of Physics
Antcheva, I.
Ballintijn, M.
Bellenot, B.
Biskup, M.
Brun, R.
Buncic, N.
Canal, Ph.
Casadei, D.
Couet, O.
Fine, V.
Franco, L.
Ganis, G.
Gheata, A.
Gonzalez Maline, D.
Goto, M.
Iwaszkiewicz, J.
Kreshuk, A.
Marcos Segura, D.
Maunder, R.
Moneta, L.
Naumann, A.
Offermann, E.
Onuchin, V.
Panacek, S.
Rademakers, F.
Russo, P.
Tadel, M.
ROOT — A C++ framework for petabyte data storage, statistical analysis and visualization
title ROOT — A C++ framework for petabyte data storage, statistical analysis and visualization
title_full ROOT — A C++ framework for petabyte data storage, statistical analysis and visualization
title_fullStr ROOT — A C++ framework for petabyte data storage, statistical analysis and visualization
title_full_unstemmed ROOT — A C++ framework for petabyte data storage, statistical analysis and visualization
title_short ROOT — A C++ framework for petabyte data storage, statistical analysis and visualization
title_sort root — a c++ framework for petabyte data storage, statistical analysis and visualization
topic Computing and Computers
Other Fields of Physics
url https://dx.doi.org/10.1016/j.cpc.2009.08.005
http://cds.cern.ch/record/1262045
work_keys_str_mv AT antchevai rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT ballintijnm rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT bellenotb rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT biskupm rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT brunr rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT buncicn rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT canalph rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT casadeid rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT coueto rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT finev rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT francol rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT ganisg rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT gheataa rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT gonzalezmalined rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT gotom rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT iwaszkiewiczj rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT kreshuka rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT marcossegurad rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT maunderr rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT monetal rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT naumanna rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT offermanne rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT onuchinv rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT panaceks rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT rademakersf rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT russop rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization
AT tadelm rootacframeworkforpetabytedatastoragestatisticalanalysisandvisualization