
ATLAS Data Management Accounting with Hadoop Pig and HBase


Bibliographic Details
Main authors: Lassnig, M; Garonne, V; Dimitrov, G; Canali, L
Language: eng
Published: 2012
Subjects: Detectors and Experimental Techniques
Online access: https://dx.doi.org/10.1088/1742-6596/396/5/052044
http://cds.cern.ch/record/1456853
_version_ 1780925101649166336
author Lassnig, M
Garonne, V
Dimitrov, G
Canali, L
author_facet Lassnig, M
Garonne, V
Dimitrov, G
Canali, L
author_sort Lassnig, M
collection CERN
description The ATLAS Distributed Data Management system requires accounting of its contents at the metadata layer. This is a hard problem because of the large scale of the system, the high dimensionality of its attributes, and the high rate of concurrent data modifications. The system must efficiently account for more than 90 PB of disk and tape storage holding upwards of 500 million files across 100 sites globally. This work presents a generic accounting system that scales to the requirements of ATLAS. The design and architecture are presented, and the implementation is discussed. Emphasis is placed on design choices that keep the underlying data models generally applicable to different kinds of accounting, reporting, and monitoring.
id cern-1456853
institution European Organization for Nuclear Research
language eng
publishDate 2012
record_format invenio
spelling cern-1456853 | 2019-09-30T06:29:59Z | doi:10.1088/1742-6596/396/5/052044 | http://cds.cern.ch/record/1456853 | eng | Lassnig, M | Garonne, V | Dimitrov, G | Canali, L | ATLAS Data Management Accounting with Hadoop Pig and HBase | Detectors and Experimental Techniques | The ATLAS Distributed Data Management system requires accounting of its contents at the metadata layer. This is a hard problem because of the large scale of the system, the high dimensionality of its attributes, and the high rate of concurrent data modifications. The system must efficiently account for more than 90 PB of disk and tape storage holding upwards of 500 million files across 100 sites globally. This work presents a generic accounting system that scales to the requirements of ATLAS. The design and architecture are presented, and the implementation is discussed. Emphasis is placed on design choices that keep the underlying data models generally applicable to different kinds of accounting, reporting, and monitoring. | ATL-SOFT-PROC-2012-059 | oai:cds.cern.ch:1456853 | 2012-06-20
spellingShingle Detectors and Experimental Techniques
Lassnig, M
Garonne, V
Dimitrov, G
Canali, L
ATLAS Data Management Accounting with Hadoop Pig and HBase
title ATLAS Data Management Accounting with Hadoop Pig and HBase
title_full ATLAS Data Management Accounting with Hadoop Pig and HBase
title_fullStr ATLAS Data Management Accounting with Hadoop Pig and HBase
title_full_unstemmed ATLAS Data Management Accounting with Hadoop Pig and HBase
title_short ATLAS Data Management Accounting with Hadoop Pig and HBase
title_sort atlas data management accounting with hadoop pig and hbase
topic Detectors and Experimental Techniques
url https://dx.doi.org/10.1088/1742-6596/396/5/052044
http://cds.cern.ch/record/1456853
work_keys_str_mv AT lassnigm atlasdatamanagementaccountingwithhadooppigandhbase
AT garonnev atlasdatamanagementaccountingwithhadooppigandhbase
AT dimitrovg atlasdatamanagementaccountingwithhadooppigandhbase
AT canalil atlasdatamanagementaccountingwithhadooppigandhbase