Cargando…

Popularity framework to process dataset tracers and its application on dynamic replica reduction in the ATLAS experiment

The ATLAS experiment's data management system is constantly tracing file movement operations that occur on the Worldwide LHC Computing Grid (WLCG). Due to the large scale of the WLCG, direct statistical analysis of the traces is impossible in real-time. Factors that contribute to the scalabilit...

Descripción completa

Detalles Bibliográficos
Autores principales: Molfetas, A, Megino, F, Tykhonov, A, Garonne, V, Campana, S, Lassnig, M, Barisits, M, Dimitrov, G, Viegas, F
Lenguaje:eng
Publicado: 2011
Materias:
Acceso en línea:http://cds.cern.ch/record/1359245
_version_ 1780922611347226624
author Molfetas, A
Megino, F
Tykhonov, A
Garonne, V
Campana, S
Lassnig, M
Barisits, M
Dimitrov, G
Viegas, F
author_facet Molfetas, A
Megino, F
Tykhonov, A
Garonne, V
Campana, S
Lassnig, M
Barisits, M
Dimitrov, G
Viegas, F
author_sort Molfetas, A
collection CERN
description The ATLAS experiment's data management system is constantly tracing file movement operations that occur on the Worldwide LHC Computing Grid (WLCG). Due to the large scale of the WLCG, direct statistical analysis of the traces is impossible in real-time. Factors that contribute to the scalability problems include the capability for users to initiatiate on-demand queries, high dimensionality of tracer entries combined with very low cardinality parameters, the large size of the namespace as well as rapid rate of file transactions occuring on the Grid. These scalability issues are alleviated through the adoption of an incremental model that aggregates data for all combinations occurring in selected tracer fields on a daily basis. Using this model it is possible to query on-demand relevant statistics about system usage. We present an implementation of this popularity model in the experiment's distributed data management system, DQ2, and describe a direct application example of the popularity framework, an automated cleaning system, which uses the statistics to dynamically detect and reduce unpopular replicas from grid sites. This paper describes the architecture employed by the cleaning system and reports on the results collected from a prototype during the first months of the ATLAS detector operation.
id cern-1359245
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2011
record_format invenio
spelling cern-13592452019-09-30T06:29:59Zhttp://cds.cern.ch/record/1359245engMolfetas, AMegino, FTykhonov, AGaronne, VCampana, SLassnig, MBarisits, MDimitrov, GViegas, FPopularity framework to process dataset tracers and its application on dynamic replica reduction in the ATLAS experimentDetectors and Experimental TechniquesThe ATLAS experiment's data management system is constantly tracing file movement operations that occur on the Worldwide LHC Computing Grid (WLCG). Due to the large scale of the WLCG, direct statistical analysis of the traces is impossible in real-time. Factors that contribute to the scalability problems include the capability for users to initiatiate on-demand queries, high dimensionality of tracer entries combined with very low cardinality parameters, the large size of the namespace as well as rapid rate of file transactions occuring on the Grid. These scalability issues are alleviated through the adoption of an incremental model that aggregates data for all combinations occurring in selected tracer fields on a daily basis. Using this model it is possible to query on-demand relevant statistics about system usage. We present an implementation of this popularity model in the experiment's distributed data management system, DQ2, and describe a direct application example of the popularity framework, an automated cleaning system, which uses the statistics to dynamically detect and reduce unpopular replicas from grid sites. This paper describes the architecture employed by the cleaning system and reports on the results collected from a prototype during the first months of the ATLAS detector operation.ATL-SOFT-PROC-2011-041oai:cds.cern.ch:13592452011-06-16
spellingShingle Detectors and Experimental Techniques
Molfetas, A
Megino, F
Tykhonov, A
Garonne, V
Campana, S
Lassnig, M
Barisits, M
Dimitrov, G
Viegas, F
Popularity framework to process dataset tracers and its application on dynamic replica reduction in the ATLAS experiment
title Popularity framework to process dataset tracers and its application on dynamic replica reduction in the ATLAS experiment
title_full Popularity framework to process dataset tracers and its application on dynamic replica reduction in the ATLAS experiment
title_fullStr Popularity framework to process dataset tracers and its application on dynamic replica reduction in the ATLAS experiment
title_full_unstemmed Popularity framework to process dataset tracers and its application on dynamic replica reduction in the ATLAS experiment
title_short Popularity framework to process dataset tracers and its application on dynamic replica reduction in the ATLAS experiment
title_sort popularity framework to process dataset tracers and its application on dynamic replica reduction in the atlas experiment
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1359245
work_keys_str_mv AT molfetasa popularityframeworktoprocessdatasettracersanditsapplicationondynamicreplicareductionintheatlasexperiment
AT meginof popularityframeworktoprocessdatasettracersanditsapplicationondynamicreplicareductionintheatlasexperiment
AT tykhonova popularityframeworktoprocessdatasettracersanditsapplicationondynamicreplicareductionintheatlasexperiment
AT garonnev popularityframeworktoprocessdatasettracersanditsapplicationondynamicreplicareductionintheatlasexperiment
AT campanas popularityframeworktoprocessdatasettracersanditsapplicationondynamicreplicareductionintheatlasexperiment
AT lassnigm popularityframeworktoprocessdatasettracersanditsapplicationondynamicreplicareductionintheatlasexperiment
AT barisitsm popularityframeworktoprocessdatasettracersanditsapplicationondynamicreplicareductionintheatlasexperiment
AT dimitrovg popularityframeworktoprocessdatasettracersanditsapplicationondynamicreplicareductionintheatlasexperiment
AT viegasf popularityframeworktoprocessdatasettracersanditsapplicationondynamicreplicareductionintheatlasexperiment