
Distributed Data Collection for the ATLAS EventIndex

The ATLAS EventIndex contains records of all events processed by ATLAS, in all processing stages. These records include the references to the files containing each event (the GUID of the file) and the internal “pointer” to each event in the file. This information is collected by all jobs that run at...


Bibliographic Details
Main authors: Sánchez, Javier, Fernandez Casani, Alvaro, Gonzalez de la Hoz, Santiago
Language: eng
Published: 2015
Subjects: Particle Physics - Experiment
Online access: https://dx.doi.org/10.1088/1742-6596/664/4/042046
http://cds.cern.ch/record/2016348
_version_ 1780946696972271616
author Sánchez, Javier
Fernandez Casani, Alvaro
Gonzalez de la Hoz, Santiago
author_facet Sánchez, Javier
Fernandez Casani, Alvaro
Gonzalez de la Hoz, Santiago
author_sort Sánchez, Javier
collection CERN
description The ATLAS EventIndex contains records of all events processed by ATLAS, in all processing stages. These records include the references to the files containing each event (the GUID of the file) and the internal “pointer” to each event in the file. This information is collected by all jobs that run at Tier-0 or on the Grid and process ATLAS events. Each job produces a snippet of information for each permanent output file. This information is packed and transferred to a central broker at CERN using an ActiveMQ messaging system, and then is unpacked, sorted and reformatted in order to be stored and catalogued into a central Hadoop server. This contribution describes in detail the Producer/Consumer architecture to convey this information from the running jobs through the messaging system to the Hadoop server.
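The Producer/Consumer flow summarized in the description can be illustrated with a minimal sketch. This is not the authors' implementation: the standard-library `queue.Queue` stands in for the ActiveMQ broker, a sorted in-memory list stands in for the Hadoop store, and the snippet layout (`guid`, `events`, `event`, `pointer`) as well as the helper names `make_snippet`, `pack`, `unpack` and `consume` are hypothetical, chosen only to mirror the steps named in the abstract (produce one snippet per output file, pack, transfer, unpack, sort, reformat).

```python
import json
import queue
import zlib


def make_snippet(guid, pointers):
    """Producer side: build one snippet per permanent output file.

    The field names are assumptions for illustration only.
    """
    return {
        "guid": guid,
        "events": [{"event": e, "pointer": p} for e, p in pointers],
    }


def pack(snippet):
    """Serialize and compress a snippet before sending it to the broker."""
    return zlib.compress(json.dumps(snippet).encode("utf-8"))


def unpack(message):
    """Reverse of pack(): decompress and deserialize a broker message."""
    return json.loads(zlib.decompress(message).decode("utf-8"))


def consume(broker):
    """Consumer side: drain the queue, unpack each message, reformat the
    content into flat (event, guid, pointer) records and sort them."""
    records = []
    while True:
        try:
            message = broker.get_nowait()
        except queue.Empty:
            break
        snippet = unpack(message)
        for ev in snippet["events"]:
            records.append((ev["event"], snippet["guid"], ev["pointer"]))
    return sorted(records)


# Stand-in for the central broker; a real deployment would use a messaging client.
broker = queue.Queue()

# Two hypothetical jobs, each sending one snippet for one output file.
broker.put(pack(make_snippet("GUID-B", [(7, 0), (3, 1)])))
broker.put(pack(make_snippet("GUID-A", [(5, 0)])))

records = consume(broker)
for record in records:
    print(record)
```

In this sketch the producers never talk to the store directly; they only hand packed messages to the broker, which is the decoupling a Producer/Consumer design is meant to provide.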
id cern-2016348
institution European Organization for Nuclear Research
language eng
publishDate 2015
record_format invenio
spelling cern-2016348 2022-08-10T12:54:40Z doi:10.1088/1742-6596/664/4/042046 http://cds.cern.ch/record/2016348 eng Sánchez, Javier Fernandez Casani, Alvaro Gonzalez de la Hoz, Santiago Distributed Data Collection for the ATLAS EventIndex Particle Physics - Experiment The ATLAS EventIndex contains records of all events processed by ATLAS, in all processing stages. These records include the references to the files containing each event (the GUID of the file) and the internal “pointer” to each event in the file. This information is collected by all jobs that run at Tier-0 or on the Grid and process ATLAS events. Each job produces a snippet of information for each permanent output file. This information is packed and transferred to a central broker at CERN using an ActiveMQ messaging system, and then is unpacked, sorted and reformatted in order to be stored and catalogued into a central Hadoop server. This contribution describes in detail the Producer/Consumer architecture to convey this information from the running jobs through the messaging system to the Hadoop server. ATL-SOFT-PROC-2015-031 oai:cds.cern.ch:2016348 2015-05-14
spellingShingle Particle Physics - Experiment
Sánchez, Javier
Fernandez Casani, Alvaro
Gonzalez de la Hoz, Santiago
Distributed Data Collection for the ATLAS EventIndex
title Distributed Data Collection for the ATLAS EventIndex
title_full Distributed Data Collection for the ATLAS EventIndex
title_fullStr Distributed Data Collection for the ATLAS EventIndex
title_full_unstemmed Distributed Data Collection for the ATLAS EventIndex
title_short Distributed Data Collection for the ATLAS EventIndex
title_sort distributed data collection for the atlas eventindex
topic Particle Physics - Experiment
url https://dx.doi.org/10.1088/1742-6596/664/4/042046
http://cds.cern.ch/record/2016348
work_keys_str_mv AT sanchezjavier distributeddatacollectionfortheatlaseventindex
AT fernandezcasanialvaro distributeddatacollectionfortheatlaseventindex
AT gonzalezdelahozsantiago distributeddatacollectionfortheatlaseventindex