ATLAS EventIndex general dataflow and monitoring infrastructure

The ATLAS EventIndex has been running in production since mid-2015, reliably collecting information worldwide about all produced events and storing it in a central Hadoop infrastructure at CERN. A subset of this information is copied to an Oracle relational database for fast dataset discovery, eve...

Bibliographic Details
Main authors: Fernandez Casani, Alvaro, Barberis, Dario, Favareto, Andrea, Garcia Montoro, Carlos, Gonzalez de la Hoz, Santiago, Hrivnac, Julius, Prokoshin, Fedor, Salt, Jose, Sanchez, Javier, Toebbicke, Rainer, Yuan, Ruijun
Language: eng
Published: 2017
Subjects: Particle Physics - Experiment
Online access: https://dx.doi.org/10.1088/1742-6596/898/6/062010
http://cds.cern.ch/record/2243484
_version_ 1780953318166626304
author Fernandez Casani, Alvaro
Barberis, Dario
Favareto, Andrea
Garcia Montoro, Carlos
Gonzalez de la Hoz, Santiago
Hrivnac, Julius
Prokoshin, Fedor
Salt, Jose
Sanchez, Javier
Toebbicke, Rainer
Yuan, Ruijun
author_facet Fernandez Casani, Alvaro
Barberis, Dario
Favareto, Andrea
Garcia Montoro, Carlos
Gonzalez de la Hoz, Santiago
Hrivnac, Julius
Prokoshin, Fedor
Salt, Jose
Sanchez, Javier
Toebbicke, Rainer
Yuan, Ruijun
author_sort Fernandez Casani, Alvaro
collection CERN
description The ATLAS EventIndex has been running in production since mid-2015, reliably collecting information worldwide about all produced events and storing it in a central Hadoop infrastructure at CERN. A subset of this information is copied to an Oracle relational database for fast dataset discovery, event picking, crosschecks with other ATLAS systems and checks for event duplication. The system design and its optimization serve event-picking requests ranging from a few events up to tens of thousands of events; in addition, data consistency checks are performed for large production campaigns. Detecting duplicate events within the scope of physics collections has recently arisen as an important use case. This paper describes the general architecture of the project, the data flow, and the operational issues that are addressed by recent developments to improve the throughput of the overall system. In this direction, the data collection system is reducing its usage of the messaging infrastructure to overcome the performance shortcomings detected during production peaks; an object storage approach is used instead to convey the event index information, while messages signal its location and status. Recent changes in the Producer/Consumer architecture are also presented in detail, as well as the monitoring infrastructure.
id cern-2243484
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling cern-2243484
2019-10-15T15:18:12Z
doi:10.1088/1742-6596/898/6/062010
http://cds.cern.ch/record/2243484
eng
Fernandez Casani, Alvaro
Barberis, Dario
Favareto, Andrea
Garcia Montoro, Carlos
Gonzalez de la Hoz, Santiago
Hrivnac, Julius
Prokoshin, Fedor
Salt, Jose
Sanchez, Javier
Toebbicke, Rainer
Yuan, Ruijun
ATLAS EventIndex general dataflow and monitoring infrastructure
Particle Physics - Experiment
The ATLAS EventIndex has been running in production since mid-2015, reliably collecting information worldwide about all produced events and storing it in a central Hadoop infrastructure at CERN. A subset of this information is copied to an Oracle relational database for fast dataset discovery, event picking, crosschecks with other ATLAS systems and checks for event duplication. The system design and its optimization serve event-picking requests ranging from a few events up to tens of thousands of events; in addition, data consistency checks are performed for large production campaigns. Detecting duplicate events within the scope of physics collections has recently arisen as an important use case. This paper describes the general architecture of the project, the data flow, and the operational issues that are addressed by recent developments to improve the throughput of the overall system. In this direction, the data collection system is reducing its usage of the messaging infrastructure to overcome the performance shortcomings detected during production peaks; an object storage approach is used instead to convey the event index information, while messages signal its location and status. Recent changes in the Producer/Consumer architecture are also presented in detail, as well as the monitoring infrastructure.
ATL-SOFT-PROC-2017-031
oai:cds.cern.ch:2243484
2017-01-31
spellingShingle Particle Physics - Experiment
Fernandez Casani, Alvaro
Barberis, Dario
Favareto, Andrea
Garcia Montoro, Carlos
Gonzalez de la Hoz, Santiago
Hrivnac, Julius
Prokoshin, Fedor
Salt, Jose
Sanchez, Javier
Toebbicke, Rainer
Yuan, Ruijun
ATLAS EventIndex general dataflow and monitoring infrastructure
title ATLAS EventIndex general dataflow and monitoring infrastructure
title_full ATLAS EventIndex general dataflow and monitoring infrastructure
title_fullStr ATLAS EventIndex general dataflow and monitoring infrastructure
title_full_unstemmed ATLAS EventIndex general dataflow and monitoring infrastructure
title_short ATLAS EventIndex general dataflow and monitoring infrastructure
title_sort atlas eventindex general dataflow and monitoring infrastructure
topic Particle Physics - Experiment
url https://dx.doi.org/10.1088/1742-6596/898/6/062010
http://cds.cern.ch/record/2243484
work_keys_str_mv AT fernandezcasanialvaro atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT barberisdario atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT favaretoandrea atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT garciamontorocarlos atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT gonzalezdelahozsantiago atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT hrivnacjulius atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT prokoshinfedor atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT saltjose atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT sanchezjavier atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT toebbickerainer atlaseventindexgeneraldataflowandmonitoringinfrastructure
AT yuanruijun atlaseventindexgeneraldataflowandmonitoringinfrastructure