Cargando…

The ATLAS EventIndex and its evolution towards Run 3

The ATLAS experiment produced so far hundreds of petabytes of data and expects to have one order of magnitude more in the future. This data are spread among hundreds of computing Grid sites around the world. The EventIndex is the complete catalogue of all ATLAS events, real and simulated, keeping th...

Descripción completa

Detalles Bibliográficos
Autores principales: Villaplana Perez, Miguel, Alexandrov, Evgeny, Aleksandrov, Igor, Baranowski, Zbigniew, Barberis, Dario, Dimitrov, Gancho, Fernandez Casani, Alvaro, Gallas, Elizabeth, Garcia Montoro, Carlos, Gonzalez de la Hoz, Santiago, Hrivnac, Julius, Iakovlev, Alexander, Kazymov, Andrei, Mineev, Mikhail, Prokoshin, Fedor, Rybkin, Grigori, Sánchez, Javier, Salt, José, Vasileva, Petya Tsvetanova
Lenguaje:eng
Publicado: 2019
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/1525/1/012056
http://cds.cern.ch/record/2675026
Descripción
Sumario:The ATLAS experiment produced so far hundreds of petabytes of data and expects to have one order of magnitude more in the future. This data are spread among hundreds of computing Grid sites around the world. The EventIndex is the complete catalogue of all ATLAS events, real and simulated, keeping the references to all permanent files that contain a given event in any processing stage. It provides the means to select and access event data in the ATLAS distributed storage system, and provides support for completeness and consistency checks and trigger and offline selection overlap studies. The EventIndex employs various data handling technologies like Hadoop and Oracle databases, and is integrated with other systems of the ATLAS distributed computing infrastructure, including those for data, metadata, and production management. The project is in operation since the start of LHC Run 2 in 2015, and is in permanent development in order to fit the production and analysis demands and follow technology evolutions. The main data store in Hadoop, based on MapFiles and HBase, has worked well during Run 2 but new solutions are explored for the future. This paper reports on the current system performance and on the studies of a new data storage prototype that can carry the EventIndex through Run 3.