
Federating LHCb datasets using the DIRAC File catalog


Bibliographic Details
Main authors: Haen, Christophe; Charpentier, Philippe; Frank, Markus; Tsaregorodtsev, Andrei
Language: eng
Published: 2015
Subjects:
Online access: http://cds.cern.ch/record/2011551
_version_ 1780946559464112128
author Haen, Christophe
Charpentier, Philippe
Frank, Markus
Tsaregorodtsev, Andrei
author_sort Haen, Christophe
collection CERN
description In the distributed computing model of LHCb, the File Catalog (FC) is a central component that keeps track of each file and replica stored on the Grid. It federates the LHCb data files in a logical namespace used by all LHCb applications. As a replica catalog, it is used for brokering jobs to sites where their input data is expected to be present, and by the jobs themselves for finding alternative replicas if necessary. The LCG File Catalog (LFC), used originally by LHCb and other experiments, is now being retired and needs to be replaced. The DIRAC File Catalog (DFC) was developed within the framework of the DIRAC Project and presented at CHEP 2012. From a technical point of view, the code powering the DFC follows an aspect-oriented programming (AOP) approach: each type of entity manipulated by the DFC (Users, Files, Replicas, etc.) is treated as a separate 'concern' in AOP terminology. Hence, the database schema can also be adapted to the needs of a Virtual Organization. LHCb opted for a highly tuned MySQL database, with optimized requests and stored procedures. This paper presents the improvements made to the DFC since CHEP 2012, its performance with respect to the LFC, and the procedure used to migrate the LHCb data from the LFC to the DFC. Finally, it shows how a combination of the DFC and the LHCb framework Gaudi allows LHCb to build a data federation at low cost. Keywords: DIRAC, LHCbDIRAC, DFC, FileCatalog, Gaudi, DataFederation
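The two roles the description gives to a replica catalog (brokering jobs to sites that hold their input data, and letting jobs find alternative replicas) can be illustrated with a minimal in-memory sketch. This is not the DIRAC File Catalog API: the class `ReplicaCatalog`, its method names, the site names, and the logical file names are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


class ReplicaCatalog:
    """Toy replica catalog (hypothetical, not the DFC): maps logical file names to sites."""

    def __init__(self):
        # logical file name (LFN) -> sites holding a replica, in registration order
        self._replicas = defaultdict(list)

    def add_replica(self, lfn, site):
        """Register a replica of `lfn` at `site`, ignoring duplicates."""
        if site not in self._replicas[lfn]:
            self._replicas[lfn].append(site)

    def get_replicas(self, lfn):
        """Return the sites holding `lfn`; unknown files yield an empty list."""
        return list(self._replicas.get(lfn, []))

    def candidate_sites(self, lfns):
        """Sites holding a replica of every input file: where a job could be brokered."""
        site_sets = [set(self.get_replicas(lfn)) for lfn in lfns]
        return sorted(set.intersection(*site_sets)) if site_sets else []

    def alternative_replicas(self, lfn, failed_site):
        """Sites a job can fall back to when the replica at `failed_site` is unusable."""
        return [s for s in self.get_replicas(lfn) if s != failed_site]


# Hypothetical file names and sites, for illustration only.
catalog = ReplicaCatalog()
catalog.add_replica("/lhcb/example/file_a.dst", "SITE_A")
catalog.add_replica("/lhcb/example/file_a.dst", "SITE_B")
catalog.add_replica("/lhcb/example/file_b.dst", "SITE_A")
catalog.add_replica("/lhcb/example/file_b.dst", "SITE_C")

inputs = ["/lhcb/example/file_a.dst", "/lhcb/example/file_b.dst"]
print(catalog.candidate_sites(inputs))
print(catalog.alternative_replicas("/lhcb/example/file_a.dst", "SITE_A"))
```

Keeping the logical name as the only key is what makes the namespace a federation in this sketch: applications refer to a file by its logical name, and the catalog decides which physical copy is used.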
id cern-2011551
institution European Organization for Nuclear Research (CERN)
language eng
publishDate 2015
record_format invenio
spelling cern-2011551 2019-09-30T06:29:59Z http://cds.cern.ch/record/2011551 eng Haen, Christophe; Charpentier, Philippe; Frank, Markus; Tsaregorodtsev, Andrei. Federating LHCb datasets using the DIRAC File catalog. Talk. LHCb-TALK-2015-062 oai:cds.cern.ch:2011551 2015
title Federating LHCb datasets using the DIRAC File catalog
topic Talk
url http://cds.cern.ch/record/2011551