Cargando…

Data management in EGEE

Data management is one of the cornerstones in the distributed production computing environment that the EGEE project aims to provide for a e-Science infrastructure. We have designed and implemented a set of services and client components, addressing the diverse requirements of all user communities....

Descripción completa

Detalles Bibliográficos
Autores principales: Frohner, A, Baud, J -P, Garcia Rioja, R M, Grosdidier, G, Mollon, R, Smith, D, Tedesco, P
Lenguaje:eng
Publicado: 2010
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/219/6/062012
http://cds.cern.ch/record/1270565
_version_ 1780920202168369152
author Frohner, A
Baud, J -P
Garcia Rioja, R M
Grosdidier, G
Mollon, R
Smith, D
Tedesco, P
author_facet Frohner, A
Baud, J -P
Garcia Rioja, R M
Grosdidier, G
Mollon, R
Smith, D
Tedesco, P
author_sort Frohner, A
collection CERN
description Data management is one of the cornerstones in the distributed production computing environment that the EGEE project aims to provide for a e-Science infrastructure. We have designed and implemented a set of services and client components, addressing the diverse requirements of all user communities. LHC experiments as main users will generate and distribute approximately 15 PB of data per year worldwide using this infrastructure. Another key user community, biomedical projects, have strict security requirements with less emphasis on the volume of data. We maintain three service groups for grid data management: The Disk Pool Manager (DPM) Storage Element (with more than 100 instances deployed world-wide), the LCG File Catalogue (LFC) and the File Transfer Service (FTS) which sustains an aggregated transfer rate of 1.5GB/sec. They are complemented by individual client components and also tools which help coordinating more complex uses cases with multiple services (GFAL-client, lcg util, eds-cli). In this paper we show how these services, keeping clean and standard interfaces among each other, can work together to cover the data flow and how they can be used as individual components to cover diverse requirements. We will also describe areas that we consider for further improvements, both for performance and functionality.
id cern-1270565
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2010
record_format invenio
spelling cern-12705652022-08-17T13:24:58Zdoi:10.1088/1742-6596/219/6/062012http://cds.cern.ch/record/1270565engFrohner, ABaud, J -PGarcia Rioja, R MGrosdidier, GMollon, RSmith, DTedesco, PData management in EGEEComputing and ComputersData management is one of the cornerstones in the distributed production computing environment that the EGEE project aims to provide for a e-Science infrastructure. We have designed and implemented a set of services and client components, addressing the diverse requirements of all user communities. LHC experiments as main users will generate and distribute approximately 15 PB of data per year worldwide using this infrastructure. Another key user community, biomedical projects, have strict security requirements with less emphasis on the volume of data. We maintain three service groups for grid data management: The Disk Pool Manager (DPM) Storage Element (with more than 100 instances deployed world-wide), the LCG File Catalogue (LFC) and the File Transfer Service (FTS) which sustains an aggregated transfer rate of 1.5GB/sec. They are complemented by individual client components and also tools which help coordinating more complex uses cases with multiple services (GFAL-client, lcg util, eds-cli). In this paper we show how these services, keeping clean and standard interfaces among each other, can work together to cover the data flow and how they can be used as individual components to cover diverse requirements. We will also describe areas that we consider for further improvements, both for performance and functionality.oai:cds.cern.ch:12705652010
spellingShingle Computing and Computers
Frohner, A
Baud, J -P
Garcia Rioja, R M
Grosdidier, G
Mollon, R
Smith, D
Tedesco, P
Data management in EGEE
title Data management in EGEE
title_full Data management in EGEE
title_fullStr Data management in EGEE
title_full_unstemmed Data management in EGEE
title_short Data management in EGEE
title_sort data management in egee
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/219/6/062012
http://cds.cern.ch/record/1270565
work_keys_str_mv AT frohnera datamanagementinegee
AT baudjp datamanagementinegee
AT garciariojarm datamanagementinegee
AT grosdidierg datamanagementinegee
AT mollonr datamanagementinegee
AT smithd datamanagementinegee
AT tedescop datamanagementinegee