
Technologies for Large Data Management in Scientific Computing

In recent years, intense usage of computing has been the main strategy of investigations in several scientific research projects. The progress in computing technology has opened unprecedented opportunities for systematic collection of experimental data and the associated analysis that were considere...


Bibliographic Details
Main author: Pace, A
Language: eng
Published: 2013
Subjects: Computing and Computers
Online access: https://dx.doi.org/10.1142/S0129183114300012
http://cds.cern.ch/record/1630387
_version_ 1780934197378023424
author Pace, A
author_facet Pace, A
author_sort Pace, A
collection CERN
description In recent years, intense usage of computing has been the main strategy of investigation in several scientific research projects. The progress in computing technology has opened unprecedented opportunities for the systematic collection of experimental data and the associated analysis that were considered impossible only a few years ago. This paper focuses on the strategies in use: it reviews the various components that are necessary for an effective solution that ensures the storage, the long-term preservation, and the worldwide distribution of the large quantities of data that are necessary in a large scientific research project. The paper also mentions several examples of data management solutions used in High Energy Physics for the CERN Large Hadron Collider (LHC) experiments in Geneva, Switzerland, which generate more than 30,000 terabytes of data every year that need to be preserved, analyzed, and made available to a community of several tens of thousands of scientists worldwide.
id cern-1630387
institution European Organization for Nuclear Research
language eng
publishDate 2013
record_format invenio
spelling cern-1630387
2022-08-10T20:54:59Z
doi:10.1142/S0129183114300012
http://cds.cern.ch/record/1630387
eng
Pace, A
Technologies for Large Data Management in Scientific Computing
Computing and Computers
In recent years, intense usage of computing has been the main strategy of investigation in several scientific research projects. The progress in computing technology has opened unprecedented opportunities for the systematic collection of experimental data and the associated analysis that were considered impossible only a few years ago. This paper focuses on the strategies in use: it reviews the various components that are necessary for an effective solution that ensures the storage, the long-term preservation, and the worldwide distribution of the large quantities of data that are necessary in a large scientific research project. The paper also mentions several examples of data management solutions used in High Energy Physics for the CERN Large Hadron Collider (LHC) experiments in Geneva, Switzerland, which generate more than 30,000 terabytes of data every year that need to be preserved, analyzed, and made available to a community of several tens of thousands of scientists worldwide.
CERN-IT-2013-005
oai:cds.cern.ch:1630387
2013-09-30
spellingShingle Computing and Computers
Pace, A
Technologies for Large Data Management in Scientific Computing
title Technologies for Large Data Management in Scientific Computing
title_full Technologies for Large Data Management in Scientific Computing
title_fullStr Technologies for Large Data Management in Scientific Computing
title_full_unstemmed Technologies for Large Data Management in Scientific Computing
title_short Technologies for Large Data Management in Scientific Computing
title_sort technologies for large data management in scientific computing
topic Computing and Computers
url https://dx.doi.org/10.1142/S0129183114300012
http://cds.cern.ch/record/1630387
work_keys_str_mv AT pacea technologiesforlargedatamanagementinscientificcomputing