Cargando…

Data grids: a new computational infrastructure for data-intensive science

Twenty-first-century scientific and engineering enterprises are increasingly characterized by their geographic dispersion and their reliance on large data archives. These characteristics bring with them unique challenges. First, the increasing size and complexity of modern data collections require s...

Descripción completa

Detalles Bibliográficos
Autor principal: Avery, P
Lenguaje:eng
Publicado: 2002
Materias:
Acceso en línea:https://dx.doi.org/10.1098/rsta.2002.0988
http://cds.cern.ch/record/590855
Descripción
Sumario:Twenty-first-century scientific and engineering enterprises are increasingly characterized by their geographic dispersion and their reliance on large data archives. These characteristics bring with them unique challenges. First, the increasing size and complexity of modern data collections require significant investments in information technologies to store, retrieve and analyse them. Second, the increased distribution of people and resources in these projects has made resource sharing and collaboration across significant geographic and organizational boundaries critical to their success. In this paper I explore how computing infrastructures based on data grids offer data-intensive enterprises a comprehensive, scalable framework for collaboration and resource sharing. A detailed example of a data grid framework is presented for a Large Hadron Collider experiment, where a hierarchical set of laboratory and university resources comprising petaflops of processing power and a multi- petabyte data archive must be efficiently used by a global collaboration. (14 refs).