Cargando…

Grid collector: an event catalog with automated file management

High Energy Nuclear Physics (HENP) experiments such as STAR at BNL and ATLAS at CERN produce large amounts of data that are stored as files on mass storage systems in computer centers. In these files, the basic unit of data is an event. Analysis is typically performed on a selected set of events. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Ke Sheng Wu, Wei Ming Zlang, Sim, A, Jun Min Gu, Shoshani, A
Lenguaje:eng
Publicado: 2004
Materias:
Acceso en línea:http://cds.cern.ch/record/818322
_version_ 1780905466996457472
author Ke Sheng Wu
Wei Ming Zlang
Sim, A
Jun Min Gu
Shoshani, A
author_facet Ke Sheng Wu
Wei Ming Zlang
Sim, A
Jun Min Gu
Shoshani, A
author_sort Ke Sheng Wu
collection CERN
description High Energy Nuclear Physics (HENP) experiments such as STAR at BNL and ATLAS at CERN produce large amounts of data that are stored as files on mass storage systems in computer centers. In these files, the basic unit of data is an event. Analysis is typically performed on a selected set of events. The files containing these events have to be located, copied from mass storage systems to disks before analysis, and removed when no longer needed. These file management tasks are tedious and time consuming. Typically, all events contained in the files are read into memory before a selection is made. Since the time to read the events dominate the overall execution time, reading the unwanted event needlessly increases the analysis time. The Grid Collector is a set of software modules that works together to address these two issues. It automates the file management tasks and provides "direct" access to the selected events for analyses. It is currently integrated with the STAR analysis framework. The users can select events based on tags, such as, "production date between March 10 and 20, and the number of charged tracks > 100:" The Grid Collector locates the files containing relevant events, transfers the files across the Grid if necessary, and delivers the events to the analysis code through the familiar iterators. There has been some research efforts to address the file management issues, the Grid Collector is unique in that it addresses the event access issue together with the file management issues. This makes it more useful to a large varieties of users. (22 refs).
id cern-818322
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2004
record_format invenio
spelling cern-8183222019-09-30T06:29:59Zhttp://cds.cern.ch/record/818322engKe Sheng WuWei Ming ZlangSim, AJun Min GuShoshani, AGrid collector: an event catalog with automated file managementComputing and ComputersHigh Energy Nuclear Physics (HENP) experiments such as STAR at BNL and ATLAS at CERN produce large amounts of data that are stored as files on mass storage systems in computer centers. In these files, the basic unit of data is an event. Analysis is typically performed on a selected set of events. The files containing these events have to be located, copied from mass storage systems to disks before analysis, and removed when no longer needed. These file management tasks are tedious and time consuming. Typically, all events contained in the files are read into memory before a selection is made. Since the time to read the events dominate the overall execution time, reading the unwanted event needlessly increases the analysis time. The Grid Collector is a set of software modules that works together to address these two issues. It automates the file management tasks and provides "direct" access to the selected events for analyses. It is currently integrated with the STAR analysis framework. The users can select events based on tags, such as, "production date between March 10 and 20, and the number of charged tracks > 100:" The Grid Collector locates the files containing relevant events, transfers the files across the Grid if necessary, and delivers the events to the analysis code through the familiar iterators. There has been some research efforts to address the file management issues, the Grid Collector is unique in that it addresses the event access issue together with the file management issues. This makes it more useful to a large varieties of users. (22 refs).oai:cds.cern.ch:8183222004
spellingShingle Computing and Computers
Ke Sheng Wu
Wei Ming Zlang
Sim, A
Jun Min Gu
Shoshani, A
Grid collector: an event catalog with automated file management
title Grid collector: an event catalog with automated file management
title_full Grid collector: an event catalog with automated file management
title_fullStr Grid collector: an event catalog with automated file management
title_full_unstemmed Grid collector: an event catalog with automated file management
title_short Grid collector: an event catalog with automated file management
title_sort grid collector: an event catalog with automated file management
topic Computing and Computers
url http://cds.cern.ch/record/818322
work_keys_str_mv AT keshengwu gridcollectoraneventcatalogwithautomatedfilemanagement
AT weimingzlang gridcollectoraneventcatalogwithautomatedfilemanagement
AT sima gridcollectoraneventcatalogwithautomatedfilemanagement
AT junmingu gridcollectoraneventcatalogwithautomatedfilemanagement
AT shoshania gridcollectoraneventcatalogwithautomatedfilemanagement