Cargando…

Design optimization of the Grid data analysis workflow in CMS

The chain of the typical CMS analysis workflow execution starts once configured and submitted by the physicist and ends when the outputs become available in the physicist storage area. During the execution, the workflow interacts with CMS computing infrastructure and services. These workflows have s...

Descripción completa

Detalles Bibliográficos
Autor principal: Riahi, Hassen
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:http://cds.cern.ch/record/2635940
_version_ 1780959889247436800
author Riahi, Hassen
author_facet Riahi, Hassen
author_sort Riahi, Hassen
collection CERN
description The chain of the typical CMS analysis workflow execution starts once configured and submitted by the physicist and ends when the outputs become available in the physicist storage area. During the execution, the workflow interacts with CMS computing infrastructure and services. These workflows have shown several bottlenecks through the years introducing delays in the execution of the end-user analysis and consequently differing the availability of physics results to the collaboration and squandering often the distributed resources. This thesis focuses on the study aiming to optimize the design of data analysis workflow of CMS, executing over LHC distributed computing infrastructure. Automation tool and an asynchronous stage-out system have been designed and implemented to reach this goal. The performance of the new tools in production environment are also shown.
id oai-inspirehep.net-1653855
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling oai-inspirehep.net-16538552019-09-30T06:29:59Zhttp://cds.cern.ch/record/2635940engRiahi, HassenDesign optimization of the Grid data analysis workflow in CMSComputing and ComputersThe chain of the typical CMS analysis workflow execution starts once configured and submitted by the physicist and ends when the outputs become available in the physicist storage area. During the execution, the workflow interacts with CMS computing infrastructure and services. These workflows have shown several bottlenecks through the years introducing delays in the execution of the end-user analysis and consequently differing the availability of physics results to the collaboration and squandering often the distributed resources. This thesis focuses on the study aiming to optimize the design of data analysis workflow of CMS, executing over LHC distributed computing infrastructure. Automation tool and an asynchronous stage-out system have been designed and implemented to reach this goal. The performance of the new tools in production environment are also shown.CERN-THESIS-2012-495oai:inspirehep.net:16538552012
spellingShingle Computing and Computers
Riahi, Hassen
Design optimization of the Grid data analysis workflow in CMS
title Design optimization of the Grid data analysis workflow in CMS
title_full Design optimization of the Grid data analysis workflow in CMS
title_fullStr Design optimization of the Grid data analysis workflow in CMS
title_full_unstemmed Design optimization of the Grid data analysis workflow in CMS
title_short Design optimization of the Grid data analysis workflow in CMS
title_sort design optimization of the grid data analysis workflow in cms
topic Computing and Computers
url http://cds.cern.ch/record/2635940
work_keys_str_mv AT riahihassen designoptimizationofthegriddataanalysisworkflowincms