Cargando…
Storage Element performance optimization for CMS analysis jobs
Tier-2 computing sites in the Worldwide Large Hadron Collider Computing Grid (WLCG) host CPU-resources (Compute Element, CE) and storage resources (Storage Element, SE). The vast amount of data that needs to processed from the Large Hadron Collider (LHC) experiments requires good and efficient use o...
Autores principales: | , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2012
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/396/4/042037 http://cds.cern.ch/record/1457866 |
Sumario: | Tier-2 computing sites in the Worldwide Large Hadron Collider Computing
Grid (WLCG) host CPU-resources (Compute Element, CE) and storage
resources (Storage Element, SE). The vast amount of data that needs to
processed from the Large Hadron Collider (LHC) experiments requires good
and efficient use of the available resources. Having a good CPU
efficiency for the end users analysis jobs requires that the performance
of the storage system is able to scale with I/O requests from hundreds
or even thousands of simultaneous jobs.
In this presentation we report on the work on improving the SE
performance at the Helsinki Institute of Physics (HIP) Tier-2 used for
the Compact Muon Experiment (CMS) at the LHC. Statistics from CMS grid
jobs are collected and stored in the CMS Dashboard for further analysis,
which allows for easy performance monitoring by the sites and by the CMS
collaboration. As part of the monitoring framework CMS uses the JobRobot
which sends every four hours 100 analysis jobs to each site. CMS also
uses the HammerCloud (HC) tool for site monitoring and stress testing
and HC has replaced the JobRobot. The performance of the
analysis workflow submitted with JobRobot or HC can be used to track the
performance due to site configuration changes, since the analysis
workflow is kept the same for all sites and for months in time. The CPU
efficiency of the JobRobot jobs at HIP was increased approximately by 50
% to more than 90 %, by tuning the SE and by improvements in the CMSSW
and dCache software. The performance of the CMS analysis jobs improved
significantly too. Similar work has been done on other CMS Tier-sites,
since on average the CPU efficiency for CMSSW jobs has increased during
2011. Better monitoring of the SE allows faster detection of problems,
so that the performance level can be kept high. The next storage upgrade
at HIP will consist of SAS disk enclosures which can be stress tested on
demand with HC workflows, to make sure that the I/O-performance is good. |
---|