Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities during the first year of LHC running
The CMS experiment has adopted a computing system where resources are distributed worldwide in more than 100 sites. The operation of the system requires a stable and reliable behavior of the underlying infrastructure. CMS has established procedures to extensively test all relevant aspects of a site...
Main authors: | Flix, Jose; Hernandez, Jose M; Sciaba, Andrea |
---|---|
Language: | eng |
Published: | 2011 |
Subjects: | Detectors and Experimental Techniques |
Online access: | http://cds.cern.ch/record/1326925 |
_version_ | 1780921667907747840 |
---|---|
author | Flix, Jose; Hernandez, Jose M; Sciaba, Andrea |
author_facet | Flix, Jose; Hernandez, Jose M; Sciaba, Andrea |
author_sort | Flix, Jose |
collection | CERN |
description | The CMS experiment has adopted a computing system where resources are distributed worldwide in more than 100 sites. The operation of the system requires a stable and reliable behavior of the underlying infrastructure. CMS has established procedures to extensively test all relevant aspects of a site and their capability to sustain the various CMS computing workflows at the required scale. The Site Readiness monitoring infrastructure has been instrumental in understanding how the system as a whole was improving towards LHC operations, measuring the reliability of sites when running CMS activities, and providing sites with the information they need to solve eventual problems. This paper reviews the complete automation of the Site Readiness program, with the description of monitoring tools, the impact in improving the overall reliability of the Grid from the point of view of the CMS computing system, as well as the resource utilization and performance seen at the sites during the first year of LHC running. |
id | cern-1326925 |
institution | European Organization for Nuclear Research (CERN) |
language | eng |
publishDate | 2011 |
record_format | invenio |
spelling | cern-1326925; 2019-09-30T06:29:59Z; http://cds.cern.ch/record/1326925; eng; Flix, Jose; Hernandez, Jose M; Sciaba, Andrea; Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities during the first year of LHC running; Detectors and Experimental Techniques; The CMS experiment has adopted a computing system where resources are distributed worldwide in more than 100 sites. The operation of the system requires a stable and reliable behavior of the underlying infrastructure. CMS has established procedures to extensively test all relevant aspects of a site and their capability to sustain the various CMS computing workflows at the required scale. The Site Readiness monitoring infrastructure has been instrumental in understanding how the system as a whole was improving towards LHC operations, measuring the reliability of sites when running CMS activities, and providing sites with the information they need to solve eventual problems. This paper reviews the complete automation of the Site Readiness program, with the description of monitoring tools, the impact in improving the overall reliability of the Grid from the point of view of the CMS computing system, as well as the resource utilization and performance seen at the sites during the first year of LHC running.; CMS-CR-2011-024; oai:cds.cern.ch:1326925; 2011-01-14 |
spellingShingle | Detectors and Experimental Techniques; Flix, Jose; Hernandez, Jose M; Sciaba, Andrea; Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities during the first year of LHC running |
title | Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities during the first year of LHC running |
title_full | Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities during the first year of LHC running |
title_fullStr | Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities during the first year of LHC running |
title_full_unstemmed | Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities during the first year of LHC running |
title_short | Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities during the first year of LHC running |
title_sort | monitoring the readiness and utilization of the distributed cms computing facilities during the first year of lhc running |
topic | Detectors and Experimental Techniques |
url | http://cds.cern.ch/record/1326925 |
work_keys_str_mv | AT flixjose monitoringthereadinessandutilizationofthedistributedcmscomputingfacilitiesduringthefirstyearoflhcrunning AT hernandezjosem monitoringthereadinessandutilizationofthedistributedcmscomputingfacilitiesduringthefirstyearoflhcrunning AT sciabaandrea monitoringthereadinessandutilizationofthedistributedcmscomputingfacilitiesduringthefirstyearoflhcrunning |