Increasing Availability by Implementing Software Redundancy in the CMS Detector Control System
The Detector Control System (DCS) of the Compact Muon Solenoid (CMS) experiment ran with high availability throughout the first physics data-taking period of the Large Hadron Collider (LHC). This was achieved through the consistent improvement of the control software and the provision of a 24-hour e...
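Main authors: | Masetti, Lorenzo; Andre, Jean-Marc; Andronidis, Anastasios; Behrens, Ulf; Branson, James; Chaze, Olivier; Cittolin, Sergio; Darlea, Georgiana-Lavinia; Deldicque, Christian; Dobson, Marc; Dupont, Aymeric; Erhan, Samim; Gigi, Dominique; Glege, Frank; Gomez-Ceballos, Guillelmo; Hegeman, Jeroen; Holme, Oliver; Holzner, Andre; Janulis, Mindaugas; Jiménez Estupiñán, Raúl; Meijers, Frans; Meschi, Emilio; Mommsen, Remigius; Morovic, Srecko; Nunez-Barranco-Fernandez, Carlos; O'Dell, Vivian; Orsini, Luciano; Paus, Christoph; Petrucci, Andrea; Pieri, Marco; Racz, Attila; Roberts, Penelope; Sakulin, Hannes; Schwick, Christoph; Stieger, Benjamin; Sumorok, Konstanty; Veverka, Jan; Zaza, Salvatore; Zejdl, Petr |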
---|---|
Language: | eng |
Published: | 2015 |
Subjects: | Accelerators and Storage Rings; Detectors and Experimental Techniques |
Online access: | https://dx.doi.org/10.18429/JACoW-ICALEPCS2015-WEPGF013 http://cds.cern.ch/record/2213496 |
_version_ | 1780951998029365248 |
---|---|
author | Masetti, Lorenzo; Andre, Jean-Marc; Andronidis, Anastasios; Behrens, Ulf; Branson, James; Chaze, Olivier; Cittolin, Sergio; Darlea, Georgiana-Lavinia; Deldicque, Christian; Dobson, Marc; Dupont, Aymeric; Erhan, Samim; Gigi, Dominique; Glege, Frank; Gomez-Ceballos, Guillelmo; Hegeman, Jeroen; Holme, Oliver; Holzner, Andre; Janulis, Mindaugas; Jiménez Estupiñán, Raúl; Meijers, Frans; Meschi, Emilio; Mommsen, Remigius; Morovic, Srecko; Nunez-Barranco-Fernandez, Carlos; O'Dell, Vivian; Orsini, Luciano; Paus, Christoph; Petrucci, Andrea; Pieri, Marco; Racz, Attila; Roberts, Penelope; Sakulin, Hannes; Schwick, Christoph; Stieger, Benjamin; Sumorok, Konstanty; Veverka, Jan; Zaza, Salvatore; Zejdl, Petr |
author_sort | Masetti, Lorenzo |
collection | CERN |
description | The Detector Control System (DCS) of the Compact Muon Solenoid (CMS) experiment ran with high availability throughout the first physics data-taking period of the Large Hadron Collider (LHC). This was achieved through the consistent improvement of the control software and the provision of a 24-hour expert on-call service. One remaining potential cause of significant downtime was the failure of the computers hosting the DCS software. To minimize the impact of these failures after the restart of the LHC in 2015, it was decided to implement a redundant software layer for the control system where two computers host each DCS application. By customizing and extending the redundancy concept offered by WinCC Open Architecture (WinCC OA), the CMS DCS can now run in a fully redundant software configuration. The implementation involves one host being active, handling all monitoring and control tasks, with the second host running in a minimally functional, passive configuration. Data from the active host is constantly copied to the passive host to enable a rapid switchover as needed. This paper describes details of the implementation and practical experience of redundancy in the CMS DCS. |
id | oai-inspirehep.net-1481644 |
institution | European Organization for Nuclear Research |
language | eng |
publishDate | 2015 |
record_format | invenio |
title | Increasing Availability by Implementing Software Redundancy in the CMS Detector Control System |
topic | Accelerators and Storage Rings; Detectors and Experimental Techniques |
url | https://dx.doi.org/10.18429/JACoW-ICALEPCS2015-WEPGF013 http://cds.cern.ch/record/2213496 |