Network Resiliency Implementation in the ATLAS TDAQ System

Bibliographic Details
Main authors: Stancu, S N, Al-Shabibi, A, Batraneanu, S M, Ballestrero, S, Caramarcu, C, Martin, B, Savu, D O, Sjoen, R V, Valsan, L
Language: eng
Published: 2010
Subjects: Detectors and Experimental Techniques
Online access: http://cds.cern.ch/record/1269936
author Stancu, S N
Al-Shabibi, A
Batraneanu, S M
Ballestrero, S
Caramarcu, C
Martin, B
Savu, D O
Sjoen, R V
Valsan, L
collection CERN
description The ATLAS TDAQ (Trigger and Data Acquisition) system performs the real-time selection of events produced by the detector. For this purpose, approximately 2000 computers are deployed and interconnected through various high-speed networks, whose architecture has already been described. This article focuses on the implementation and validation of network connectivity resiliency (previously presented at a conceptual level). Redundancy and eventually load balancing are achieved through the synergy of various protocols: 802.3ad link aggregation, OSPF (Open Shortest Path First), VRRP (Virtual Router Redundancy Protocol), and MST (Multiple Spanning Trees). An innovative method for cost-effective redundant connectivity of high-throughput, high-availability servers is presented. Furthermore, real-life examples showing how redundancy works and, more importantly, how it might fail despite careful planning are presented.
id cern-1269936
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2010
record_format invenio
spelling cern-1269936
2019-09-30T06:29:59Z
http://cds.cern.ch/record/1269936
eng
Stancu, S N
Al-Shabibi, A
Batraneanu, S M
Ballestrero, S
Caramarcu, C
Martin, B
Savu, D O
Sjoen, R V
Valsan, L
Network Resiliency Implementation in the ATLAS TDAQ System
Detectors and Experimental Techniques
ATL-DAQ-PROC-2010-007
oai:cds.cern.ch:1269936
2010-06-05
title Network Resiliency Implementation in the ATLAS TDAQ System
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1269936