Cargando…

FTS3: Quantitative Monitoring

The overall success of LHC data processing depends heavily on stable, reliable and fast data distribution. The Worldwide LHC Computing Grid (WLCG) relies on the File Transfer Service (FTS) as the data movement middleware for moving sets of files from one site to another. This paper describes the com...

Descripción completa

Detalles Bibliográficos
Autores principales: Riahi, H, Salichos, M, Keeble, O, Andreeva, J, Ayllon, A A, Girolamo, A Di, Magini, N, Roiser, S, Simon, M K
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/664/6/062051
http://cds.cern.ch/record/2134612
_version_ 1780949915935965184
author Riahi, H
Salichos, M
Keeble, O
Andreeva, J
Ayllon, A A
Girolamo, A Di
Magini, N
Roiser, S
Simon, M K
author_facet Riahi, H
Salichos, M
Keeble, O
Andreeva, J
Ayllon, A A
Girolamo, A Di
Magini, N
Roiser, S
Simon, M K
author_sort Riahi, H
collection CERN
description The overall success of LHC data processing depends heavily on stable, reliable and fast data distribution. The Worldwide LHC Computing Grid (WLCG) relies on the File Transfer Service (FTS) as the data movement middleware for moving sets of files from one site to another. This paper describes the components of FTS3 monitoring infrastructure and how they are built to satisfy the common and particular requirements of the LHC experiments. We show how the system provides a complete and detailed cross-virtual organization (VO) picture of transfers for sites, operators and VOs. This information has proven critical due to the shared nature of the infrastructure, allowing a complete view of all transfers on shared network links between various workflows and VOs using the same FTS transfer manager. We also report on the performance of the FTS service itself, using data generated by the aforementioned monitoring infrastructure both during the commissioning and the first phase of production. We also explain how this monitoring information and network metrics produced can be used both as a starting point for troubleshooting data transfer issues, but also as a mechanism to collect information such as transfer efficiency between sites, achieved throughput and its evolution over time, most common errors, etc, and take decision upon them to further optimize transfer workflows. The service setup is subject to sites policies to control the network resource usage, as well as all the VOs making use of the Grid resources at the site to satisfy their requirements. FTS3 is the new version of FTS and has been deployed in production in August 2014.
id oai-inspirehep.net-1413969
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling oai-inspirehep.net-14139692022-08-10T13:01:00Zdoi:10.1088/1742-6596/664/6/062051http://cds.cern.ch/record/2134612engRiahi, HSalichos, MKeeble, OAndreeva, JAyllon, A AGirolamo, A DiMagini, NRoiser, SSimon, M KFTS3: Quantitative MonitoringComputing and ComputersThe overall success of LHC data processing depends heavily on stable, reliable and fast data distribution. The Worldwide LHC Computing Grid (WLCG) relies on the File Transfer Service (FTS) as the data movement middleware for moving sets of files from one site to another. This paper describes the components of FTS3 monitoring infrastructure and how they are built to satisfy the common and particular requirements of the LHC experiments. We show how the system provides a complete and detailed cross-virtual organization (VO) picture of transfers for sites, operators and VOs. This information has proven critical due to the shared nature of the infrastructure, allowing a complete view of all transfers on shared network links between various workflows and VOs using the same FTS transfer manager. We also report on the performance of the FTS service itself, using data generated by the aforementioned monitoring infrastructure both during the commissioning and the first phase of production. We also explain how this monitoring information and network metrics produced can be used both as a starting point for troubleshooting data transfer issues, but also as a mechanism to collect information such as transfer efficiency between sites, achieved throughput and its evolution over time, most common errors, etc, and take decision upon them to further optimize transfer workflows. The service setup is subject to sites policies to control the network resource usage, as well as all the VOs making use of the Grid resources at the site to satisfy their requirements. FTS3 is the new version of FTS and has been deployed in production in August 2014.oai:inspirehep.net:14139692015
spellingShingle Computing and Computers
Riahi, H
Salichos, M
Keeble, O
Andreeva, J
Ayllon, A A
Girolamo, A Di
Magini, N
Roiser, S
Simon, M K
FTS3: Quantitative Monitoring
title FTS3: Quantitative Monitoring
title_full FTS3: Quantitative Monitoring
title_fullStr FTS3: Quantitative Monitoring
title_full_unstemmed FTS3: Quantitative Monitoring
title_short FTS3: Quantitative Monitoring
title_sort fts3: quantitative monitoring
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/664/6/062051
http://cds.cern.ch/record/2134612
work_keys_str_mv AT riahih fts3quantitativemonitoring
AT salichosm fts3quantitativemonitoring
AT keebleo fts3quantitativemonitoring
AT andreevaj fts3quantitativemonitoring
AT ayllonaa fts3quantitativemonitoring
AT girolamoadi fts3quantitativemonitoring
AT maginin fts3quantitativemonitoring
AT roisers fts3quantitativemonitoring
AT simonmk fts3quantitativemonitoring