Cargando…

No file left behind - monitoring transfer latencies in PhEDEx

The CMS experiment has to move Petabytes of data among dozens of computing centres with low latency in order to make efficient use of its resources. Transfer operations are well established to achieve the desired level of throughput, but operators lack a system to identify early on transfers that wi...

Descripción completa

Detalles Bibliográficos
Autor principal: Ratnikova, Natalia
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:http://cds.cern.ch/record/1457456
_version_ 1780925119266291712
author Ratnikova, Natalia
author_facet Ratnikova, Natalia
author_sort Ratnikova, Natalia
collection CERN
description The CMS experiment has to move Petabytes of data among dozens of computing centres with low latency in order to make efficient use of its resources. Transfer operations are well established to achieve the desired level of throughput, but operators lack a system to identify early on transfers that will need manual intervention to reach completion. File transfer latencies are sensitive to the underlying problems in the transfer infrastructure, and their measurement can be used as prompt trigger for preventive actions. For this reason, PhEDEx, the CMS transfer management system, has recently implemented a monitoring system to measure the transfer latencies at the level of individual files. For the first time now, the system can predict the completion time for the transfer of a data set. The operators can detect abnormal patterns in transfer latencies early, and correct the issues while the transfer is still in progress. Statistics are aggregated for blocks of files, recording a historical log to monitor the long-term evolution of transfer latencies, which are used as cumulative metrics to evaluate the performance of the transfer infrastructure, and to plan the global data placement strategy. In this contribution, we present the typical patterns of transfer latencies that may be identified with the latency monitor, and we show how we are able to detect the sources of latency arising from the underlying infrastructure (such as stuck files) which need operator intervention.
id cern-1457456
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling cern-14574562019-09-30T06:29:59Zhttp://cds.cern.ch/record/1457456engRatnikova, NataliaNo file left behind - monitoring transfer latencies in PhEDExDetectors and Experimental TechniquesThe CMS experiment has to move Petabytes of data among dozens of computing centres with low latency in order to make efficient use of its resources. Transfer operations are well established to achieve the desired level of throughput, but operators lack a system to identify early on transfers that will need manual intervention to reach completion. File transfer latencies are sensitive to the underlying problems in the transfer infrastructure, and their measurement can be used as prompt trigger for preventive actions. For this reason, PhEDEx, the CMS transfer management system, has recently implemented a monitoring system to measure the transfer latencies at the level of individual files. For the first time now, the system can predict the completion time for the transfer of a data set. The operators can detect abnormal patterns in transfer latencies early, and correct the issues while the transfer is still in progress. Statistics are aggregated for blocks of files, recording a historical log to monitor the long-term evolution of transfer latencies, which are used as cumulative metrics to evaluate the performance of the transfer infrastructure, and to plan the global data placement strategy. In this contribution, we present the typical patterns of transfer latencies that may be identified with the latency monitor, and we show how we are able to detect the sources of latency arising from the underlying infrastructure (such as stuck files) which need operator intervention.CMS-CR-2012-140oai:cds.cern.ch:14574562012-06-13
spellingShingle Detectors and Experimental Techniques
Ratnikova, Natalia
No file left behind - monitoring transfer latencies in PhEDEx
title No file left behind - monitoring transfer latencies in PhEDEx
title_full No file left behind - monitoring transfer latencies in PhEDEx
title_fullStr No file left behind - monitoring transfer latencies in PhEDEx
title_full_unstemmed No file left behind - monitoring transfer latencies in PhEDEx
title_short No file left behind - monitoring transfer latencies in PhEDEx
title_sort no file left behind - monitoring transfer latencies in phedex
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1457456
work_keys_str_mv AT ratnikovanatalia nofileleftbehindmonitoringtransferlatenciesinphedex