Cargando…

Time-To-Complete Prediction for Data Transfers

Currently, there is no prediction provided to users for the amount of time a particular data transfer from one site in the Worldwide LHC Computing Grid to another will take to complete. To develop a time-to-complete prediction, network performance data and per-file information is gathered from two s...

Descripción completa

Detalles Bibliográficos
Autor principal: Toler, Wesley
Lenguaje:eng
Publicado: 2016
Materias:
Acceso en línea:http://cds.cern.ch/record/2209151
_version_ 1780951758066941952
author Toler, Wesley
author_facet Toler, Wesley
author_sort Toler, Wesley
collection CERN
description Currently, there is no prediction provided to users for the amount of time a particular data transfer from one site in the Worldwide LHC Computing Grid to another will take to complete. To develop a time-to-complete prediction, network performance data and per-file information is gathered from two separate databases and fused, and the resulting cleaned data is fitted using random forest regression. Results are shown for two separate links: the link from CERN Data Centre to Brookhaven National Laboratory’s ATLAS data center, and the link from CERN Data Centre to SARA-MATRIX in Amsterdam. A total RMS error of 25.93 minutes between predicted and test data is found for the CERN-PROD -> BNL-ATLAS link, while the CERN-PROD -> SARA-MATRIX link yields a total RMS error of 3.00 minutes.
id cern-2209151
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2016
record_format invenio
spelling cern-22091512019-09-30T06:29:59Zhttp://cds.cern.ch/record/2209151engToler, WesleyTime-To-Complete Prediction for Data TransfersInformation Transfer and ManagementCurrently, there is no prediction provided to users for the amount of time a particular data transfer from one site in the Worldwide LHC Computing Grid to another will take to complete. To develop a time-to-complete prediction, network performance data and per-file information is gathered from two separate databases and fused, and the resulting cleaned data is fitted using random forest regression. Results are shown for two separate links: the link from CERN Data Centre to Brookhaven National Laboratory’s ATLAS data center, and the link from CERN Data Centre to SARA-MATRIX in Amsterdam. A total RMS error of 25.93 minutes between predicted and test data is found for the CERN-PROD -> BNL-ATLAS link, while the CERN-PROD -> SARA-MATRIX link yields a total RMS error of 3.00 minutes.CERN-STUDENTS-Note-2016-109oai:cds.cern.ch:22091512016-08-19
spellingShingle Information Transfer and Management
Toler, Wesley
Time-To-Complete Prediction for Data Transfers
title Time-To-Complete Prediction for Data Transfers
title_full Time-To-Complete Prediction for Data Transfers
title_fullStr Time-To-Complete Prediction for Data Transfers
title_full_unstemmed Time-To-Complete Prediction for Data Transfers
title_short Time-To-Complete Prediction for Data Transfers
title_sort time-to-complete prediction for data transfers
topic Information Transfer and Management
url http://cds.cern.ch/record/2209151
work_keys_str_mv AT tolerwesley timetocompletepredictionfordatatransfers