Cargando…
Time-To-Complete Prediction for Data Transfers
Currently, there is no prediction provided to users for the amount of time a particular data transfer from one site in the Worldwide LHC Computing Grid to another will take to complete. To develop a time-to-complete prediction, network performance data and per-file information is gathered from two s...
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2016
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2209151 |
_version_ | 1780951758066941952 |
---|---|
author | Toler, Wesley |
author_facet | Toler, Wesley |
author_sort | Toler, Wesley |
collection | CERN |
description | Currently, there is no prediction provided to users for the amount of time a particular data transfer from one site in the Worldwide LHC Computing Grid to another will take to complete. To develop a time-to-complete prediction, network performance data and per-file information is gathered from two separate databases and fused, and the resulting cleaned data is fitted using random forest regression. Results are shown for two separate links: the link from CERN Data Centre to Brookhaven National Laboratory’s ATLAS data center, and the link from CERN Data Centre to SARA-MATRIX in Amsterdam. A total RMS error of 25.93 minutes between predicted and test data is found for the CERN-PROD -> BNL-ATLAS link, while the CERN-PROD -> SARA-MATRIX link yields a total RMS error of 3.00 minutes. |
id | cern-2209151 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2016 |
record_format | invenio |
spelling | cern-22091512019-09-30T06:29:59Zhttp://cds.cern.ch/record/2209151engToler, WesleyTime-To-Complete Prediction for Data TransfersInformation Transfer and ManagementCurrently, there is no prediction provided to users for the amount of time a particular data transfer from one site in the Worldwide LHC Computing Grid to another will take to complete. To develop a time-to-complete prediction, network performance data and per-file information is gathered from two separate databases and fused, and the resulting cleaned data is fitted using random forest regression. Results are shown for two separate links: the link from CERN Data Centre to Brookhaven National Laboratory’s ATLAS data center, and the link from CERN Data Centre to SARA-MATRIX in Amsterdam. A total RMS error of 25.93 minutes between predicted and test data is found for the CERN-PROD -> BNL-ATLAS link, while the CERN-PROD -> SARA-MATRIX link yields a total RMS error of 3.00 minutes.CERN-STUDENTS-Note-2016-109oai:cds.cern.ch:22091512016-08-19 |
spellingShingle | Information Transfer and Management Toler, Wesley Time-To-Complete Prediction for Data Transfers |
title | Time-To-Complete Prediction for Data Transfers |
title_full | Time-To-Complete Prediction for Data Transfers |
title_fullStr | Time-To-Complete Prediction for Data Transfers |
title_full_unstemmed | Time-To-Complete Prediction for Data Transfers |
title_short | Time-To-Complete Prediction for Data Transfers |
title_sort | time-to-complete prediction for data transfers |
topic | Information Transfer and Management |
url | http://cds.cern.ch/record/2209151 |
work_keys_str_mv | AT tolerwesley timetocompletepredictionfordatatransfers |