Cargando…

A runtime estimation framework for ALICE

The European Organization for Nuclear Research (CERN) is the largest research organization for particle physics. ALICE, short for ALarge Ion Collider Experiment, serves as one of the main detectors at CERN and produces approximately 15 petabytes of data each year. The computing associated with an AL...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pumma, Sarunya, Feng, Wu-chun, Phunchongharn, Phond, Chapeland, Sylvain, Achalakul, Tiranee
Lenguaje:	eng
Publicado:	2017
Materias:	Computing and Computers
Acceso en línea:	https://dx.doi.org/10.1016/j.future.2017.02.040 http://cds.cern.ch/record/2272372

_version_	1780954952955330560
author	Pumma, Sarunya Feng, Wu-chun Phunchongharn, Phond Chapeland, Sylvain Achalakul, Tiranee
author_facet	Pumma, Sarunya Feng, Wu-chun Phunchongharn, Phond Chapeland, Sylvain Achalakul, Tiranee
author_sort	Pumma, Sarunya
collection	CERN
description	The European Organization for Nuclear Research (CERN) is the largest research organization for particle physics. ALICE, short for ALarge Ion Collider Experiment, serves as one of the main detectors at CERN and produces approximately 15 petabytes of data each year. The computing associated with an ALICE experiment consists of both online and offline processing. An online cluster retrieves data while an offline cluster farm performs a broad range of data analysis. Online processing occurs as collision events are streamed from the detector to the online cluster. This process compresses and calibrates the data before storing it in a data storage system for subsequent offline processing, e.g., event reconstruction. Due to the large volume of stored data to process, offline processing seeks to minimize execution time and data-staging time of the applications via a two-tier offline cluster — the Event Processing Node (EPN) as the first tier and the World LHC Grid Computing (WLGC) as the second tier. This two-tier cluster requires a smart job scheduler to efficiently manage the running of the application. Thus, we propose a runtime estimation method for this offline processing in the ALICE environment. Our approach exploits application profiles to predict the runtime of a high-performance computing (HPC) application without the need for any additional metadata. To evaluate our proposed framework, we performed our experiment on the actual ALICE applications. In addition, we also test the efficacy of our runtime estimation method to predict the run times of the HPC applications on the Amazon EC2 cloud. The results show that our approach generally delivers accurate predictions, i.e., low error percentages.
id	oai-inspirehep.net-1605075
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2017
record_format	invenio
spelling	oai-inspirehep.net-16050752019-09-30T06:29:59Zdoi:10.1016/j.future.2017.02.040http://cds.cern.ch/record/2272372engPumma, SarunyaFeng, Wu-chunPhunchongharn, PhondChapeland, SylvainAchalakul, TiraneeA runtime estimation framework for ALICEComputing and ComputersComputing and ComputersThe European Organization for Nuclear Research (CERN) is the largest research organization for particle physics. ALICE, short for ALarge Ion Collider Experiment, serves as one of the main detectors at CERN and produces approximately 15 petabytes of data each year. The computing associated with an ALICE experiment consists of both online and offline processing. An online cluster retrieves data while an offline cluster farm performs a broad range of data analysis. Online processing occurs as collision events are streamed from the detector to the online cluster. This process compresses and calibrates the data before storing it in a data storage system for subsequent offline processing, e.g., event reconstruction. Due to the large volume of stored data to process, offline processing seeks to minimize execution time and data-staging time of the applications via a two-tier offline cluster — the Event Processing Node (EPN) as the first tier and the World LHC Grid Computing (WLGC) as the second tier. This two-tier cluster requires a smart job scheduler to efficiently manage the running of the application. Thus, we propose a runtime estimation method for this offline processing in the ALICE environment. Our approach exploits application profiles to predict the runtime of a high-performance computing (HPC) application without the need for any additional metadata. To evaluate our proposed framework, we performed our experiment on the actual ALICE applications. In addition, we also test the efficacy of our runtime estimation method to predict the run times of the HPC applications on the Amazon EC2 cloud. The results show that our approach generally delivers accurate predictions, i.e., low error percentages.oai:inspirehep.net:16050752017
spellingShingle	Computing and Computers Computing and Computers Pumma, Sarunya Feng, Wu-chun Phunchongharn, Phond Chapeland, Sylvain Achalakul, Tiranee A runtime estimation framework for ALICE
title	A runtime estimation framework for ALICE
title_full	A runtime estimation framework for ALICE
title_fullStr	A runtime estimation framework for ALICE
title_full_unstemmed	A runtime estimation framework for ALICE
title_short	A runtime estimation framework for ALICE
title_sort	runtime estimation framework for alice
topic	Computing and Computers Computing and Computers
url	https://dx.doi.org/10.1016/j.future.2017.02.040 http://cds.cern.ch/record/2272372
work_keys_str_mv	AT pummasarunya aruntimeestimationframeworkforalice AT fengwuchun aruntimeestimationframeworkforalice AT phunchongharnphond aruntimeestimationframeworkforalice AT chapelandsylvain aruntimeestimationframeworkforalice AT achalakultiranee aruntimeestimationframeworkforalice AT pummasarunya runtimeestimationframeworkforalice AT fengwuchun runtimeestimationframeworkforalice AT phunchongharnphond runtimeestimationframeworkforalice AT chapelandsylvain runtimeestimationframeworkforalice AT achalakultiranee runtimeestimationframeworkforalice

A runtime estimation framework for ALICE

Ejemplares similares