Cargando…

Evolution of the Hadoop platform and ecosystem for high energy physics

The interest in using scalable data processing solutions based on Apache Hadoop ecosystem is constantly growing in the High Energy Physics (HEP) community. This drives the need for increased reliability and availability of the central Hadoop service and underlying infrastructure provided to the comm...

Descripción completa

Detalles Bibliográficos
Autores principales: Baranowski, Zbigniew, Kleszcz, Emil, Kothuri, Prasanth, Canali, Luca, Castellotti, Riccardo, Martin Marquez, Manuel, Matos de Barros, Nuno Guilherme, Motesnitsalis, Evangelos, Mrowczynski, Piotr, Luna Duran, Jose Carlos
Lenguaje:eng
Publicado: 2019
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/201921404058
http://cds.cern.ch/record/2700232
_version_ 1780964496604397568
author Baranowski, Zbigniew
Kleszcz, Emil
Kothuri, Prasanth
Canali, Luca
Castellotti, Riccardo
Martin Marquez, Manuel
Matos de Barros, Nuno Guilherme
Motesnitsalis, Evangelos
Mrowczynski, Piotr
Luna Duran, Jose Carlos
author_facet Baranowski, Zbigniew
Kleszcz, Emil
Kothuri, Prasanth
Canali, Luca
Castellotti, Riccardo
Martin Marquez, Manuel
Matos de Barros, Nuno Guilherme
Motesnitsalis, Evangelos
Mrowczynski, Piotr
Luna Duran, Jose Carlos
author_sort Baranowski, Zbigniew
collection CERN
description The interest in using scalable data processing solutions based on Apache Hadoop ecosystem is constantly growing in the High Energy Physics (HEP) community. This drives the need for increased reliability and availability of the central Hadoop service and underlying infrastructure provided to the community by the CERN IT department. This paper reports on the overall status of the Hadoop platform and related Hadoop and Spark service at CERN, detailing recent enhancements and features introduced in many areas including the service configuration, availability, alerting, monitoring and data protection, in order to meet the new requirements posed by the users’ community.
id oai-inspirehep.net-1761010
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2019
record_format invenio
spelling oai-inspirehep.net-17610102022-08-10T12:24:09Zdoi:10.1051/epjconf/201921404058http://cds.cern.ch/record/2700232engBaranowski, ZbigniewKleszcz, EmilKothuri, PrasanthCanali, LucaCastellotti, RiccardoMartin Marquez, ManuelMatos de Barros, Nuno GuilhermeMotesnitsalis, EvangelosMrowczynski, PiotrLuna Duran, Jose CarlosEvolution of the Hadoop platform and ecosystem for high energy physicsComputing and ComputersThe interest in using scalable data processing solutions based on Apache Hadoop ecosystem is constantly growing in the High Energy Physics (HEP) community. This drives the need for increased reliability and availability of the central Hadoop service and underlying infrastructure provided to the community by the CERN IT department. This paper reports on the overall status of the Hadoop platform and related Hadoop and Spark service at CERN, detailing recent enhancements and features introduced in many areas including the service configuration, availability, alerting, monitoring and data protection, in order to meet the new requirements posed by the users’ community.oai:inspirehep.net:17610102019
spellingShingle Computing and Computers
Baranowski, Zbigniew
Kleszcz, Emil
Kothuri, Prasanth
Canali, Luca
Castellotti, Riccardo
Martin Marquez, Manuel
Matos de Barros, Nuno Guilherme
Motesnitsalis, Evangelos
Mrowczynski, Piotr
Luna Duran, Jose Carlos
Evolution of the Hadoop platform and ecosystem for high energy physics
title Evolution of the Hadoop platform and ecosystem for high energy physics
title_full Evolution of the Hadoop platform and ecosystem for high energy physics
title_fullStr Evolution of the Hadoop platform and ecosystem for high energy physics
title_full_unstemmed Evolution of the Hadoop platform and ecosystem for high energy physics
title_short Evolution of the Hadoop platform and ecosystem for high energy physics
title_sort evolution of the hadoop platform and ecosystem for high energy physics
topic Computing and Computers
url https://dx.doi.org/10.1051/epjconf/201921404058
http://cds.cern.ch/record/2700232
work_keys_str_mv AT baranowskizbigniew evolutionofthehadoopplatformandecosystemforhighenergyphysics
AT kleszczemil evolutionofthehadoopplatformandecosystemforhighenergyphysics
AT kothuriprasanth evolutionofthehadoopplatformandecosystemforhighenergyphysics
AT canaliluca evolutionofthehadoopplatformandecosystemforhighenergyphysics
AT castellottiriccardo evolutionofthehadoopplatformandecosystemforhighenergyphysics
AT martinmarquezmanuel evolutionofthehadoopplatformandecosystemforhighenergyphysics
AT matosdebarrosnunoguilherme evolutionofthehadoopplatformandecosystemforhighenergyphysics
AT motesnitsalisevangelos evolutionofthehadoopplatformandecosystemforhighenergyphysics
AT mrowczynskipiotr evolutionofthehadoopplatformandecosystemforhighenergyphysics
AT lunaduranjosecarlos evolutionofthehadoopplatformandecosystemforhighenergyphysics