Cargando…

Managing the CERN Batch System with Kubernetes

The CERN Batch Service faces many challenges in order to get ready for the computing demands of future LHC runs. These challenges require that we look at all potential resources, assessing how efficiently we use them and that we explore different alternatives to exploit opportunistic resources in ou...

Descripción completa

Detalles Bibliográficos
Autores principales:	Alvarez, Luis Fernandez, Datskova, Olga, Jones, Ben, McCance, Gavin
Lenguaje:	eng
Publicado:	2020
Materias:	Computing and Computers
Acceso en línea:	https://dx.doi.org/10.1051/epjconf/202024507048 http://cds.cern.ch/record/2753507

_version_	1780969441300840448
author	Alvarez, Luis Fernandez Datskova, Olga Jones, Ben McCance, Gavin
author_facet	Alvarez, Luis Fernandez Datskova, Olga Jones, Ben McCance, Gavin
author_sort	Alvarez, Luis Fernandez
collection	CERN
description	The CERN Batch Service faces many challenges in order to get ready for the computing demands of future LHC runs. These challenges require that we look at all potential resources, assessing how efficiently we use them and that we explore different alternatives to exploit opportunistic resources in our infrastructure as well as outside of the CERN computing centre. Several projects, like BEER, Helix Nebula Science Cloud and the new OCRE project, have proven our ability to run batch workloads on a wide range of non-traditional resources. However, the challenge is not only to obtain the raw compute resources needed but how to define an operational model that is cost and time efficient, scalable and flexible enough to adapt to a heterogeneous infrastructure. In order to tackle both the provisioning and operational challenges it was decided to use Kubernetes. By using Kubernetes we benefit from a de-facto standard in containerised environments, available in nearly all cloud providers and surrounded by a vibrant ecosystem of open-source projects. Leveraging Kubernetes’ built-in functionality, and other open-source tools such as Helm, Terraform and GitLab CI, we have deployed a first cluster prototype which we discuss in detail. The effort has simplified many of the existing operational procedures we currently have, but has also made us rethink established procedures and assumptions that were only valid in a VM-based cloud environment. This contribution presents how we have adopted Kubernetes into the CERN Batch Service, the impact its adoption has in daily operations, a comparison on resource usage efficiency and the experience so far evolving our infrastructure towards this model.
id	oai-inspirehep.net-1832104
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2020
record_format	invenio
spelling	oai-inspirehep.net-18321042021-03-04T20:31:41Zdoi:10.1051/epjconf/202024507048http://cds.cern.ch/record/2753507engAlvarez, Luis FernandezDatskova, OlgaJones, BenMcCance, GavinManaging the CERN Batch System with KubernetesComputing and ComputersThe CERN Batch Service faces many challenges in order to get ready for the computing demands of future LHC runs. These challenges require that we look at all potential resources, assessing how efficiently we use them and that we explore different alternatives to exploit opportunistic resources in our infrastructure as well as outside of the CERN computing centre. Several projects, like BEER, Helix Nebula Science Cloud and the new OCRE project, have proven our ability to run batch workloads on a wide range of non-traditional resources. However, the challenge is not only to obtain the raw compute resources needed but how to define an operational model that is cost and time efficient, scalable and flexible enough to adapt to a heterogeneous infrastructure. In order to tackle both the provisioning and operational challenges it was decided to use Kubernetes. By using Kubernetes we benefit from a de-facto standard in containerised environments, available in nearly all cloud providers and surrounded by a vibrant ecosystem of open-source projects. Leveraging Kubernetes’ built-in functionality, and other open-source tools such as Helm, Terraform and GitLab CI, we have deployed a first cluster prototype which we discuss in detail. The effort has simplified many of the existing operational procedures we currently have, but has also made us rethink established procedures and assumptions that were only valid in a VM-based cloud environment. This contribution presents how we have adopted Kubernetes into the CERN Batch Service, the impact its adoption has in daily operations, a comparison on resource usage efficiency and the experience so far evolving our infrastructure towards this model.oai:inspirehep.net:18321042020
spellingShingle	Computing and Computers Alvarez, Luis Fernandez Datskova, Olga Jones, Ben McCance, Gavin Managing the CERN Batch System with Kubernetes
title	Managing the CERN Batch System with Kubernetes
title_full	Managing the CERN Batch System with Kubernetes
title_fullStr	Managing the CERN Batch System with Kubernetes
title_full_unstemmed	Managing the CERN Batch System with Kubernetes
title_short	Managing the CERN Batch System with Kubernetes
title_sort	managing the cern batch system with kubernetes
topic	Computing and Computers
url	https://dx.doi.org/10.1051/epjconf/202024507048 http://cds.cern.ch/record/2753507
work_keys_str_mv	AT alvarezluisfernandez managingthecernbatchsystemwithkubernetes AT datskovaolga managingthecernbatchsystemwithkubernetes AT jonesben managingthecernbatchsystemwithkubernetes AT mccancegavin managingthecernbatchsystemwithkubernetes

Managing the CERN Batch System with Kubernetes

Ejemplares similares