Cargando…

Evaluation and Implementation of Various Persistent Storage Options for CMSWEB Services in Kubernetes Infrastructure at CERN

This paper summarizes the various storage options that we implemented for the CMSWEB cluster in Kubernetes infrastructure. All CMSWEB services require storage for logs, while some services also require storage for data. We also provide a feasibility analysis of various storage options and describe t...

Descripción completa

Detalles Bibliográficos
Autores principales: Imran, Muhammad, Kuznetsov, Valentin, Paparrigopoulos, Panos, Trigazis, Spyridon, Pfeiffer, Andreas
Lenguaje:eng
Publicado: 2022
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/2438/1/012035
http://cds.cern.ch/record/2802587
_version_ 1780972752737402880
author Imran, Muhammad
Kuznetsov, Valentin
Paparrigopoulos, Panos
Trigazis, Spyridon
Pfeiffer, Andreas
author_facet Imran, Muhammad
Kuznetsov, Valentin
Paparrigopoulos, Panos
Trigazis, Spyridon
Pfeiffer, Andreas
author_sort Imran, Muhammad
collection CERN
description This paper summarizes the various storage options that we implemented for the CMSWEB cluster in Kubernetes infrastructure. All CMSWEB services require storage for logs, while some services also require storage for data. We also provide a feasibility analysis of various storage options and describe the pros/cons of each technique from the perspective of the CMSWEB cluster and its users. In the end, we also propose recommendations according to the service needs. The first option is the CephFS which can be mounted multiple times across various clusters and VMs and works very well with k8s. We use it both for data and the logs. The second option is the Cinder volume. It is the block storage that runs the filesystem on top of it. It can only be attached to one instance at a time. We use this option only for the data. The third option is S3 storage. It is object storage that offers a scalable storage service that can be used by applications compatible with the Amazon S3 protocol. It is used for the logs. For S3, we explored two mechanisms. For the first scenario, we consider fluentd that runs as a sidecar container in the service pods and sends logs to S3 bucket. For the second scenario, we considered filebeat that runs as a sidecar container in the service pod and scaps those logs to fluentd which runs as a daemonset in each node and sends those logs to S3 in the end. The fourth option is EOS. We configured EOS inside the pods of the CMSWEB services. The fifth option that we explored is to use dedicated VMs that have Ceph volume attached to them. In EOS and VM, the logs from the service pods are sent to EOS/VM using the rsync approach. The last option is to send service logs to Elasticsearch. It has been implemented using fluentd that runs as a daemonset in each node. In parallel to the sending logs to S3 fluentd also sends those logs to the Elasticsearch infrastructure at CERN.
id cern-2802587
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2022
record_format invenio
spelling cern-28025872023-08-23T07:29:02Zdoi:10.1088/1742-6596/2438/1/012035http://cds.cern.ch/record/2802587engImran, MuhammadKuznetsov, ValentinPaparrigopoulos, PanosTrigazis, SpyridonPfeiffer, AndreasEvaluation and Implementation of Various Persistent Storage Options for CMSWEB Services in Kubernetes Infrastructure at CERNDetectors and Experimental TechniquesThis paper summarizes the various storage options that we implemented for the CMSWEB cluster in Kubernetes infrastructure. All CMSWEB services require storage for logs, while some services also require storage for data. We also provide a feasibility analysis of various storage options and describe the pros/cons of each technique from the perspective of the CMSWEB cluster and its users. In the end, we also propose recommendations according to the service needs. The first option is the CephFS which can be mounted multiple times across various clusters and VMs and works very well with k8s. We use it both for data and the logs. The second option is the Cinder volume. It is the block storage that runs the filesystem on top of it. It can only be attached to one instance at a time. We use this option only for the data. The third option is S3 storage. It is object storage that offers a scalable storage service that can be used by applications compatible with the Amazon S3 protocol. It is used for the logs. For S3, we explored two mechanisms. For the first scenario, we consider fluentd that runs as a sidecar container in the service pods and sends logs to S3 bucket. For the second scenario, we considered filebeat that runs as a sidecar container in the service pod and scaps those logs to fluentd which runs as a daemonset in each node and sends those logs to S3 in the end. The fourth option is EOS. We configured EOS inside the pods of the CMSWEB services. The fifth option that we explored is to use dedicated VMs that have Ceph volume attached to them. In EOS and VM, the logs from the service pods are sent to EOS/VM using the rsync approach. The last option is to send service logs to Elasticsearch. It has been implemented using fluentd that runs as a daemonset in each node. In parallel to the sending logs to S3 fluentd also sends those logs to the Elasticsearch infrastructure at CERN.CMS-CR-2022-019oai:cds.cern.ch:28025872022-01-25
spellingShingle Detectors and Experimental Techniques
Imran, Muhammad
Kuznetsov, Valentin
Paparrigopoulos, Panos
Trigazis, Spyridon
Pfeiffer, Andreas
Evaluation and Implementation of Various Persistent Storage Options for CMSWEB Services in Kubernetes Infrastructure at CERN
title Evaluation and Implementation of Various Persistent Storage Options for CMSWEB Services in Kubernetes Infrastructure at CERN
title_full Evaluation and Implementation of Various Persistent Storage Options for CMSWEB Services in Kubernetes Infrastructure at CERN
title_fullStr Evaluation and Implementation of Various Persistent Storage Options for CMSWEB Services in Kubernetes Infrastructure at CERN
title_full_unstemmed Evaluation and Implementation of Various Persistent Storage Options for CMSWEB Services in Kubernetes Infrastructure at CERN
title_short Evaluation and Implementation of Various Persistent Storage Options for CMSWEB Services in Kubernetes Infrastructure at CERN
title_sort evaluation and implementation of various persistent storage options for cmsweb services in kubernetes infrastructure at cern
topic Detectors and Experimental Techniques
url https://dx.doi.org/10.1088/1742-6596/2438/1/012035
http://cds.cern.ch/record/2802587
work_keys_str_mv AT imranmuhammad evaluationandimplementationofvariouspersistentstorageoptionsforcmswebservicesinkubernetesinfrastructureatcern
AT kuznetsovvalentin evaluationandimplementationofvariouspersistentstorageoptionsforcmswebservicesinkubernetesinfrastructureatcern
AT paparrigopoulospanos evaluationandimplementationofvariouspersistentstorageoptionsforcmswebservicesinkubernetesinfrastructureatcern
AT trigazisspyridon evaluationandimplementationofvariouspersistentstorageoptionsforcmswebservicesinkubernetesinfrastructureatcern
AT pfeifferandreas evaluationandimplementationofvariouspersistentstorageoptionsforcmswebservicesinkubernetesinfrastructureatcern