
Testing of complex, large-scale distributed storage systems: a CERN disk storage case study

Complex, large-scale distributed systems are frequently used to solve extraordinary computing, storage and other problems. However, the development of these systems usually requires working with several software components, maintaining and improving a large codebase and also providing a collaborativ...


Bibliographic Details
Main authors: Makai, Jozsef; Peters, Andreas Joachim; Bitzes, Georgios; Sindrilaru, Elvin Alin; Simon, Michal Kamil; Manzi, Andrea
Language: eng
Published: 2019
Subjects:
Online access: https://dx.doi.org/10.1051/epjconf/201921405008
http://cds.cern.ch/record/2700237
author Makai, Jozsef
Peters, Andreas Joachim
Bitzes, Georgios
Sindrilaru, Elvin Alin
Simon, Michal Kamil
Manzi, Andrea
collection CERN
description Complex, large-scale distributed systems are frequently used to solve extraordinary computing, storage and other problems. However, developing these systems usually requires working with several software components, maintaining and improving a large codebase, and providing a collaborative environment for many developers working together. The central role that such complex systems play in mission-critical tasks and in the daily activity of users means that any software bug affecting the availability of the service has far-reaching effects. Providing an easily extensible testing framework is a prerequisite for building confidence both in the system and among the developers who contribute to the code. The testing framework can address concrete bugs found in the codebase, thus avoiding future regressions, and also provides a high degree of confidence for the people contributing new code. Easily incorporating other people's work into the project greatly helps in scaling out manpower, so that having more developers contributing to the project actually results in more work being done rather than more bugs being added. In this paper we go through the case study of EOS, the CERN disk storage system, and introduce the methods and mechanisms for achieving fully automatic regression and robustness testing, along with continuous integration, for such a large-scale, complex and critical system using a container-based environment.
id oai-inspirehep.net-1761019
institution European Organization for Nuclear Research
language eng
publishDate 2019
record_format invenio
title Testing of complex, large-scale distributed storage systems: a CERN disk storage case study
topic Computing and Computers
url https://dx.doi.org/10.1051/epjconf/201921405008
http://cds.cern.ch/record/2700237