Cargando…

Support for HTCondor high-throughput computing workflows in the REANA reusable analysis platform

REANA is a reusable and reproducible data analysis platform allowing researchers to structure their analysis pipelines and run them on remote containerised compute clouds. REANA supports several different workflows systems (CWL, Serial, Yadage) and uses Kubernetes’ job execution backend. We have des...

Descripción completa

Detalles Bibliográficos
Autores principales: Maciulaitis, Rokas, Brener, Paul, Hampton, Scott, Hildreth, Mike, Hurtado Anampa, Kenyi Paolo, Johnson, Irena, Kankel, Cody, Okraska, Jan, Rodriguez Rodriguez, Diego, Simko, Tibor
Lenguaje:eng
Publicado: 2019
Materias:
Acceso en línea:http://cds.cern.ch/record/2696223
Descripción
Sumario:REANA is a reusable and reproducible data analysis platform allowing researchers to structure their analysis pipelines and run them on remote containerised compute clouds. REANA supports several different workflows systems (CWL, Serial, Yadage) and uses Kubernetes’ job execution backend. We have designed an abstract job execution component that extends the REANA platform job execution capabilities to support multiple compute backends. We have tested the abstract job execution component with HTCondor and verified the scalability of the designed solution. The results show that the REANA platform would be able to support hybrid scientific workflows where different parts of the analysis pipelines can be executed on multiple computing backends.