Cargando…

SWAN: a Service for Interactive Analysis in the Cloud

SWAN (Service for Web based ANalysis) is a platform to perform interactive data analysis in the cloud. SWAN allows users to write and run their data analyses with only a web browser, leveraging on the widely-adopted Jupyter notebook interface. The user code, executions and data live entirely in the...

Descripción completa

Detalles Bibliográficos
Autores principales: Piparo, Danilo, Tejedor, Enric, Mato, Pere, Mascetti, Luca, Moscicki, Jakub, Lamanna, Massimo
Lenguaje:eng
Publicado: 2016
Materias:
Acceso en línea:https://dx.doi.org/10.1016/j.future.2016.11.035
http://cds.cern.ch/record/2158559
_version_ 1780950785844051968
author Piparo, Danilo
Tejedor, Enric
Mato, Pere
Mascetti, Luca
Moscicki, Jakub
Lamanna, Massimo
author_facet Piparo, Danilo
Tejedor, Enric
Mato, Pere
Mascetti, Luca
Moscicki, Jakub
Lamanna, Massimo
author_sort Piparo, Danilo
collection CERN
description SWAN (Service for Web based ANalysis) is a platform to perform interactive data analysis in the cloud. SWAN allows users to write and run their data analyses with only a web browser, leveraging on the widely-adopted Jupyter notebook interface. The user code, executions and data live entirely in the cloud. SWAN makes it easier to produce and share results and scientific code, access scientific software, produce tutorials and demonstrations as well as preserve analyses. Furthermore, it is also a powerful tool for non-scientific data analytics. This paper describes how a pilot of the SWAN service was implemented and deployed at CERN. Its backend combines state-of-the-art software technologies with a set of existing IT services such as user authentication, virtual computing infrastructure, mass storage, file synchronisation and sharing, specialised clusters and batch systems. The added value of this combination of services is discussed, with special focus on the opportunities offered by the CERNBox service and its massive storage backend, EOS. In particular, it is described how a cloud-based analysis model benefits from synchronised storage and sharing capabilities.
id cern-2158559
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2016
record_format invenio
spelling cern-21585592019-09-30T06:29:59Zdoi:10.1016/j.future.2016.11.035http://cds.cern.ch/record/2158559engPiparo, DaniloTejedor, EnricMato, PereMascetti, LucaMoscicki, JakubLamanna, MassimoSWAN: a Service for Interactive Analysis in the CloudComputing and ComputersSWAN (Service for Web based ANalysis) is a platform to perform interactive data analysis in the cloud. SWAN allows users to write and run their data analyses with only a web browser, leveraging on the widely-adopted Jupyter notebook interface. The user code, executions and data live entirely in the cloud. SWAN makes it easier to produce and share results and scientific code, access scientific software, produce tutorials and demonstrations as well as preserve analyses. Furthermore, it is also a powerful tool for non-scientific data analytics. This paper describes how a pilot of the SWAN service was implemented and deployed at CERN. Its backend combines state-of-the-art software technologies with a set of existing IT services such as user authentication, virtual computing infrastructure, mass storage, file synchronisation and sharing, specialised clusters and batch systems. The added value of this combination of services is discussed, with special focus on the opportunities offered by the CERNBox service and its massive storage backend, EOS. In particular, it is described how a cloud-based analysis model benefits from synchronised storage and sharing capabilities.CERN-OPEN-2016-005oai:cds.cern.ch:21585592016-06-06
spellingShingle Computing and Computers
Piparo, Danilo
Tejedor, Enric
Mato, Pere
Mascetti, Luca
Moscicki, Jakub
Lamanna, Massimo
SWAN: a Service for Interactive Analysis in the Cloud
title SWAN: a Service for Interactive Analysis in the Cloud
title_full SWAN: a Service for Interactive Analysis in the Cloud
title_fullStr SWAN: a Service for Interactive Analysis in the Cloud
title_full_unstemmed SWAN: a Service for Interactive Analysis in the Cloud
title_short SWAN: a Service for Interactive Analysis in the Cloud
title_sort swan: a service for interactive analysis in the cloud
topic Computing and Computers
url https://dx.doi.org/10.1016/j.future.2016.11.035
http://cds.cern.ch/record/2158559
work_keys_str_mv AT piparodanilo swanaserviceforinteractiveanalysisinthecloud
AT tejedorenric swanaserviceforinteractiveanalysisinthecloud
AT matopere swanaserviceforinteractiveanalysisinthecloud
AT mascettiluca swanaserviceforinteractiveanalysisinthecloud
AT moscickijakub swanaserviceforinteractiveanalysisinthecloud
AT lamannamassimo swanaserviceforinteractiveanalysisinthecloud