Cargando…

Data Mining as a Service (DMaaS)

Data Mining as a Service (DMaaS) is a software and computing infrastructure that allows interactive mining of scientific data in the cloud. It allows users to run advanced data analyses by leveraging the widely adopted Jupyter notebook interface. Furthermore, the system makes it easier to share resu...

Descripción completa

Detalles Bibliográficos
Autores principales: Tejedor, E, Piparo, D, Mascetti, L, Moscicki, J, Lamanna, M, Mato, P
Lenguaje:eng
Publicado: 2016
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/762/1/012039
http://cds.cern.ch/record/2265885
_version_ 1780954510501347328
author Tejedor, E
Piparo, D
Mascetti, L
Moscicki, J
Lamanna, M
Mato, P
author_facet Tejedor, E
Piparo, D
Mascetti, L
Moscicki, J
Lamanna, M
Mato, P
author_sort Tejedor, E
collection CERN
description Data Mining as a Service (DMaaS) is a software and computing infrastructure that allows interactive mining of scientific data in the cloud. It allows users to run advanced data analyses by leveraging the widely adopted Jupyter notebook interface. Furthermore, the system makes it easier to share results and scientific code, access scientific software, produce tutorials and demonstrations as well as preserve the analyses of scientists. This paper describes how a first pilot of the DMaaS service is being deployed at CERN, starting from the notebook interface that has been fully integrated with the ROOT analysis framework, in order to provide all the tools for scientists to run their analyses. Additionally, we characterise the service backend, which combines a set of IT services such as user authentication, virtual computing infrastructure, mass storage, file synchronisation, development portals or batch systems. The added value acquired by the combination of the aforementioned categories of services is discussed, focusing on the opportunities offered by the CERNBox synchronisation service and its massive storage backend, EOS.
id oai-inspirehep.net-1499983
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2016
record_format invenio
spelling oai-inspirehep.net-14999832019-10-15T15:17:47Zdoi:10.1088/1742-6596/762/1/012039http://cds.cern.ch/record/2265885engTejedor, EPiparo, DMascetti, LMoscicki, JLamanna, MMato, PData Mining as a Service (DMaaS)Computing and ComputersData Mining as a Service (DMaaS) is a software and computing infrastructure that allows interactive mining of scientific data in the cloud. It allows users to run advanced data analyses by leveraging the widely adopted Jupyter notebook interface. Furthermore, the system makes it easier to share results and scientific code, access scientific software, produce tutorials and demonstrations as well as preserve the analyses of scientists. This paper describes how a first pilot of the DMaaS service is being deployed at CERN, starting from the notebook interface that has been fully integrated with the ROOT analysis framework, in order to provide all the tools for scientists to run their analyses. Additionally, we characterise the service backend, which combines a set of IT services such as user authentication, virtual computing infrastructure, mass storage, file synchronisation, development portals or batch systems. The added value acquired by the combination of the aforementioned categories of services is discussed, focusing on the opportunities offered by the CERNBox synchronisation service and its massive storage backend, EOS.oai:inspirehep.net:14999832016
spellingShingle Computing and Computers
Tejedor, E
Piparo, D
Mascetti, L
Moscicki, J
Lamanna, M
Mato, P
Data Mining as a Service (DMaaS)
title Data Mining as a Service (DMaaS)
title_full Data Mining as a Service (DMaaS)
title_fullStr Data Mining as a Service (DMaaS)
title_full_unstemmed Data Mining as a Service (DMaaS)
title_short Data Mining as a Service (DMaaS)
title_sort data mining as a service (dmaas)
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/762/1/012039
http://cds.cern.ch/record/2265885
work_keys_str_mv AT tejedore dataminingasaservicedmaas
AT piparod dataminingasaservicedmaas
AT mascettil dataminingasaservicedmaas
AT moscickij dataminingasaservicedmaas
AT lamannam dataminingasaservicedmaas
AT matop dataminingasaservicedmaas