Cargando…
Data Mining as a Service (DMaaS)
Data Mining as a Service (DMaaS) is a software and computing infrastructure that allows interactive mining of scientific data in the cloud. It allows users to run advanced data analyses by leveraging the widely adopted Jupyter notebook interface. Furthermore, the system makes it easier to share resu...
Autores principales: | , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2016
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/762/1/012039 http://cds.cern.ch/record/2265885 |
_version_ | 1780954510501347328 |
---|---|
author | Tejedor, E Piparo, D Mascetti, L Moscicki, J Lamanna, M Mato, P |
author_facet | Tejedor, E Piparo, D Mascetti, L Moscicki, J Lamanna, M Mato, P |
author_sort | Tejedor, E |
collection | CERN |
description | Data Mining as a Service (DMaaS) is a software and computing infrastructure that allows interactive mining of scientific data in the cloud. It allows users to run advanced data analyses by leveraging the widely adopted Jupyter notebook interface. Furthermore, the system makes it easier to share results and scientific code, access scientific software, produce tutorials and demonstrations as well as preserve the analyses of scientists. This paper describes how a first pilot of the DMaaS service is being deployed at CERN, starting from the notebook interface that has been fully integrated with the ROOT analysis framework, in order to provide all the tools for scientists to run their analyses. Additionally, we characterise the service backend, which combines a set of IT services such as user authentication, virtual computing infrastructure, mass storage, file synchronisation, development portals or batch systems. The added value acquired by the combination of the aforementioned categories of services is discussed, focusing on the opportunities offered by the CERNBox synchronisation service and its massive storage backend, EOS. |
id | oai-inspirehep.net-1499983 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2016 |
record_format | invenio |
spelling | oai-inspirehep.net-14999832019-10-15T15:17:47Zdoi:10.1088/1742-6596/762/1/012039http://cds.cern.ch/record/2265885engTejedor, EPiparo, DMascetti, LMoscicki, JLamanna, MMato, PData Mining as a Service (DMaaS)Computing and ComputersData Mining as a Service (DMaaS) is a software and computing infrastructure that allows interactive mining of scientific data in the cloud. It allows users to run advanced data analyses by leveraging the widely adopted Jupyter notebook interface. Furthermore, the system makes it easier to share results and scientific code, access scientific software, produce tutorials and demonstrations as well as preserve the analyses of scientists. This paper describes how a first pilot of the DMaaS service is being deployed at CERN, starting from the notebook interface that has been fully integrated with the ROOT analysis framework, in order to provide all the tools for scientists to run their analyses. Additionally, we characterise the service backend, which combines a set of IT services such as user authentication, virtual computing infrastructure, mass storage, file synchronisation, development portals or batch systems. The added value acquired by the combination of the aforementioned categories of services is discussed, focusing on the opportunities offered by the CERNBox synchronisation service and its massive storage backend, EOS.oai:inspirehep.net:14999832016 |
spellingShingle | Computing and Computers Tejedor, E Piparo, D Mascetti, L Moscicki, J Lamanna, M Mato, P Data Mining as a Service (DMaaS) |
title | Data Mining as a Service (DMaaS) |
title_full | Data Mining as a Service (DMaaS) |
title_fullStr | Data Mining as a Service (DMaaS) |
title_full_unstemmed | Data Mining as a Service (DMaaS) |
title_short | Data Mining as a Service (DMaaS) |
title_sort | data mining as a service (dmaas) |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/762/1/012039 http://cds.cern.ch/record/2265885 |
work_keys_str_mv | AT tejedore dataminingasaservicedmaas AT piparod dataminingasaservicedmaas AT mascettil dataminingasaservicedmaas AT moscickij dataminingasaservicedmaas AT lamannam dataminingasaservicedmaas AT matop dataminingasaservicedmaas |