ATLAS Analytics and Machine Learning Platforms

In 2015 ATLAS Distributed Computing started to migrate its monitoring systems away from Oracle DB and decided to adopt new big data platforms that are open source, horizontally scalable, and offer the flexibility of NoSQL systems. Three years later, the full software stack is in place, the system is...

Bibliographic Details
Main authors: Vukotic, Ilija; Barberis, Dario; Legger, Federica; Gardner, Robert
Language: eng
Published: 2018
Subjects: Particle Physics - Experiment
Online access: http://cds.cern.ch/record/2627020
author Vukotic, Ilija
Barberis, Dario
Legger, Federica
Gardner, Robert
collection CERN
description In 2015 ATLAS Distributed Computing started to migrate its monitoring systems away from Oracle DB and decided to adopt new big data platforms that are open source, horizontally scalable, and offer the flexibility of NoSQL systems. Three years later, the full software stack is in place, and the system is considered to be in production and operating at near maximum capacity (in terms of storage capacity and tightly coupled analysis capability). The new model provides several tools for fast, easy-to-deploy monitoring and accounting. Its main advantages are ample ways to do complex analytics studies (using technologies such as Java, Pig, Spark, Python, and Jupyter), flexibility in reorganizing data flows, and near-real-time and inline processing. The analytics studies improve our understanding of the different computing systems and their interplay, thus enabling whole-system debugging and optimization. In addition, the platform provides services that raise alarms or warnings on anomalous conditions, and several services that close feedback loops with the Distributed Computing systems. Here we briefly describe the main system components and data flows, but concentrate on the hardware and software tools we use for in-depth analytics and simulations and for supporting machine learning algorithms, specifically artificial neural network training and reinforcement learning techniques. We describe several applications the platform enables and discuss ways to scale it up further.
id cern-2627020
institution European Organization for Nuclear Research
language eng
publishDate 2018
record_format invenio
spelling cern-2627020 | 2019-09-30T06:29:59Z | http://cds.cern.ch/record/2627020 | eng | Vukotic, Ilija; Barberis, Dario; Legger, Federica; Gardner, Robert | ATLAS Analytics and Machine Learning Platforms | Particle Physics - Experiment | ATL-SOFT-SLIDE-2018-417 | oai:cds.cern.ch:2627020 | 2018-06-28
title ATLAS Analytics and Machine Learning Platforms
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2627020