Cargando…

Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors

Real time data processing is an important component of particle physics experiments with large computing resource requirements. As the Large Hadron Collider (LHC) at CERN is preparing for its next upgrade the LHCb experiment is upgrading its detector for a 30x increase in data throughput. In prepara...

Descripción completa

Detalles Bibliográficos
Autores principales: Fernández, Plácido, del Rio Astorga, David, Dolz, Manuel F, Fernández, Javier, Awile, Omar, García, J Daniel
Lenguaje:eng
Publicado: 2018
Materias:
Acceso en línea:https://dx.doi.org/10.1109/PDP2018.2018.00121
http://cds.cern.ch/record/2677502
_version_ 1780962825889382400
author Fernández, Plácido
del Rio Astorga, David
Dolz, Manuel F
Fernández, Javier
Awile, Omar
García, J Daniel
author_facet Fernández, Plácido
del Rio Astorga, David
Dolz, Manuel F
Fernández, Javier
Awile, Omar
García, J Daniel
author_sort Fernández, Plácido
collection CERN
description Real time data processing is an important component of particle physics experiments with large computing resource requirements. As the Large Hadron Collider (LHC) at CERN is preparing for its next upgrade the LHCb experiment is upgrading its detector for a 30x increase in data throughput. In preparation for this upgrade the experiment is considering a number of architectural improvements encompassing both its software and hardware infrastructure. One of the hardware platforms under consideration is the Intel Xeon-Phi Knights Landing processor. Thanks to its on-package high-bandwidth memory and many-core architecture it offers an interesting alternative to more traditional server systems. We present a scalable, multi-threaded and NUMA-aware Kalman filter proto-application for particle track fitting expressed in terms of generic parallel patterns using the GrPPI interface. We show how code maintainability and readability improves, while maintaining comparable levels of performance to the baseline implementation. This is achieved by keeping the parallel algorithms in the underlying framework generic, but topology aware through the use of the Portable Hardware Locality (hwloc) library, which allows us to target different architectures with the same program. We measure the performance of our topology-aware GrPPI Kalman filter implementation on the Intel Xeon-Phi Knights Landing platform and conclude on the feasibility of integrating such high-level parallelization libraries in complex software frameworks such as LHCb's Gaudi framework.
id oai-inspirehep.net-1689274
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2018
record_format invenio
spelling oai-inspirehep.net-16892742019-09-30T06:29:59Zdoi:10.1109/PDP2018.2018.00121http://cds.cern.ch/record/2677502engFernández, Plácidodel Rio Astorga, DavidDolz, Manuel FFernández, JavierAwile, OmarGarcía, J DanielParallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL ProcessorsComputing and ComputersReal time data processing is an important component of particle physics experiments with large computing resource requirements. As the Large Hadron Collider (LHC) at CERN is preparing for its next upgrade the LHCb experiment is upgrading its detector for a 30x increase in data throughput. In preparation for this upgrade the experiment is considering a number of architectural improvements encompassing both its software and hardware infrastructure. One of the hardware platforms under consideration is the Intel Xeon-Phi Knights Landing processor. Thanks to its on-package high-bandwidth memory and many-core architecture it offers an interesting alternative to more traditional server systems. We present a scalable, multi-threaded and NUMA-aware Kalman filter proto-application for particle track fitting expressed in terms of generic parallel patterns using the GrPPI interface. We show how code maintainability and readability improves, while maintaining comparable levels of performance to the baseline implementation. This is achieved by keeping the parallel algorithms in the underlying framework generic, but topology aware through the use of the Portable Hardware Locality (hwloc) library, which allows us to target different architectures with the same program. We measure the performance of our topology-aware GrPPI Kalman filter implementation on the Intel Xeon-Phi Knights Landing platform and conclude on the feasibility of integrating such high-level parallelization libraries in complex software frameworks such as LHCb's Gaudi framework.oai:inspirehep.net:16892742018
spellingShingle Computing and Computers
Fernández, Plácido
del Rio Astorga, David
Dolz, Manuel F
Fernández, Javier
Awile, Omar
García, J Daniel
Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_full Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_fullStr Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_full_unstemmed Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_short Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_sort parallelizing and optimizing lhcb-kalman for intel xeon phi knl processors
topic Computing and Computers
url https://dx.doi.org/10.1109/PDP2018.2018.00121
http://cds.cern.ch/record/2677502
work_keys_str_mv AT fernandezplacido parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors
AT delrioastorgadavid parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors
AT dolzmanuelf parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors
AT fernandezjavier parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors
AT awileomar parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors
AT garciajdaniel parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors