Cargando…
Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
Real time data processing is an important component of particle physics experiments with large computing resource requirements. As the Large Hadron Collider (LHC) at CERN is preparing for its next upgrade the LHCb experiment is upgrading its detector for a 30x increase in data throughput. In prepara...
Autores principales: | , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2018
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1109/PDP2018.2018.00121 http://cds.cern.ch/record/2677502 |
_version_ | 1780962825889382400 |
---|---|
author | Fernández, Plácido del Rio Astorga, David Dolz, Manuel F Fernández, Javier Awile, Omar García, J Daniel |
author_facet | Fernández, Plácido del Rio Astorga, David Dolz, Manuel F Fernández, Javier Awile, Omar García, J Daniel |
author_sort | Fernández, Plácido |
collection | CERN |
description | Real time data processing is an important component of particle physics experiments with large computing resource requirements. As the Large Hadron Collider (LHC) at CERN is preparing for its next upgrade the LHCb experiment is upgrading its detector for a 30x increase in data throughput. In preparation for this upgrade the experiment is considering a number of architectural improvements encompassing both its software and hardware infrastructure. One of the hardware platforms under consideration is the Intel Xeon-Phi Knights Landing processor. Thanks to its on-package high-bandwidth memory and many-core architecture it offers an interesting alternative to more traditional server systems. We present a scalable, multi-threaded and NUMA-aware Kalman filter proto-application for particle track fitting expressed in terms of generic parallel patterns using the GrPPI interface. We show how code maintainability and readability improves, while maintaining comparable levels of performance to the baseline implementation. This is achieved by keeping the parallel algorithms in the underlying framework generic, but topology aware through the use of the Portable Hardware Locality (hwloc) library, which allows us to target different architectures with the same program. We measure the performance of our topology-aware GrPPI Kalman filter implementation on the Intel Xeon-Phi Knights Landing platform and conclude on the feasibility of integrating such high-level parallelization libraries in complex software frameworks such as LHCb's Gaudi framework. |
id | oai-inspirehep.net-1689274 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2018 |
record_format | invenio |
spelling | oai-inspirehep.net-16892742019-09-30T06:29:59Zdoi:10.1109/PDP2018.2018.00121http://cds.cern.ch/record/2677502engFernández, Plácidodel Rio Astorga, DavidDolz, Manuel FFernández, JavierAwile, OmarGarcía, J DanielParallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL ProcessorsComputing and ComputersReal time data processing is an important component of particle physics experiments with large computing resource requirements. As the Large Hadron Collider (LHC) at CERN is preparing for its next upgrade the LHCb experiment is upgrading its detector for a 30x increase in data throughput. In preparation for this upgrade the experiment is considering a number of architectural improvements encompassing both its software and hardware infrastructure. One of the hardware platforms under consideration is the Intel Xeon-Phi Knights Landing processor. Thanks to its on-package high-bandwidth memory and many-core architecture it offers an interesting alternative to more traditional server systems. We present a scalable, multi-threaded and NUMA-aware Kalman filter proto-application for particle track fitting expressed in terms of generic parallel patterns using the GrPPI interface. We show how code maintainability and readability improves, while maintaining comparable levels of performance to the baseline implementation. This is achieved by keeping the parallel algorithms in the underlying framework generic, but topology aware through the use of the Portable Hardware Locality (hwloc) library, which allows us to target different architectures with the same program. We measure the performance of our topology-aware GrPPI Kalman filter implementation on the Intel Xeon-Phi Knights Landing platform and conclude on the feasibility of integrating such high-level parallelization libraries in complex software frameworks such as LHCb's Gaudi framework.oai:inspirehep.net:16892742018 |
spellingShingle | Computing and Computers Fernández, Plácido del Rio Astorga, David Dolz, Manuel F Fernández, Javier Awile, Omar García, J Daniel Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors |
title | Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors |
title_full | Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors |
title_fullStr | Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors |
title_full_unstemmed | Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors |
title_short | Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors |
title_sort | parallelizing and optimizing lhcb-kalman for intel xeon phi knl processors |
topic | Computing and Computers |
url | https://dx.doi.org/10.1109/PDP2018.2018.00121 http://cds.cern.ch/record/2677502 |
work_keys_str_mv | AT fernandezplacido parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT delrioastorgadavid parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT dolzmanuelf parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT fernandezjavier parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT awileomar parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT garciajdaniel parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors |