Cargando…

Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors

Real time data processing is an important component of particle physics experiments with large computing resource requirements. As the Large Hadron Collider (LHC) at CERN is preparing for its next upgrade the LHCb experiment is upgrading its detector for a 30x increase in data throughput. In prepara...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fernández, Plácido, del Rio Astorga, David, Dolz, Manuel F, Fernández, Javier, Awile, Omar, García, J Daniel
Lenguaje:	eng
Publicado:	2018
Materias:	Computing and Computers
Acceso en línea:	https://dx.doi.org/10.1109/PDP2018.2018.00121 http://cds.cern.ch/record/2677502

_version_	1780962825889382400
author	Fernández, Plácido del Rio Astorga, David Dolz, Manuel F Fernández, Javier Awile, Omar García, J Daniel
author_facet	Fernández, Plácido del Rio Astorga, David Dolz, Manuel F Fernández, Javier Awile, Omar García, J Daniel
author_sort	Fernández, Plácido
collection	CERN
description	Real time data processing is an important component of particle physics experiments with large computing resource requirements. As the Large Hadron Collider (LHC) at CERN is preparing for its next upgrade the LHCb experiment is upgrading its detector for a 30x increase in data throughput. In preparation for this upgrade the experiment is considering a number of architectural improvements encompassing both its software and hardware infrastructure. One of the hardware platforms under consideration is the Intel Xeon-Phi Knights Landing processor. Thanks to its on-package high-bandwidth memory and many-core architecture it offers an interesting alternative to more traditional server systems. We present a scalable, multi-threaded and NUMA-aware Kalman filter proto-application for particle track fitting expressed in terms of generic parallel patterns using the GrPPI interface. We show how code maintainability and readability improves, while maintaining comparable levels of performance to the baseline implementation. This is achieved by keeping the parallel algorithms in the underlying framework generic, but topology aware through the use of the Portable Hardware Locality (hwloc) library, which allows us to target different architectures with the same program. We measure the performance of our topology-aware GrPPI Kalman filter implementation on the Intel Xeon-Phi Knights Landing platform and conclude on the feasibility of integrating such high-level parallelization libraries in complex software frameworks such as LHCb's Gaudi framework.
id	oai-inspirehep.net-1689274
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2018
record_format	invenio
spelling	oai-inspirehep.net-16892742019-09-30T06:29:59Zdoi:10.1109/PDP2018.2018.00121http://cds.cern.ch/record/2677502engFernández, Plácidodel Rio Astorga, DavidDolz, Manuel FFernández, JavierAwile, OmarGarcía, J DanielParallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL ProcessorsComputing and ComputersReal time data processing is an important component of particle physics experiments with large computing resource requirements. As the Large Hadron Collider (LHC) at CERN is preparing for its next upgrade the LHCb experiment is upgrading its detector for a 30x increase in data throughput. In preparation for this upgrade the experiment is considering a number of architectural improvements encompassing both its software and hardware infrastructure. One of the hardware platforms under consideration is the Intel Xeon-Phi Knights Landing processor. Thanks to its on-package high-bandwidth memory and many-core architecture it offers an interesting alternative to more traditional server systems. We present a scalable, multi-threaded and NUMA-aware Kalman filter proto-application for particle track fitting expressed in terms of generic parallel patterns using the GrPPI interface. We show how code maintainability and readability improves, while maintaining comparable levels of performance to the baseline implementation. This is achieved by keeping the parallel algorithms in the underlying framework generic, but topology aware through the use of the Portable Hardware Locality (hwloc) library, which allows us to target different architectures with the same program. We measure the performance of our topology-aware GrPPI Kalman filter implementation on the Intel Xeon-Phi Knights Landing platform and conclude on the feasibility of integrating such high-level parallelization libraries in complex software frameworks such as LHCb's Gaudi framework.oai:inspirehep.net:16892742018
spellingShingle	Computing and Computers Fernández, Plácido del Rio Astorga, David Dolz, Manuel F Fernández, Javier Awile, Omar García, J Daniel Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title	Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_full	Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_fullStr	Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_full_unstemmed	Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_short	Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors
title_sort	parallelizing and optimizing lhcb-kalman for intel xeon phi knl processors
topic	Computing and Computers
url	https://dx.doi.org/10.1109/PDP2018.2018.00121 http://cds.cern.ch/record/2677502
work_keys_str_mv	AT fernandezplacido parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT delrioastorgadavid parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT dolzmanuelf parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT fernandezjavier parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT awileomar parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors AT garciajdaniel parallelizingandoptimizinglhcbkalmanforintelxeonphiknlprocessors

Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors

Ejemplares similares