Cargando…

Fast parallel event reconstruction

<!--HTML--><p align="justify">On-line processing of large data volumes produced in modern HEP experiments requires using maximum capabilities of modern and future many-core CPU and GPU architectures.</p><p align="justify">One of such powerful feature is a...

Descripción completa

Detalles Bibliográficos
Autor principal: Kisel, Ivan
Lenguaje:eng
Publicado: 2010
Materias:
Acceso en línea:http://cds.cern.ch/record/1332180
_version_ 1780921759934971904
author Kisel, Ivan
author_facet Kisel, Ivan
author_sort Kisel, Ivan
collection CERN
description <!--HTML--><p align="justify">On-line processing of large data volumes produced in modern HEP experiments requires using maximum capabilities of modern and future many-core CPU and GPU architectures.</p><p align="justify">One of such powerful feature is a SIMD instruction set, which allows packing several data items in one register and to operate on all of them, thus achievingmore operations per clock cycle. Motivated by the idea of using the SIMD unit ofmodern processors, the KF based track fit has been adapted for parallelism, including memory optimization, numerical analysis, vectorization with inline operator overloading, and optimization using SDKs. The speed of the algorithm has been increased in 120000&nbsp;times with 0.1&nbsp;ms/track, running in parallel on 16&nbsp;SPEs of a Cell Blade computer.&nbsp;&nbsp;Running on a Nehalem CPU with 8&nbsp;cores it shows the processing speed of 52 ns/track using the Intel Threading Building Blocks.&nbsp;The same KF algorithm running on an Nvidia GTX&nbsp;280 in the CUDA frameworkprovides a plane throughput of 22&nbsp;tracks/ms.In addition, a many-core architecture code named Larrabee can be considered an interesting platform to further scale the Kalman filter in the threading and vectorization dimensions. Less architecture-dependent programming frameworks, such as OpenCL and Intel Ct,may also better support future changes in architecture. Thus, for example, the KF algorithm demonstrates a linear many-core scalability being implemented in the Intel Ct parallel language.</p><p align="justify">The fully SIMDized CA track finder of the future heavy-ion CBM experiment (FAIR/GSI) with the included SIMD KF track fit shows the full reconstruction efficiency of 92%. High energetic particles have the reconstruction efficiency of 97%. The efficiency of low energetic tracks is 82% due to significant multiple scattering in the detector material. The level of ghost tracks is only about 3%. The CA track finder demonstrates the maximum throughput of 150&nbsp;centralor 1100&nbsp;minimum bias events/s running on a Nehalem CPU with 8&nbsp;cores. The strong many-core scalability of the CA track finder makes possible to keep the reconstruction at the event-level parallelism.</p><p align="justify">More details on parallelism of the event reconstruction algorithms of the CBM, as well as ALICE and STAR experiments will be presented and discussed.</p><p align="justify">A short overview of the &quot;Workshop for Future Challenges in Tracking and Trigger Concepts&quot; (GSI, Germany, 07-11 June, 2010) will be also given.</p>
id cern-1332180
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2010
record_format invenio
spelling cern-13321802022-11-02T22:30:18Zhttp://cds.cern.ch/record/1332180engKisel, IvanFast parallel event reconstructionFast parallel event reconstructionComputing Seminar<!--HTML--><p align="justify">On-line processing of large data volumes produced in modern HEP experiments requires using maximum capabilities of modern and future many-core CPU and GPU architectures.</p><p align="justify">One of such powerful feature is a SIMD instruction set, which allows packing several data items in one register and to operate on all of them, thus achievingmore operations per clock cycle. Motivated by the idea of using the SIMD unit ofmodern processors, the KF based track fit has been adapted for parallelism, including memory optimization, numerical analysis, vectorization with inline operator overloading, and optimization using SDKs. The speed of the algorithm has been increased in 120000&nbsp;times with 0.1&nbsp;ms/track, running in parallel on 16&nbsp;SPEs of a Cell Blade computer.&nbsp;&nbsp;Running on a Nehalem CPU with 8&nbsp;cores it shows the processing speed of 52 ns/track using the Intel Threading Building Blocks.&nbsp;The same KF algorithm running on an Nvidia GTX&nbsp;280 in the CUDA frameworkprovides a plane throughput of 22&nbsp;tracks/ms.In addition, a many-core architecture code named Larrabee can be considered an interesting platform to further scale the Kalman filter in the threading and vectorization dimensions. Less architecture-dependent programming frameworks, such as OpenCL and Intel Ct,may also better support future changes in architecture. Thus, for example, the KF algorithm demonstrates a linear many-core scalability being implemented in the Intel Ct parallel language.</p><p align="justify">The fully SIMDized CA track finder of the future heavy-ion CBM experiment (FAIR/GSI) with the included SIMD KF track fit shows the full reconstruction efficiency of 92%. High energetic particles have the reconstruction efficiency of 97%. The efficiency of low energetic tracks is 82% due to significant multiple scattering in the detector material. The level of ghost tracks is only about 3%. The CA track finder demonstrates the maximum throughput of 150&nbsp;centralor 1100&nbsp;minimum bias events/s running on a Nehalem CPU with 8&nbsp;cores. The strong many-core scalability of the CA track finder makes possible to keep the reconstruction at the event-level parallelism.</p><p align="justify">More details on parallelism of the event reconstruction algorithms of the CBM, as well as ALICE and STAR experiments will be presented and discussed.</p><p align="justify">A short overview of the &quot;Workshop for Future Challenges in Tracking and Trigger Concepts&quot; (GSI, Germany, 07-11 June, 2010) will be also given.</p>oai:cds.cern.ch:13321802010
spellingShingle Computing Seminar
Kisel, Ivan
Fast parallel event reconstruction
title Fast parallel event reconstruction
title_full Fast parallel event reconstruction
title_fullStr Fast parallel event reconstruction
title_full_unstemmed Fast parallel event reconstruction
title_short Fast parallel event reconstruction
title_sort fast parallel event reconstruction
topic Computing Seminar
url http://cds.cern.ch/record/1332180
work_keys_str_mv AT kiselivan fastparalleleventreconstruction