Cargando…

Challenges and opportunities integrating LLAMA into AdePT

Particle transport simulations are a cornerstone of high-energy physics (HEP), constituting a substantial part of the computing workload performed in HEP. To boost the simulation throughput and energy efficiency, GPUs as accelerators have been explored in recent years, further driven by the increasi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gruber, Bernhard Manfred, Amadio, Guilherme, Hageböck, Stephan
Lenguaje:	eng
Publicado:	2023
Materias:	physics.comp-ph Other Fields of Physics hep-ex Particle Physics - Experiment
Acceso en línea:	http://cds.cern.ch/record/2856531

_version_	1780977516577554432
author	Gruber, Bernhard Manfred Amadio, Guilherme Hageböck, Stephan
author_facet	Gruber, Bernhard Manfred Amadio, Guilherme Hageböck, Stephan
author_sort	Gruber, Bernhard Manfred
collection	CERN
description	Particle transport simulations are a cornerstone of high-energy physics (HEP), constituting a substantial part of the computing workload performed in HEP. To boost the simulation throughput and energy efficiency, GPUs as accelerators have been explored in recent years, further driven by the increasing use of GPUs on HPCs. The Accelerated demonstrator of electromagnetic Particle Transport (AdePT) is an advanced prototype for offloading the simulation of electromagnetic showers in Geant4 to GPUs, and still undergoes continuous development and optimization. Improving memory layout and data access is vital to use modern, massively parallel GPU hardware efficiently, contributing to the challenge of migrating traditional CPU based data structures to GPUs in AdePT. The low-level abstraction of memory access (LLAMA) is a C++ library that provides a zero-runtime-overhead data structure abstraction layer, focusing on multidimensional arrays of nested, structured data. It provides a framework for defining and switching custom memory mappings at compile time to define data layouts and instrument data access, making LLAMA an ideal tool to tackle the memory-related optimization challenges in AdePT. Our contribution shares insights gained with LLAMA when instrumenting data access inside AdePT, complementing traditional GPU profiler outputs. We demonstrate traces of read/write counts to data structure elements as well as memory heatmaps. The acquired knowledge allowed for subsequent data layout optimizations.
id	cern-2856531
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2023
record_format	invenio
spelling	cern-28565312023-06-29T04:23:28Zhttp://cds.cern.ch/record/2856531engGruber, Bernhard ManfredAmadio, GuilhermeHageböck, StephanChallenges and opportunities integrating LLAMA into AdePTphysics.comp-phOther Fields of Physicshep-exParticle Physics - ExperimentParticle transport simulations are a cornerstone of high-energy physics (HEP), constituting a substantial part of the computing workload performed in HEP. To boost the simulation throughput and energy efficiency, GPUs as accelerators have been explored in recent years, further driven by the increasing use of GPUs on HPCs. The Accelerated demonstrator of electromagnetic Particle Transport (AdePT) is an advanced prototype for offloading the simulation of electromagnetic showers in Geant4 to GPUs, and still undergoes continuous development and optimization. Improving memory layout and data access is vital to use modern, massively parallel GPU hardware efficiently, contributing to the challenge of migrating traditional CPU based data structures to GPUs in AdePT. The low-level abstraction of memory access (LLAMA) is a C++ library that provides a zero-runtime-overhead data structure abstraction layer, focusing on multidimensional arrays of nested, structured data. It provides a framework for defining and switching custom memory mappings at compile time to define data layouts and instrument data access, making LLAMA an ideal tool to tackle the memory-related optimization challenges in AdePT. Our contribution shares insights gained with LLAMA when instrumenting data access inside AdePT, complementing traditional GPU profiler outputs. We demonstrate traces of read/write counts to data structure elements as well as memory heatmaps. The acquired knowledge allowed for subsequent data layout optimizations.arXiv:2302.08252oai:cds.cern.ch:28565312023-02-16
spellingShingle	physics.comp-ph Other Fields of Physics hep-ex Particle Physics - Experiment Gruber, Bernhard Manfred Amadio, Guilherme Hageböck, Stephan Challenges and opportunities integrating LLAMA into AdePT
title	Challenges and opportunities integrating LLAMA into AdePT
title_full	Challenges and opportunities integrating LLAMA into AdePT
title_fullStr	Challenges and opportunities integrating LLAMA into AdePT
title_full_unstemmed	Challenges and opportunities integrating LLAMA into AdePT
title_short	Challenges and opportunities integrating LLAMA into AdePT
title_sort	challenges and opportunities integrating llama into adept
topic	physics.comp-ph Other Fields of Physics hep-ex Particle Physics - Experiment
url	http://cds.cern.ch/record/2856531
work_keys_str_mv	AT gruberbernhardmanfred challengesandopportunitiesintegratingllamaintoadept AT amadioguilherme challengesandopportunitiesintegratingllamaintoadept AT hagebockstephan challengesandopportunitiesintegratingllamaintoadept

Challenges and opportunities integrating LLAMA into AdePT

Ejemplares similares