Cargando…
A journey over the memory managment stack for HPC large applications on moderne architectures
<!--HTML-->Memory managment has always been an issue for large application but the increase of memory space and intra-node thread-based parallelism now put lot more pressure on this complex part of the operating system stack. Althrough there is a long tradition of algorithm developpements on t...
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2019
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2691972 |
_version_ | 1780963919988260864 |
---|---|
author | Valat, Sébastien |
author_facet | Valat, Sébastien |
author_sort | Valat, Sébastien |
collection | CERN |
description | <!--HTML-->Memory managment has always been an issue for large application but the increase of memory space and intra-node thread-based parallelism now put lot more pressure on this complex part of the operating system stack. Althrough there is a long tradition of algorithm developpements on this topic with behind 60 years of research there is still a lot to do.
This is even more true in large scale application where the size of the code (target was a million line C++/MPI app) and global complexity is a big limitation to apply what should theoritically be the clean way to proceed. We also today need to make global optimization to make the wall stack well interacting not letting a component breaking the performance gained by the top or bottom one.
After making a PhD. on memory management in HPC mostly arround a malloc implementation and various kernels memory managment studies for supercomputers and NUMA architectures I pursued as a post-doc developping a memory profiling tool: MALT. During my time at CERN I added to the list NUMAPROF a NUMA memory profiling tool.
I can over this talk recap the 9 years road I walked on with experience feedback showing sometimes impressive performance gaps on large real applications by considering the path from CPU caches, NUMA layout going through the OS paging system and malloc implementation closing by profiling real applications. I will try to glue the full picture showing the need to keep the global picture to really reach performance. |
id | cern-2691972 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2019 |
record_format | invenio |
spelling | cern-26919722022-11-02T22:24:40Zhttp://cds.cern.ch/record/2691972engValat, SébastienA journey over the memory managment stack for HPC large applications on moderne architecturesIXPUG 2019 Annual Conference at CERNother events or meetings<!--HTML-->Memory managment has always been an issue for large application but the increase of memory space and intra-node thread-based parallelism now put lot more pressure on this complex part of the operating system stack. Althrough there is a long tradition of algorithm developpements on this topic with behind 60 years of research there is still a lot to do. This is even more true in large scale application where the size of the code (target was a million line C++/MPI app) and global complexity is a big limitation to apply what should theoritically be the clean way to proceed. We also today need to make global optimization to make the wall stack well interacting not letting a component breaking the performance gained by the top or bottom one. After making a PhD. on memory management in HPC mostly arround a malloc implementation and various kernels memory managment studies for supercomputers and NUMA architectures I pursued as a post-doc developping a memory profiling tool: MALT. During my time at CERN I added to the list NUMAPROF a NUMA memory profiling tool. I can over this talk recap the 9 years road I walked on with experience feedback showing sometimes impressive performance gaps on large real applications by considering the path from CPU caches, NUMA layout going through the OS paging system and malloc implementation closing by profiling real applications. I will try to glue the full picture showing the need to keep the global picture to really reach performance.oai:cds.cern.ch:26919722019 |
spellingShingle | other events or meetings Valat, Sébastien A journey over the memory managment stack for HPC large applications on moderne architectures |
title | A journey over the memory managment stack for HPC large applications on moderne architectures |
title_full | A journey over the memory managment stack for HPC large applications on moderne architectures |
title_fullStr | A journey over the memory managment stack for HPC large applications on moderne architectures |
title_full_unstemmed | A journey over the memory managment stack for HPC large applications on moderne architectures |
title_short | A journey over the memory managment stack for HPC large applications on moderne architectures |
title_sort | journey over the memory managment stack for hpc large applications on moderne architectures |
topic | other events or meetings |
url | http://cds.cern.ch/record/2691972 |
work_keys_str_mv | AT valatsebastien ajourneyoverthememorymanagmentstackforhpclargeapplicationsonmodernearchitectures AT valatsebastien ixpug2019annualconferenceatcern AT valatsebastien journeyoverthememorymanagmentstackforhpclargeapplicationsonmodernearchitectures |