Cargando…

II - Template Metaprogramming for Massively Parallel Scientific Computing - Vectorization with Expression Templates

<!--HTML-->Large scale scientific computing raises questions on different levels ranging from the fomulation of the problems to the choice of the best algorithms and their implementation for a specific platform. There are similarities in these different topics that can be exploited by modern-s...

Descripción completa

Detalles Bibliográficos
Autor principal: VYSKOČIL, Jiří
Lenguaje:eng
Publicado: 2016
Materias:
Acceso en línea:http://cds.cern.ch/record/2135871
_version_ 1780949950393221120
author VYSKOČIL, Jiří
author_facet VYSKOČIL, Jiří
author_sort VYSKOČIL, Jiří
collection CERN
description <!--HTML-->Large scale scientific computing raises questions on different levels ranging from the fomulation of the problems to the choice of the best algorithms and their implementation for a specific platform. There are similarities in these different topics that can be exploited by modern-style C++ template metaprogramming techniques to produce readable, maintainable and generic code. Traditional low-level code tend to be fast but platform-dependent, and it obfuscates the meaning of the algorithm. On the other hand, object-oriented approach is nice to read, but may come with an inherent performance penalty. These lectures aim to present he basics of the Expression Template (ET) idiom which allows us to keep the object-oriented approach without sacrificing performance. We will in particular show to to enhance ET to include SIMD vectorization. We will then introduce techniques for abstracting iteration, and introduce thread-level parallelism for use in heavy data-centric loads. We will show to to apply these methods in a way which keeps the "front end" code very readable. --- LECTURE 2 In this lecture, we will have a closer look at the opportunities for implementing SIMD vectorisation through the Expression Template idiom. We will see how it can create a layer of separation between the algorithm, and the low-level implementation. We will use the C++ template mechanisms to accommodate our program so that the algorithm itself doesn't need to explicitly specify SIMD-related types alignment, or operations. We will also explore how our memory data structure layout affects SIMD performance in different workloads, and introduce methods which improve performance in specific cases.
id cern-2135871
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2016
record_format invenio
spelling cern-21358712022-11-02T22:32:26Zhttp://cds.cern.ch/record/2135871engVYSKOČIL, JiříII - Template Metaprogramming for Massively Parallel Scientific Computing - Vectorization with Expression Templatesinverted CERN School of Computing 2016inverted CSC<!--HTML-->Large scale scientific computing raises questions on different levels ranging from the fomulation of the problems to the choice of the best algorithms and their implementation for a specific platform. There are similarities in these different topics that can be exploited by modern-style C++ template metaprogramming techniques to produce readable, maintainable and generic code. Traditional low-level code tend to be fast but platform-dependent, and it obfuscates the meaning of the algorithm. On the other hand, object-oriented approach is nice to read, but may come with an inherent performance penalty. These lectures aim to present he basics of the Expression Template (ET) idiom which allows us to keep the object-oriented approach without sacrificing performance. We will in particular show to to enhance ET to include SIMD vectorization. We will then introduce techniques for abstracting iteration, and introduce thread-level parallelism for use in heavy data-centric loads. We will show to to apply these methods in a way which keeps the "front end" code very readable. --- LECTURE 2 In this lecture, we will have a closer look at the opportunities for implementing SIMD vectorisation through the Expression Template idiom. We will see how it can create a layer of separation between the algorithm, and the low-level implementation. We will use the C++ template mechanisms to accommodate our program so that the algorithm itself doesn't need to explicitly specify SIMD-related types alignment, or operations. We will also explore how our memory data structure layout affects SIMD performance in different workloads, and introduce methods which improve performance in specific cases.oai:cds.cern.ch:21358712016
spellingShingle inverted CSC
VYSKOČIL, Jiří
II - Template Metaprogramming for Massively Parallel Scientific Computing - Vectorization with Expression Templates
title II - Template Metaprogramming for Massively Parallel Scientific Computing - Vectorization with Expression Templates
title_full II - Template Metaprogramming for Massively Parallel Scientific Computing - Vectorization with Expression Templates
title_fullStr II - Template Metaprogramming for Massively Parallel Scientific Computing - Vectorization with Expression Templates
title_full_unstemmed II - Template Metaprogramming for Massively Parallel Scientific Computing - Vectorization with Expression Templates
title_short II - Template Metaprogramming for Massively Parallel Scientific Computing - Vectorization with Expression Templates
title_sort ii - template metaprogramming for massively parallel scientific computing - vectorization with expression templates
topic inverted CSC
url http://cds.cern.ch/record/2135871
work_keys_str_mv AT vyskociljiri iitemplatemetaprogrammingformassivelyparallelscientificcomputingvectorizationwithexpressiontemplates
AT vyskociljiri invertedcernschoolofcomputing2016