A high-performance portable abstract interface for explicit SIMD vectorization
This work establishes a scalable, easy-to-use, and efficient approach for exploiting the SIMD capabilities of modern CPUs without requiring extensive knowledge of architecture-specific instruction sets. We describe a new API, known as UME::SIMD, which provides a flexible, portable, ty...
Main authors: | Karpiński, P, McDonald, J |
---|---|
Language: | eng |
Published: | 2017 |
Subjects: | |
Online access: | https://dx.doi.org/10.1145/3026937.3026939 http://cds.cern.ch/record/2318246 |
Similar items
- Implementation of Cholesky Decomposition with SIMD vectorization
  by: Balasubramanian, Rahul
  Published: (2016)
- SIMD Algorithms
  by: Parkinson, D
  Published: (1992)
- SIMD Processor Arrays
  by: Parkinson, D
  Published: (1992)
- Small SIMD matrices for CERN high throughput computing
  by: Lemaitre, Florian, et al.
  Published: (2018)
- Cholesky factorization on SIMD multi-core architectures
  by: Lemaitre, Florian, et al.
  Published: (2017)