Cargando…
Performant programming for GPUs
<!--HTML-->Programming for Heterogeneous Architectures - lecture 3 - Data locality, coalesced memory accesses, tiled data processing - GPU streams, pipelined memory transfers - Under the hood: branchless, warps, masked execution - Debugging and profiling a GPU application
Autor principal: | Campora, Daniel |
---|---|
Lenguaje: | eng |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2773476 |
Ejemplares similares
-
Programming for GPUs
por: vom Bruch, Dorothea
Publicado: (2021) -
Design patterns and best practices
por: Campora, Daniel
Publicado: (2021) -
Modern programming languages for HEP
por: Ponce, Sebastien
Publicado: (2021) -
Practical vectorization
por: Ponce, Sebastien
Publicado: (2021) -
Parallel and optimised scientific software - exercise introduction
por: Ponce, Sebastien
Publicado: (2021)