Performant programming for GPUs
Programming for Heterogeneous Architectures - lecture 3 - Data locality, coalesced memory accesses, tiled data processing - GPU streams, pipelined memory transfers - Under the hood: branchless, warps, masked execution - Debugging and profiling a GPU application
Main author: | |
---|---|
Language: | eng |
Published: | 2021 |
Subjects: | |
Online access: | http://cds.cern.ch/record/2773476 |
Summary: | Programming for Heterogeneous Architectures - lecture 3 - Data locality, coalesced memory accesses, tiled data processing - GPU streams, pipelined memory transfers - Under the hood: branchless, warps, masked execution - Debugging and profiling a GPU application |