
Performant programming for GPUs

Programming for Heterogeneous Architectures - lecture 3 - Data locality, coalesced memory accesses, tiled data processing - GPU streams, pipelined memory transfers - Under the hood: branchless code, warps, masked execution - Debugging and profiling a GPU application

Bibliographic Details
Main author: Campora, Daniel
Language: eng
Published: 2021
Online access: http://cds.cern.ch/record/2773476

Similar items