Cargando…

Performant programming for GPUs

Mostrar otras versiones (1)

Programming for Heterogeneous Architectures - lecture 3 - Data locality, coalesced memory accesses, tiled data processing - GPU streams, pipelined memory transfers - Under the hood: branchless, warps, masked execution - Debugging and profiling a GPU application

Detalles Bibliográficos
Autor principal:	Campora, Daniel
Lenguaje:	eng
Publicado:	2021
Materias:	Thematic CSC
Acceso en línea:	http://cds.cern.ch/record/2773476

Ejemplares similares

Programming for GPUs
por: vom Bruch, Dorothea
Publicado: (2021)

Design patterns and best practices
por: Campora, Daniel
Publicado: (2021)

Modern programming languages for HEP
por: Ponce, Sebastien
Publicado: (2021)

Practical vectorization
por: Ponce, Sebastien
Publicado: (2021)

Parallel and optimised scientific software - exercise introduction
por: Ponce, Sebastien
Publicado: (2021)

Scientific computing on heterogeneous architectures
por: vom Bruch, Dorothea
Publicado: (2021)

Introduction to CERN School of Computing
por: Lopienski, Sebastian
Publicado: (2021)

Preparing for the HL-LHC computational challenge
por: Piparo, Danilo
Publicado: (2021)

Writing parallel software
por: Piparo, Danilo
Publicado: (2021)

Optimizing existing large codebase
por: Ponce, Sebastien
Publicado: (2021)

Special evening talk: Future of the Universe and of Humanity
por: Puljak, Ivica
Publicado: (2021)

Matrix Element Regression with Deep Neural Networks -- breaking the CPU barrier
por: Bury, Florian
Publicado: (2021)

PyTorch C++ API
por: Brunner, David
Publicado: (2021)

FRED: a fast Monte Carlo code on GPU for Treatment Planning Software
por: De Simoni, Micol
Publicado: (2021)

Track reconstruction on heterogeneous architectures with SYCL
por: Sobol, Bartosz
Publicado: (2021)

Exploring Heterogeneous Architectures in Track Reconstruction Software
por: Mania, Georgiana
Publicado: (2021)

Parallel and optimised scientific software - exercise debriefing
por: Ponce, Sebastien
Publicado: (2021)

A practical approach to Convolutional Neural Networks (lecture)
por: Campora Perez, Daniel Hugo
Publicado: (2019)

Let your machine do the learning - Lecture 1
por: Campora Perez, Daniel Hugo
Publicado: (2017)

Let your machine do the learning - Lecture 2
por: Campora Perez, Daniel Hugo
Publicado: (2017)

inverted CERN School of Computing 2008
por: CERN. Geneva
Publicado: (2008)

Programming Paradigms (lecture)
por: Lieret, Kilian
Publicado: (2020)

Multiplatform Programming with Python
por: Kicsiny, Peter
Publicado: (2023)

From sequential to parallel programming with patterns
por: Fernandez Declara, Placido
Publicado: (2018)

Programming Paradigms and Software Design Patterns (exercise consultation)
por: Lieret, Kilian
Publicado: (2020)

CPU Performance Profiling on Linux in the HEP Context
por: Kabadzhov, Ivan
Publicado: (2023)

Modern C++ vs. its legacy: when stability is more important than performance (lecture)
por: Meinert, Nis
Publicado: (2020)

Applying natural evolution for solving computational problems - Lecture 1
por: Lanza Garcia, Daniel
Publicado: (2017)

Applying natural evolution for solving computational problems - Lecture 2
por: Lanza Garcia, Daniel
Publicado: (2017)

I - Event reconstruction in Modern Particle Physics
por: SAUNDERS, Daniel Martin
Publicado: (2016)

II - Event reconstruction in Modern Particle Physics
por: SAUNDERS, Daniel Martin
Publicado: (2016)

Use of GPUs in the LHCb trigger
por: Neufeld, Niko, et al.
Publicado: (2015)

Big Data Technologies and Physics Analysis with Apache Spark (lecture 2)
por: Motesnitsalis, Evangelos
Publicado: (2019)

Big Data Technologies and Physics Analysis with Apache Spark (lecture 1)
por: Motesnitsalis, Evangelos
Publicado: (2019)

Tensor Networks - Introduction and Matrix Product States (lecture 1)
por: Emonts, Patrick
Publicado: (2019)

Hardware Acceleration Through FPGAs - Basics of VHDL (lecture 2)
por: Lopez, Giorgio
Publicado: (2019)

Hardware Acceleration Through FPGAs - Basic Concepts (lecture 1)
por: Lopez, Giorgio
Publicado: (2019)

Introduction to the Inverted CSC
por: Lopienski, Sebastian
Publicado: (2019)

How container orchestration can strengthen your micro-services: the approach of Kubernetes (lecture)
por: Poggi, Riccardo
Publicado: (2019)

Efficient C++ implementation of custom FEM kernel with Eigen
por: Sizov, Mikhail
Publicado: (2019)

Cannot write session to /tmp/vufind_sessions/sess_3aokgntrbmh5cqq6g3q9sm3qgq