Cargando…

Locality-Based Cache Management and Warp Scheduling for Reducing Cache Contention in GPU

GPGPUs has gradually become a mainstream acceleration component in high-performance computing. The long latency of memory operations is the bottleneck of GPU performance. In the GPU, multiple threads are divided into one warp for scheduling and execution. The L1 data caches have little capacity, whi...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Juan, Wei, Zelin, Yang, Huijing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8537857/
https://www.ncbi.nlm.nih.gov/pubmed/34683312
http://dx.doi.org/10.3390/mi12101262