Locality-Based Cache Management and Warp Scheduling for Reducing Cache Contention in GPU

GPGPUs have gradually become a mainstream acceleration component in high-performance computing. The long latency of memory operations is the bottleneck of GPU performance. In the GPU, multiple threads are grouped into a warp for scheduling and execution. The L1 data caches have little capacity, whi...

Bibliographic Details
Main authors:	Fang, Juan; Wei, Zelin; Yang, Huijing
Format:	Online Article Text
Language:	English
Published:	MDPI 2021
Subjects:	Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8537857/ https://www.ncbi.nlm.nih.gov/pubmed/34683312 http://dx.doi.org/10.3390/mi12101262
