Locality-Based Cache Management and Warp Scheduling for Reducing Cache Contention in GPU
GPGPUs have gradually become a mainstream acceleration component in high-performance computing. The long latency of memory operations is the main bottleneck of GPU performance. In the GPU, threads are grouped into warps for scheduling and execution. The L1 data caches have little capacity, whi...
Main Authors: | , , |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI, 2021 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8537857/ https://www.ncbi.nlm.nih.gov/pubmed/34683312 http://dx.doi.org/10.3390/mi12101262 |