BLAS3 optimization for the Godson-3B1500
This paper proposes a performance model for general matrix multiplication (GEMM) on decoupled access/execute (DAE) architecture platforms, in order to guide improvements of the GEMM performance in the Godson-3B1500. This model focuses on the features of access processors (APs) and execute processors...
Main authors: | Zhang, Ming; Gu, Naijie; Ren, Kaixin |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Springer International Publishing, 2016 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5122567/ https://www.ncbi.nlm.nih.gov/pubmed/27933269 http://dx.doi.org/10.1186/s40064-016-3690-3 |
Similar items
- Blas Galindo
  by: Galindo, Blas, 1910-1993
  Published: (1987)
- The sparse BLAS
  by: Duff, I S, et al.
  Published: (2001)
- LINPACK working note ; 3, Fortran BLAS timing
  by: Dongarra, J J
  Published: (1980)
- A blocked implementation of level 3 BLAS for RISC processors
  by: Daydé, M J, et al.
  Published: (1996)
- San Blas de Nayarit
  Published: (1993)