Cargando…

Optimizing the Performance of the Sparse Matrix–Vector Multiplication Kernel in FPGA Guided by the Roofline Model

The widespread adoption of massively parallel processors over the past decade has fundamentally transformed the landscape of high-performance computing hardware. This revolution has recently driven the advancement of FPGAs, which are emerging as an attractive alternative to power-hungry many-core de...

Descripción completa

Detalles Bibliográficos
Autores principales:	Favaro, Federico, Dufrechou, Ernesto, Oliver, Juan P., Ezzatti, Pablo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673536/ https://www.ncbi.nlm.nih.gov/pubmed/38004887 http://dx.doi.org/10.3390/mi14112030

_version_	1785140645926535168
author	Favaro, Federico Dufrechou, Ernesto Oliver, Juan P. Ezzatti, Pablo
author_facet	Favaro, Federico Dufrechou, Ernesto Oliver, Juan P. Ezzatti, Pablo
author_sort	Favaro, Federico
collection	PubMed
description	The widespread adoption of massively parallel processors over the past decade has fundamentally transformed the landscape of high-performance computing hardware. This revolution has recently driven the advancement of FPGAs, which are emerging as an attractive alternative to power-hungry many-core devices in a world increasingly concerned with energy consumption. Consequently, numerous recent studies have focused on implementing efficient dense and sparse numerical linear algebra (NLA) kernels on FPGAs. To maximize the efficiency of these kernels, a key aspect is the exploration of analytical tools to comprehend the performance of the developments and guide the optimization process. In this regard, the roofline model (RLM) is a well-known graphical tool that facilitates the analysis of computational performance and identifies the primary bottlenecks of a specific software when executed on a particular hardware platform. Our previous efforts advanced in developing efficient implementations of the sparse matrix–vector multiplication (SpMV) for FPGAs, considering both speed and energy consumption. In this work, we propose an extension of the RLM that enables optimizing runtime and energy consumption for NLA kernels based on sparse blocked storage formats on FPGAs. To test the power of this tool, we use it to extend our previous SpMV kernels by leveraging a block-sparse storage format that enables more efficient data access.
format	Online Article Text
id	pubmed-10673536
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-106735362023-10-31 Optimizing the Performance of the Sparse Matrix–Vector Multiplication Kernel in FPGA Guided by the Roofline Model Favaro, Federico Dufrechou, Ernesto Oliver, Juan P. Ezzatti, Pablo Micromachines (Basel) Article The widespread adoption of massively parallel processors over the past decade has fundamentally transformed the landscape of high-performance computing hardware. This revolution has recently driven the advancement of FPGAs, which are emerging as an attractive alternative to power-hungry many-core devices in a world increasingly concerned with energy consumption. Consequently, numerous recent studies have focused on implementing efficient dense and sparse numerical linear algebra (NLA) kernels on FPGAs. To maximize the efficiency of these kernels, a key aspect is the exploration of analytical tools to comprehend the performance of the developments and guide the optimization process. In this regard, the roofline model (RLM) is a well-known graphical tool that facilitates the analysis of computational performance and identifies the primary bottlenecks of a specific software when executed on a particular hardware platform. Our previous efforts advanced in developing efficient implementations of the sparse matrix–vector multiplication (SpMV) for FPGAs, considering both speed and energy consumption. In this work, we propose an extension of the RLM that enables optimizing runtime and energy consumption for NLA kernels based on sparse blocked storage formats on FPGAs. To test the power of this tool, we use it to extend our previous SpMV kernels by leveraging a block-sparse storage format that enables more efficient data access. MDPI 2023-10-31 /pmc/articles/PMC10673536/ /pubmed/38004887 http://dx.doi.org/10.3390/mi14112030 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Favaro, Federico Dufrechou, Ernesto Oliver, Juan P. Ezzatti, Pablo Optimizing the Performance of the Sparse Matrix–Vector Multiplication Kernel in FPGA Guided by the Roofline Model
title	Optimizing the Performance of the Sparse Matrix–Vector Multiplication Kernel in FPGA Guided by the Roofline Model
title_full	Optimizing the Performance of the Sparse Matrix–Vector Multiplication Kernel in FPGA Guided by the Roofline Model
title_fullStr	Optimizing the Performance of the Sparse Matrix–Vector Multiplication Kernel in FPGA Guided by the Roofline Model
title_full_unstemmed	Optimizing the Performance of the Sparse Matrix–Vector Multiplication Kernel in FPGA Guided by the Roofline Model
title_short	Optimizing the Performance of the Sparse Matrix–Vector Multiplication Kernel in FPGA Guided by the Roofline Model
title_sort	optimizing the performance of the sparse matrix–vector multiplication kernel in fpga guided by the roofline model
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673536/ https://www.ncbi.nlm.nih.gov/pubmed/38004887 http://dx.doi.org/10.3390/mi14112030
work_keys_str_mv	AT favarofederico optimizingtheperformanceofthesparsematrixvectormultiplicationkernelinfpgaguidedbytherooflinemodel AT dufrechouernesto optimizingtheperformanceofthesparsematrixvectormultiplicationkernelinfpgaguidedbytherooflinemodel AT oliverjuanp optimizingtheperformanceofthesparsematrixvectormultiplicationkernelinfpgaguidedbytherooflinemodel AT ezzattipablo optimizingtheperformanceofthesparsematrixvectormultiplicationkernelinfpgaguidedbytherooflinemodel

Optimizing the Performance of the Sparse Matrix–Vector Multiplication Kernel in FPGA Guided by the Roofline Model

Ejemplares similares