Cargando…

Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization

We design and develop a new high performance implementation of a fast direct LU-based solver using low-rank approximations on massively parallel systems. The LU factorization is the most time-consuming step in solving systems of linear equations in the context of analyzing acoustic scattering from l...

Descripción completa

Detalles Bibliográficos
Autores principales: Al-Harthi, Noha, Alomairy, Rabab, Akbudak, Kadir, Chen, Rui, Ltaief, Hatem, Bagci, Hakan, Keyes, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295352/
http://dx.doi.org/10.1007/978-3-030-50743-5_11
_version_ 1783546634539368448
author Al-Harthi, Noha
Alomairy, Rabab
Akbudak, Kadir
Chen, Rui
Ltaief, Hatem
Bagci, Hakan
Keyes, David
author_facet Al-Harthi, Noha
Alomairy, Rabab
Akbudak, Kadir
Chen, Rui
Ltaief, Hatem
Bagci, Hakan
Keyes, David
author_sort Al-Harthi, Noha
collection PubMed
description We design and develop a new high performance implementation of a fast direct LU-based solver using low-rank approximations on massively parallel systems. The LU factorization is the most time-consuming step in solving systems of linear equations in the context of analyzing acoustic scattering from large 3D objects. The matrix equation is obtained by discretizing the boundary integral of the exterior Helmholtz problem using a higher-order Nyström scheme. The main idea is to exploit the inherent data sparsity of the matrix operator by performing local tile-centric approximations while still capturing the most significant information. In particular, the proposed LU-based solver leverages the Tile Low-Rank (TLR) data compression format as implemented in the Hierarchical Computations on Manycore Architectures (HiCMA) library to decrease the complexity of “classical” dense direct solvers from cubic to quadratic order. We taskify the underlying boundary integral kernels to expose fine-grained computations. We then employ the dynamic runtime system StarPU to orchestrate the scheduling of computational tasks on shared and distributed-memory systems. The resulting asynchronous execution permits to compensate for the load imbalance due to the heterogeneous ranks, while mitigating the overhead of data motion. We assess the robustness of our TLR LU-based solver and study the qualitative impact when using different numerical accuracies. The new TLR LU factorization outperforms the state-of-the-art dense factorizations by up to an order of magnitude on various parallel systems, for analysis of scattering from large-scale 3D synthetic and real geometries.
format Online
Article
Text
id pubmed-7295352
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72953522020-06-16 Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization Al-Harthi, Noha Alomairy, Rabab Akbudak, Kadir Chen, Rui Ltaief, Hatem Bagci, Hakan Keyes, David High Performance Computing Article We design and develop a new high performance implementation of a fast direct LU-based solver using low-rank approximations on massively parallel systems. The LU factorization is the most time-consuming step in solving systems of linear equations in the context of analyzing acoustic scattering from large 3D objects. The matrix equation is obtained by discretizing the boundary integral of the exterior Helmholtz problem using a higher-order Nyström scheme. The main idea is to exploit the inherent data sparsity of the matrix operator by performing local tile-centric approximations while still capturing the most significant information. In particular, the proposed LU-based solver leverages the Tile Low-Rank (TLR) data compression format as implemented in the Hierarchical Computations on Manycore Architectures (HiCMA) library to decrease the complexity of “classical” dense direct solvers from cubic to quadratic order. We taskify the underlying boundary integral kernels to expose fine-grained computations. We then employ the dynamic runtime system StarPU to orchestrate the scheduling of computational tasks on shared and distributed-memory systems. The resulting asynchronous execution permits to compensate for the load imbalance due to the heterogeneous ranks, while mitigating the overhead of data motion. We assess the robustness of our TLR LU-based solver and study the qualitative impact when using different numerical accuracies. The new TLR LU factorization outperforms the state-of-the-art dense factorizations by up to an order of magnitude on various parallel systems, for analysis of scattering from large-scale 3D synthetic and real geometries. 2020-05-22 /pmc/articles/PMC7295352/ http://dx.doi.org/10.1007/978-3-030-50743-5_11 Text en © The Author(s) 2020 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
spellingShingle Article
Al-Harthi, Noha
Alomairy, Rabab
Akbudak, Kadir
Chen, Rui
Ltaief, Hatem
Bagci, Hakan
Keyes, David
Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization
title Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization
title_full Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization
title_fullStr Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization
title_full_unstemmed Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization
title_short Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization
title_sort solving acoustic boundary integral equations using high performance tile low-rank lu factorization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295352/
http://dx.doi.org/10.1007/978-3-030-50743-5_11
work_keys_str_mv AT alharthinoha solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization
AT alomairyrabab solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization
AT akbudakkadir solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization
AT chenrui solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization
AT ltaiefhatem solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization
AT bagcihakan solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization
AT keyesdavid solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization