Cargando…
Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization
We design and develop a new high performance implementation of a fast direct LU-based solver using low-rank approximations on massively parallel systems. The LU factorization is the most time-consuming step in solving systems of linear equations in the context of analyzing acoustic scattering from l...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295352/ http://dx.doi.org/10.1007/978-3-030-50743-5_11 |
_version_ | 1783546634539368448 |
---|---|
author | Al-Harthi, Noha Alomairy, Rabab Akbudak, Kadir Chen, Rui Ltaief, Hatem Bagci, Hakan Keyes, David |
author_facet | Al-Harthi, Noha Alomairy, Rabab Akbudak, Kadir Chen, Rui Ltaief, Hatem Bagci, Hakan Keyes, David |
author_sort | Al-Harthi, Noha |
collection | PubMed |
description | We design and develop a new high performance implementation of a fast direct LU-based solver using low-rank approximations on massively parallel systems. The LU factorization is the most time-consuming step in solving systems of linear equations in the context of analyzing acoustic scattering from large 3D objects. The matrix equation is obtained by discretizing the boundary integral of the exterior Helmholtz problem using a higher-order Nyström scheme. The main idea is to exploit the inherent data sparsity of the matrix operator by performing local tile-centric approximations while still capturing the most significant information. In particular, the proposed LU-based solver leverages the Tile Low-Rank (TLR) data compression format as implemented in the Hierarchical Computations on Manycore Architectures (HiCMA) library to decrease the complexity of “classical” dense direct solvers from cubic to quadratic order. We taskify the underlying boundary integral kernels to expose fine-grained computations. We then employ the dynamic runtime system StarPU to orchestrate the scheduling of computational tasks on shared and distributed-memory systems. The resulting asynchronous execution permits to compensate for the load imbalance due to the heterogeneous ranks, while mitigating the overhead of data motion. We assess the robustness of our TLR LU-based solver and study the qualitative impact when using different numerical accuracies. The new TLR LU factorization outperforms the state-of-the-art dense factorizations by up to an order of magnitude on various parallel systems, for analysis of scattering from large-scale 3D synthetic and real geometries. |
format | Online Article Text |
id | pubmed-7295352 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72953522020-06-16 Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization Al-Harthi, Noha Alomairy, Rabab Akbudak, Kadir Chen, Rui Ltaief, Hatem Bagci, Hakan Keyes, David High Performance Computing Article We design and develop a new high performance implementation of a fast direct LU-based solver using low-rank approximations on massively parallel systems. The LU factorization is the most time-consuming step in solving systems of linear equations in the context of analyzing acoustic scattering from large 3D objects. The matrix equation is obtained by discretizing the boundary integral of the exterior Helmholtz problem using a higher-order Nyström scheme. The main idea is to exploit the inherent data sparsity of the matrix operator by performing local tile-centric approximations while still capturing the most significant information. In particular, the proposed LU-based solver leverages the Tile Low-Rank (TLR) data compression format as implemented in the Hierarchical Computations on Manycore Architectures (HiCMA) library to decrease the complexity of “classical” dense direct solvers from cubic to quadratic order. We taskify the underlying boundary integral kernels to expose fine-grained computations. We then employ the dynamic runtime system StarPU to orchestrate the scheduling of computational tasks on shared and distributed-memory systems. The resulting asynchronous execution permits to compensate for the load imbalance due to the heterogeneous ranks, while mitigating the overhead of data motion. We assess the robustness of our TLR LU-based solver and study the qualitative impact when using different numerical accuracies. The new TLR LU factorization outperforms the state-of-the-art dense factorizations by up to an order of magnitude on various parallel systems, for analysis of scattering from large-scale 3D synthetic and real geometries. 2020-05-22 /pmc/articles/PMC7295352/ http://dx.doi.org/10.1007/978-3-030-50743-5_11 Text en © The Author(s) 2020 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. |
spellingShingle | Article Al-Harthi, Noha Alomairy, Rabab Akbudak, Kadir Chen, Rui Ltaief, Hatem Bagci, Hakan Keyes, David Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization |
title | Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization |
title_full | Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization |
title_fullStr | Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization |
title_full_unstemmed | Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization |
title_short | Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization |
title_sort | solving acoustic boundary integral equations using high performance tile low-rank lu factorization |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295352/ http://dx.doi.org/10.1007/978-3-030-50743-5_11 |
work_keys_str_mv | AT alharthinoha solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization AT alomairyrabab solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization AT akbudakkadir solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization AT chenrui solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization AT ltaiefhatem solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization AT bagcihakan solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization AT keyesdavid solvingacousticboundaryintegralequationsusinghighperformancetilelowranklufactorization |