Cargando…

Solving global shallow water equations on heterogeneous supercomputers

The scientific demand for more accurate modeling of the climate system calls for more computing power to support higher resolutions, inclusion of more component models, more complicated physics schemes, and larger ensembles. As the recent improvements in computing power mostly come from the increasi...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Haohuan, Gan, Lin, Yang, Chao, Xue, Wei, Wang, Lanning, Wang, Xinliang, Huang, Xiaomeng, Yang, Guangwen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345762/
https://www.ncbi.nlm.nih.gov/pubmed/28282428
http://dx.doi.org/10.1371/journal.pone.0172583
_version_ 1782513775922905088
author Fu, Haohuan
Gan, Lin
Yang, Chao
Xue, Wei
Wang, Lanning
Wang, Xinliang
Huang, Xiaomeng
Yang, Guangwen
author_facet Fu, Haohuan
Gan, Lin
Yang, Chao
Xue, Wei
Wang, Lanning
Wang, Xinliang
Huang, Xiaomeng
Yang, Guangwen
author_sort Fu, Haohuan
collection PubMed
description The scientific demand for more accurate modeling of the climate system calls for more computing power to support higher resolutions, inclusion of more component models, more complicated physics schemes, and larger ensembles. As the recent improvements in computing power mostly come from the increasing number of nodes in a system and the integration of heterogeneous accelerators, how to scale the computing problems onto more nodes and various kinds of accelerators has become a challenge for the model development. This paper describes our efforts on developing a highly scalable framework for performing global atmospheric modeling on heterogeneous supercomputers equipped with various accelerators, such as GPU (Graphic Processing Unit), MIC (Many Integrated Core), and FPGA (Field Programmable Gate Arrays) cards. We propose a generalized partition scheme of the problem domain, so as to keep a balanced utilization of both CPU resources and accelerator resources. With optimizations on both computing and memory access patterns, we manage to achieve around 8 to 20 times speedup when comparing one hybrid GPU or MIC node with one CPU node with 12 cores. Using a customized FPGA-based data-flow engines, we see the potential to gain another 5 to 8 times improvement on performance. On heterogeneous supercomputers, such as Tianhe-1A and Tianhe-2, our framework is capable of achieving ideally linear scaling efficiency, and sustained double-precision performances of 581 Tflops on Tianhe-1A (using 3750 nodes) and 3.74 Pflops on Tianhe-2 (using 8644 nodes). Our study also provides an evaluation on the programming paradigm of various accelerator architectures (GPU, MIC, FPGA) for performing global atmospheric simulation, to form a picture about both the potential performance benefits and the programming efforts involved.
format Online
Article
Text
id pubmed-5345762
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53457622017-03-30 Solving global shallow water equations on heterogeneous supercomputers Fu, Haohuan Gan, Lin Yang, Chao Xue, Wei Wang, Lanning Wang, Xinliang Huang, Xiaomeng Yang, Guangwen PLoS One Research Article The scientific demand for more accurate modeling of the climate system calls for more computing power to support higher resolutions, inclusion of more component models, more complicated physics schemes, and larger ensembles. As the recent improvements in computing power mostly come from the increasing number of nodes in a system and the integration of heterogeneous accelerators, how to scale the computing problems onto more nodes and various kinds of accelerators has become a challenge for the model development. This paper describes our efforts on developing a highly scalable framework for performing global atmospheric modeling on heterogeneous supercomputers equipped with various accelerators, such as GPU (Graphic Processing Unit), MIC (Many Integrated Core), and FPGA (Field Programmable Gate Arrays) cards. We propose a generalized partition scheme of the problem domain, so as to keep a balanced utilization of both CPU resources and accelerator resources. With optimizations on both computing and memory access patterns, we manage to achieve around 8 to 20 times speedup when comparing one hybrid GPU or MIC node with one CPU node with 12 cores. Using a customized FPGA-based data-flow engines, we see the potential to gain another 5 to 8 times improvement on performance. On heterogeneous supercomputers, such as Tianhe-1A and Tianhe-2, our framework is capable of achieving ideally linear scaling efficiency, and sustained double-precision performances of 581 Tflops on Tianhe-1A (using 3750 nodes) and 3.74 Pflops on Tianhe-2 (using 8644 nodes). Our study also provides an evaluation on the programming paradigm of various accelerator architectures (GPU, MIC, FPGA) for performing global atmospheric simulation, to form a picture about both the potential performance benefits and the programming efforts involved. Public Library of Science 2017-03-10 /pmc/articles/PMC5345762/ /pubmed/28282428 http://dx.doi.org/10.1371/journal.pone.0172583 Text en © 2017 Fu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Fu, Haohuan
Gan, Lin
Yang, Chao
Xue, Wei
Wang, Lanning
Wang, Xinliang
Huang, Xiaomeng
Yang, Guangwen
Solving global shallow water equations on heterogeneous supercomputers
title Solving global shallow water equations on heterogeneous supercomputers
title_full Solving global shallow water equations on heterogeneous supercomputers
title_fullStr Solving global shallow water equations on heterogeneous supercomputers
title_full_unstemmed Solving global shallow water equations on heterogeneous supercomputers
title_short Solving global shallow water equations on heterogeneous supercomputers
title_sort solving global shallow water equations on heterogeneous supercomputers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345762/
https://www.ncbi.nlm.nih.gov/pubmed/28282428
http://dx.doi.org/10.1371/journal.pone.0172583
work_keys_str_mv AT fuhaohuan solvingglobalshallowwaterequationsonheterogeneoussupercomputers
AT ganlin solvingglobalshallowwaterequationsonheterogeneoussupercomputers
AT yangchao solvingglobalshallowwaterequationsonheterogeneoussupercomputers
AT xuewei solvingglobalshallowwaterequationsonheterogeneoussupercomputers
AT wanglanning solvingglobalshallowwaterequationsonheterogeneoussupercomputers
AT wangxinliang solvingglobalshallowwaterequationsonheterogeneoussupercomputers
AT huangxiaomeng solvingglobalshallowwaterequationsonheterogeneoussupercomputers
AT yangguangwen solvingglobalshallowwaterequationsonheterogeneoussupercomputers