Cargando…
Solving global shallow water equations on heterogeneous supercomputers
The scientific demand for more accurate modeling of the climate system calls for more computing power to support higher resolutions, inclusion of more component models, more complicated physics schemes, and larger ensembles. As the recent improvements in computing power mostly come from the increasi...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345762/ https://www.ncbi.nlm.nih.gov/pubmed/28282428 http://dx.doi.org/10.1371/journal.pone.0172583 |
_version_ | 1782513775922905088 |
---|---|
author | Fu, Haohuan Gan, Lin Yang, Chao Xue, Wei Wang, Lanning Wang, Xinliang Huang, Xiaomeng Yang, Guangwen |
author_facet | Fu, Haohuan Gan, Lin Yang, Chao Xue, Wei Wang, Lanning Wang, Xinliang Huang, Xiaomeng Yang, Guangwen |
author_sort | Fu, Haohuan |
collection | PubMed |
description | The scientific demand for more accurate modeling of the climate system calls for more computing power to support higher resolutions, inclusion of more component models, more complicated physics schemes, and larger ensembles. As the recent improvements in computing power mostly come from the increasing number of nodes in a system and the integration of heterogeneous accelerators, how to scale the computing problems onto more nodes and various kinds of accelerators has become a challenge for the model development. This paper describes our efforts on developing a highly scalable framework for performing global atmospheric modeling on heterogeneous supercomputers equipped with various accelerators, such as GPU (Graphic Processing Unit), MIC (Many Integrated Core), and FPGA (Field Programmable Gate Arrays) cards. We propose a generalized partition scheme of the problem domain, so as to keep a balanced utilization of both CPU resources and accelerator resources. With optimizations on both computing and memory access patterns, we manage to achieve around 8 to 20 times speedup when comparing one hybrid GPU or MIC node with one CPU node with 12 cores. Using a customized FPGA-based data-flow engines, we see the potential to gain another 5 to 8 times improvement on performance. On heterogeneous supercomputers, such as Tianhe-1A and Tianhe-2, our framework is capable of achieving ideally linear scaling efficiency, and sustained double-precision performances of 581 Tflops on Tianhe-1A (using 3750 nodes) and 3.74 Pflops on Tianhe-2 (using 8644 nodes). Our study also provides an evaluation on the programming paradigm of various accelerator architectures (GPU, MIC, FPGA) for performing global atmospheric simulation, to form a picture about both the potential performance benefits and the programming efforts involved. |
format | Online Article Text |
id | pubmed-5345762 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-53457622017-03-30 Solving global shallow water equations on heterogeneous supercomputers Fu, Haohuan Gan, Lin Yang, Chao Xue, Wei Wang, Lanning Wang, Xinliang Huang, Xiaomeng Yang, Guangwen PLoS One Research Article The scientific demand for more accurate modeling of the climate system calls for more computing power to support higher resolutions, inclusion of more component models, more complicated physics schemes, and larger ensembles. As the recent improvements in computing power mostly come from the increasing number of nodes in a system and the integration of heterogeneous accelerators, how to scale the computing problems onto more nodes and various kinds of accelerators has become a challenge for the model development. This paper describes our efforts on developing a highly scalable framework for performing global atmospheric modeling on heterogeneous supercomputers equipped with various accelerators, such as GPU (Graphic Processing Unit), MIC (Many Integrated Core), and FPGA (Field Programmable Gate Arrays) cards. We propose a generalized partition scheme of the problem domain, so as to keep a balanced utilization of both CPU resources and accelerator resources. With optimizations on both computing and memory access patterns, we manage to achieve around 8 to 20 times speedup when comparing one hybrid GPU or MIC node with one CPU node with 12 cores. Using a customized FPGA-based data-flow engines, we see the potential to gain another 5 to 8 times improvement on performance. On heterogeneous supercomputers, such as Tianhe-1A and Tianhe-2, our framework is capable of achieving ideally linear scaling efficiency, and sustained double-precision performances of 581 Tflops on Tianhe-1A (using 3750 nodes) and 3.74 Pflops on Tianhe-2 (using 8644 nodes). Our study also provides an evaluation on the programming paradigm of various accelerator architectures (GPU, MIC, FPGA) for performing global atmospheric simulation, to form a picture about both the potential performance benefits and the programming efforts involved. Public Library of Science 2017-03-10 /pmc/articles/PMC5345762/ /pubmed/28282428 http://dx.doi.org/10.1371/journal.pone.0172583 Text en © 2017 Fu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Fu, Haohuan Gan, Lin Yang, Chao Xue, Wei Wang, Lanning Wang, Xinliang Huang, Xiaomeng Yang, Guangwen Solving global shallow water equations on heterogeneous supercomputers |
title | Solving global shallow water equations on heterogeneous supercomputers |
title_full | Solving global shallow water equations on heterogeneous supercomputers |
title_fullStr | Solving global shallow water equations on heterogeneous supercomputers |
title_full_unstemmed | Solving global shallow water equations on heterogeneous supercomputers |
title_short | Solving global shallow water equations on heterogeneous supercomputers |
title_sort | solving global shallow water equations on heterogeneous supercomputers |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345762/ https://www.ncbi.nlm.nih.gov/pubmed/28282428 http://dx.doi.org/10.1371/journal.pone.0172583 |
work_keys_str_mv | AT fuhaohuan solvingglobalshallowwaterequationsonheterogeneoussupercomputers AT ganlin solvingglobalshallowwaterequationsonheterogeneoussupercomputers AT yangchao solvingglobalshallowwaterequationsonheterogeneoussupercomputers AT xuewei solvingglobalshallowwaterequationsonheterogeneoussupercomputers AT wanglanning solvingglobalshallowwaterequationsonheterogeneoussupercomputers AT wangxinliang solvingglobalshallowwaterequationsonheterogeneoussupercomputers AT huangxiaomeng solvingglobalshallowwaterequationsonheterogeneoussupercomputers AT yangguangwen solvingglobalshallowwaterequationsonheterogeneoussupercomputers |