Cargando…
Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation
This paper focuses on evaluating the impact of different data layouts on the computational efficiency of GPU-accelerated Inverse Distance Weighting (IDW) interpolation algorithm. First we redesign and improve our previous GPU implementation that was performed by exploiting the feature of CUDA dynami...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4735051/ https://www.ncbi.nlm.nih.gov/pubmed/26877902 http://dx.doi.org/10.1186/s40064-016-1731-6 |
_version_ | 1782413006580219904 |
---|---|
author | Mei, Gang Tian, Hong |
author_facet | Mei, Gang Tian, Hong |
author_sort | Mei, Gang |
collection | PubMed |
description | This paper focuses on evaluating the impact of different data layouts on the computational efficiency of GPU-accelerated Inverse Distance Weighting (IDW) interpolation algorithm. First we redesign and improve our previous GPU implementation that was performed by exploiting the feature of CUDA dynamic parallelism (CDP). Then we implement three versions of GPU implementations, i.e., the naive version, the tiled version, and the improved CDP version, based upon five data layouts, including the Structure of Arrays (SoA), the Array of Structures (AoS), the Array of aligned Structures (AoaS), the Structure of Arrays of aligned Structures (SoAoS), and the Hybrid layout. We also carry out several groups of experimental tests to evaluate the impact. Experimental results show that: the layouts AoS and AoaS achieve better performance than the layout SoA for both the naive version and tiled version, while the layout SoA is the best choice for the improved CDP version. We also observe that: for the two combined data layouts (the SoAoS and the Hybrid), there are no notable performance gains when compared to other three basic layouts. We recommend that: in practical applications, the layout AoaS is the best choice since the tiled version is the fastest one among three versions. The source code of all implementations are publicly available. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40064-016-1731-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4735051 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-47350512016-02-12 Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation Mei, Gang Tian, Hong Springerplus Research This paper focuses on evaluating the impact of different data layouts on the computational efficiency of GPU-accelerated Inverse Distance Weighting (IDW) interpolation algorithm. First we redesign and improve our previous GPU implementation that was performed by exploiting the feature of CUDA dynamic parallelism (CDP). Then we implement three versions of GPU implementations, i.e., the naive version, the tiled version, and the improved CDP version, based upon five data layouts, including the Structure of Arrays (SoA), the Array of Structures (AoS), the Array of aligned Structures (AoaS), the Structure of Arrays of aligned Structures (SoAoS), and the Hybrid layout. We also carry out several groups of experimental tests to evaluate the impact. Experimental results show that: the layouts AoS and AoaS achieve better performance than the layout SoA for both the naive version and tiled version, while the layout SoA is the best choice for the improved CDP version. We also observe that: for the two combined data layouts (the SoAoS and the Hybrid), there are no notable performance gains when compared to other three basic layouts. We recommend that: in practical applications, the layout AoaS is the best choice since the tiled version is the fastest one among three versions. The source code of all implementations are publicly available. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40064-016-1731-6) contains supplementary material, which is available to authorized users. Springer International Publishing 2016-02-01 /pmc/articles/PMC4735051/ /pubmed/26877902 http://dx.doi.org/10.1186/s40064-016-1731-6 Text en © Mei and Tian. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Research Mei, Gang Tian, Hong Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation |
title | Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation |
title_full | Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation |
title_fullStr | Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation |
title_full_unstemmed | Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation |
title_short | Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation |
title_sort | impact of data layouts on the efficiency of gpu-accelerated idw interpolation |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4735051/ https://www.ncbi.nlm.nih.gov/pubmed/26877902 http://dx.doi.org/10.1186/s40064-016-1731-6 |
work_keys_str_mv | AT meigang impactofdatalayoutsontheefficiencyofgpuacceleratedidwinterpolation AT tianhong impactofdatalayoutsontheefficiencyofgpuacceleratedidwinterpolation |