Cargando…

Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation

This paper focuses on evaluating the impact of different data layouts on the computational efficiency of GPU-accelerated Inverse Distance Weighting (IDW) interpolation algorithm. First we redesign and improve our previous GPU implementation that was performed by exploiting the feature of CUDA dynami...

Descripción completa

Detalles Bibliográficos
Autores principales: Mei, Gang, Tian, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4735051/
https://www.ncbi.nlm.nih.gov/pubmed/26877902
http://dx.doi.org/10.1186/s40064-016-1731-6
_version_ 1782413006580219904
author Mei, Gang
Tian, Hong
author_facet Mei, Gang
Tian, Hong
author_sort Mei, Gang
collection PubMed
description This paper focuses on evaluating the impact of different data layouts on the computational efficiency of GPU-accelerated Inverse Distance Weighting (IDW) interpolation algorithm. First we redesign and improve our previous GPU implementation that was performed by exploiting the feature of CUDA dynamic parallelism (CDP). Then we implement three versions of GPU implementations, i.e., the naive version, the tiled version, and the improved CDP version, based upon five data layouts, including the Structure of Arrays (SoA), the Array of Structures (AoS), the Array of aligned Structures (AoaS), the Structure of Arrays of aligned Structures (SoAoS), and the Hybrid layout. We also carry out several groups of experimental tests to evaluate the impact. Experimental results show that: the layouts AoS and AoaS achieve better performance than the layout SoA for both the naive version and tiled version, while the layout SoA is the best choice for the improved CDP version. We also observe that: for the two combined data layouts (the SoAoS and the Hybrid), there are no notable performance gains when compared to other three basic layouts. We recommend that: in practical applications, the layout AoaS is the best choice since the tiled version is the fastest one among three versions. The source code of all implementations are publicly available. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40064-016-1731-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4735051
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-47350512016-02-12 Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation Mei, Gang Tian, Hong Springerplus Research This paper focuses on evaluating the impact of different data layouts on the computational efficiency of GPU-accelerated Inverse Distance Weighting (IDW) interpolation algorithm. First we redesign and improve our previous GPU implementation that was performed by exploiting the feature of CUDA dynamic parallelism (CDP). Then we implement three versions of GPU implementations, i.e., the naive version, the tiled version, and the improved CDP version, based upon five data layouts, including the Structure of Arrays (SoA), the Array of Structures (AoS), the Array of aligned Structures (AoaS), the Structure of Arrays of aligned Structures (SoAoS), and the Hybrid layout. We also carry out several groups of experimental tests to evaluate the impact. Experimental results show that: the layouts AoS and AoaS achieve better performance than the layout SoA for both the naive version and tiled version, while the layout SoA is the best choice for the improved CDP version. We also observe that: for the two combined data layouts (the SoAoS and the Hybrid), there are no notable performance gains when compared to other three basic layouts. We recommend that: in practical applications, the layout AoaS is the best choice since the tiled version is the fastest one among three versions. The source code of all implementations are publicly available. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40064-016-1731-6) contains supplementary material, which is available to authorized users. Springer International Publishing 2016-02-01 /pmc/articles/PMC4735051/ /pubmed/26877902 http://dx.doi.org/10.1186/s40064-016-1731-6 Text en © Mei and Tian. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research
Mei, Gang
Tian, Hong
Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation
title Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation
title_full Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation
title_fullStr Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation
title_full_unstemmed Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation
title_short Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation
title_sort impact of data layouts on the efficiency of gpu-accelerated idw interpolation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4735051/
https://www.ncbi.nlm.nih.gov/pubmed/26877902
http://dx.doi.org/10.1186/s40064-016-1731-6
work_keys_str_mv AT meigang impactofdatalayoutsontheefficiencyofgpuacceleratedidwinterpolation
AT tianhong impactofdatalayoutsontheefficiencyofgpuacceleratedidwinterpolation