Cargando…
Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit
This paper focuses on designing and implementing parallel adaptive inverse distance weighting (AIDW) interpolation algorithms by using the graphics processing unit (GPU). The AIDW is an improved version of the standard IDW, which can adaptively determine the power parameter according to the data poi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society Publishing
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627094/ https://www.ncbi.nlm.nih.gov/pubmed/28989754 http://dx.doi.org/10.1098/rsos.170436 |
_version_ | 1783268655602073600 |
---|---|
author | Mei, Gang Xu, Liangliang Xu, Nengxiong |
author_facet | Mei, Gang Xu, Liangliang Xu, Nengxiong |
author_sort | Mei, Gang |
collection | PubMed |
description | This paper focuses on designing and implementing parallel adaptive inverse distance weighting (AIDW) interpolation algorithms by using the graphics processing unit (GPU). The AIDW is an improved version of the standard IDW, which can adaptively determine the power parameter according to the data points’ spatial distribution pattern and achieve more accurate predictions than those predicted by IDW. In this paper, we first present two versions of the GPU-accelerated AIDW, i.e. the naive version without profiting from the shared memory and the tiled version taking advantage of the shared memory. We also implement the naive version and the tiled version using two data layouts, structure of arrays and array of aligned structures, on both single and double precision. We then evaluate the performance of parallel AIDW by comparing it with its corresponding serial algorithm on three different machines equipped with the GPUs GT730M, M5000 and K40c. The experimental results indicate that: (i) there is no significant difference in the computational efficiency when different data layouts are employed; (ii) the tiled version is always slightly faster than the naive version; and (iii) on single precision the achieved speed-up can be up to 763 (on the GPU M5000), while on double precision the obtained highest speed-up is 197 (on the GPU K40c). To benefit the community, all source code and testing data related to the presented parallel AIDW algorithm are publicly available. |
format | Online Article Text |
id | pubmed-5627094 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | The Royal Society Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-56270942017-10-08 Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit Mei, Gang Xu, Liangliang Xu, Nengxiong R Soc Open Sci Computer Science This paper focuses on designing and implementing parallel adaptive inverse distance weighting (AIDW) interpolation algorithms by using the graphics processing unit (GPU). The AIDW is an improved version of the standard IDW, which can adaptively determine the power parameter according to the data points’ spatial distribution pattern and achieve more accurate predictions than those predicted by IDW. In this paper, we first present two versions of the GPU-accelerated AIDW, i.e. the naive version without profiting from the shared memory and the tiled version taking advantage of the shared memory. We also implement the naive version and the tiled version using two data layouts, structure of arrays and array of aligned structures, on both single and double precision. We then evaluate the performance of parallel AIDW by comparing it with its corresponding serial algorithm on three different machines equipped with the GPUs GT730M, M5000 and K40c. The experimental results indicate that: (i) there is no significant difference in the computational efficiency when different data layouts are employed; (ii) the tiled version is always slightly faster than the naive version; and (iii) on single precision the achieved speed-up can be up to 763 (on the GPU M5000), while on double precision the obtained highest speed-up is 197 (on the GPU K40c). To benefit the community, all source code and testing data related to the presented parallel AIDW algorithm are publicly available. The Royal Society Publishing 2017-09-20 /pmc/articles/PMC5627094/ /pubmed/28989754 http://dx.doi.org/10.1098/rsos.170436 Text en © 2017 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Computer Science Mei, Gang Xu, Liangliang Xu, Nengxiong Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit |
title | Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit |
title_full | Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit |
title_fullStr | Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit |
title_full_unstemmed | Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit |
title_short | Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit |
title_sort | accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit |
topic | Computer Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627094/ https://www.ncbi.nlm.nih.gov/pubmed/28989754 http://dx.doi.org/10.1098/rsos.170436 |
work_keys_str_mv | AT meigang acceleratingadaptiveinversedistanceweightinginterpolationalgorithmonagraphicsprocessingunit AT xuliangliang acceleratingadaptiveinversedistanceweightinginterpolationalgorithmonagraphicsprocessingunit AT xunengxiong acceleratingadaptiveinversedistanceweightinginterpolationalgorithmonagraphicsprocessingunit |