Cargando…

Performance-aware programming for intraoperative intensity-based image registration on graphics processing units

PURPOSE: Intensity-based image registration has been proven essential in many applications accredited to its unparalleled ability to resolve image misalignments. However, long registration time for image realignment prohibits its use in intra-operative navigation systems. There has been much work on...

Descripción completa

Detalles Bibliográficos
Autores principales: Leong, Martin C. W., Lee, Kit-Hang, Kwan, Bowen P. Y., Ng, Yui-Lun, Liu, Zhiyu, Navab, Nassir, Luk, Wayne, Kwok, Ka-Wai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946684/
https://www.ncbi.nlm.nih.gov/pubmed/33484431
http://dx.doi.org/10.1007/s11548-020-02303-y
_version_ 1783663101678190592
author Leong, Martin C. W.
Lee, Kit-Hang
Kwan, Bowen P. Y.
Ng, Yui-Lun
Liu, Zhiyu
Navab, Nassir
Luk, Wayne
Kwok, Ka-Wai
author_facet Leong, Martin C. W.
Lee, Kit-Hang
Kwan, Bowen P. Y.
Ng, Yui-Lun
Liu, Zhiyu
Navab, Nassir
Luk, Wayne
Kwok, Ka-Wai
author_sort Leong, Martin C. W.
collection PubMed
description PURPOSE: Intensity-based image registration has been proven essential in many applications accredited to its unparalleled ability to resolve image misalignments. However, long registration time for image realignment prohibits its use in intra-operative navigation systems. There has been much work on accelerating the registration process by improving the algorithm’s robustness, but the innate computation required by the registration algorithm has been unresolved. METHODS: Intensity-based registration methods involve operations with high arithmetic load and memory access demand, which supposes to be reduced by graphics processing units (GPUs). Although GPUs are widespread and affordable, there is a lack of open-source GPU implementations optimized for non-rigid image registration. This paper demonstrates performance-aware programming techniques, which involves systematic exploitation of GPU features, by implementing the diffeomorphic log-demons algorithm. RESULTS: By resolving the pinpointed computation bottlenecks on GPU, our implementation of diffeomorphic log-demons on Nvidia GTX Titan X GPU has achieved ~ 95 times speed-up compared to the CPU and registered a 1.3-M voxel image in 286 ms. Even for large 37-M voxel images, our implementation is able to register in 8.56 s, which attained ~ 258 times speed-up. Our solution involves effective employment of GPU computation units, memory, and data bandwidth to resolve computation bottlenecks. CONCLUSION: The computation bottlenecks in diffeomorphic log-demons are pinpointed, analyzed, and resolved using various GPU performance-aware programming techniques. The proposed fast computation on basic image operations not only enhances the computation of diffeomorphic log-demons, but is also potentially extended to speed up many other intensity-based approaches. Our implementation is open-source on GitHub at https://bit.ly/2PYZxQz.
format Online
Article
Text
id pubmed-7946684
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-79466842021-03-28 Performance-aware programming for intraoperative intensity-based image registration on graphics processing units Leong, Martin C. W. Lee, Kit-Hang Kwan, Bowen P. Y. Ng, Yui-Lun Liu, Zhiyu Navab, Nassir Luk, Wayne Kwok, Ka-Wai Int J Comput Assist Radiol Surg Original Article PURPOSE: Intensity-based image registration has been proven essential in many applications accredited to its unparalleled ability to resolve image misalignments. However, long registration time for image realignment prohibits its use in intra-operative navigation systems. There has been much work on accelerating the registration process by improving the algorithm’s robustness, but the innate computation required by the registration algorithm has been unresolved. METHODS: Intensity-based registration methods involve operations with high arithmetic load and memory access demand, which supposes to be reduced by graphics processing units (GPUs). Although GPUs are widespread and affordable, there is a lack of open-source GPU implementations optimized for non-rigid image registration. This paper demonstrates performance-aware programming techniques, which involves systematic exploitation of GPU features, by implementing the diffeomorphic log-demons algorithm. RESULTS: By resolving the pinpointed computation bottlenecks on GPU, our implementation of diffeomorphic log-demons on Nvidia GTX Titan X GPU has achieved ~ 95 times speed-up compared to the CPU and registered a 1.3-M voxel image in 286 ms. Even for large 37-M voxel images, our implementation is able to register in 8.56 s, which attained ~ 258 times speed-up. Our solution involves effective employment of GPU computation units, memory, and data bandwidth to resolve computation bottlenecks. CONCLUSION: The computation bottlenecks in diffeomorphic log-demons are pinpointed, analyzed, and resolved using various GPU performance-aware programming techniques. The proposed fast computation on basic image operations not only enhances the computation of diffeomorphic log-demons, but is also potentially extended to speed up many other intensity-based approaches. Our implementation is open-source on GitHub at https://bit.ly/2PYZxQz. Springer International Publishing 2021-01-23 2021 /pmc/articles/PMC7946684/ /pubmed/33484431 http://dx.doi.org/10.1007/s11548-020-02303-y Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Original Article
Leong, Martin C. W.
Lee, Kit-Hang
Kwan, Bowen P. Y.
Ng, Yui-Lun
Liu, Zhiyu
Navab, Nassir
Luk, Wayne
Kwok, Ka-Wai
Performance-aware programming for intraoperative intensity-based image registration on graphics processing units
title Performance-aware programming for intraoperative intensity-based image registration on graphics processing units
title_full Performance-aware programming for intraoperative intensity-based image registration on graphics processing units
title_fullStr Performance-aware programming for intraoperative intensity-based image registration on graphics processing units
title_full_unstemmed Performance-aware programming for intraoperative intensity-based image registration on graphics processing units
title_short Performance-aware programming for intraoperative intensity-based image registration on graphics processing units
title_sort performance-aware programming for intraoperative intensity-based image registration on graphics processing units
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946684/
https://www.ncbi.nlm.nih.gov/pubmed/33484431
http://dx.doi.org/10.1007/s11548-020-02303-y
work_keys_str_mv AT leongmartincw performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits
AT leekithang performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits
AT kwanbowenpy performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits
AT ngyuilun performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits
AT liuzhiyu performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits
AT navabnassir performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits
AT lukwayne performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits
AT kwokkawai performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits