Cargando…
Performance-aware programming for intraoperative intensity-based image registration on graphics processing units
PURPOSE: Intensity-based image registration has been proven essential in many applications accredited to its unparalleled ability to resolve image misalignments. However, long registration time for image realignment prohibits its use in intra-operative navigation systems. There has been much work on...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946684/ https://www.ncbi.nlm.nih.gov/pubmed/33484431 http://dx.doi.org/10.1007/s11548-020-02303-y |
_version_ | 1783663101678190592 |
---|---|
author | Leong, Martin C. W. Lee, Kit-Hang Kwan, Bowen P. Y. Ng, Yui-Lun Liu, Zhiyu Navab, Nassir Luk, Wayne Kwok, Ka-Wai |
author_facet | Leong, Martin C. W. Lee, Kit-Hang Kwan, Bowen P. Y. Ng, Yui-Lun Liu, Zhiyu Navab, Nassir Luk, Wayne Kwok, Ka-Wai |
author_sort | Leong, Martin C. W. |
collection | PubMed |
description | PURPOSE: Intensity-based image registration has been proven essential in many applications accredited to its unparalleled ability to resolve image misalignments. However, long registration time for image realignment prohibits its use in intra-operative navigation systems. There has been much work on accelerating the registration process by improving the algorithm’s robustness, but the innate computation required by the registration algorithm has been unresolved. METHODS: Intensity-based registration methods involve operations with high arithmetic load and memory access demand, which supposes to be reduced by graphics processing units (GPUs). Although GPUs are widespread and affordable, there is a lack of open-source GPU implementations optimized for non-rigid image registration. This paper demonstrates performance-aware programming techniques, which involves systematic exploitation of GPU features, by implementing the diffeomorphic log-demons algorithm. RESULTS: By resolving the pinpointed computation bottlenecks on GPU, our implementation of diffeomorphic log-demons on Nvidia GTX Titan X GPU has achieved ~ 95 times speed-up compared to the CPU and registered a 1.3-M voxel image in 286 ms. Even for large 37-M voxel images, our implementation is able to register in 8.56 s, which attained ~ 258 times speed-up. Our solution involves effective employment of GPU computation units, memory, and data bandwidth to resolve computation bottlenecks. CONCLUSION: The computation bottlenecks in diffeomorphic log-demons are pinpointed, analyzed, and resolved using various GPU performance-aware programming techniques. The proposed fast computation on basic image operations not only enhances the computation of diffeomorphic log-demons, but is also potentially extended to speed up many other intensity-based approaches. Our implementation is open-source on GitHub at https://bit.ly/2PYZxQz. |
format | Online Article Text |
id | pubmed-7946684 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-79466842021-03-28 Performance-aware programming for intraoperative intensity-based image registration on graphics processing units Leong, Martin C. W. Lee, Kit-Hang Kwan, Bowen P. Y. Ng, Yui-Lun Liu, Zhiyu Navab, Nassir Luk, Wayne Kwok, Ka-Wai Int J Comput Assist Radiol Surg Original Article PURPOSE: Intensity-based image registration has been proven essential in many applications accredited to its unparalleled ability to resolve image misalignments. However, long registration time for image realignment prohibits its use in intra-operative navigation systems. There has been much work on accelerating the registration process by improving the algorithm’s robustness, but the innate computation required by the registration algorithm has been unresolved. METHODS: Intensity-based registration methods involve operations with high arithmetic load and memory access demand, which supposes to be reduced by graphics processing units (GPUs). Although GPUs are widespread and affordable, there is a lack of open-source GPU implementations optimized for non-rigid image registration. This paper demonstrates performance-aware programming techniques, which involves systematic exploitation of GPU features, by implementing the diffeomorphic log-demons algorithm. RESULTS: By resolving the pinpointed computation bottlenecks on GPU, our implementation of diffeomorphic log-demons on Nvidia GTX Titan X GPU has achieved ~ 95 times speed-up compared to the CPU and registered a 1.3-M voxel image in 286 ms. Even for large 37-M voxel images, our implementation is able to register in 8.56 s, which attained ~ 258 times speed-up. Our solution involves effective employment of GPU computation units, memory, and data bandwidth to resolve computation bottlenecks. CONCLUSION: The computation bottlenecks in diffeomorphic log-demons are pinpointed, analyzed, and resolved using various GPU performance-aware programming techniques. The proposed fast computation on basic image operations not only enhances the computation of diffeomorphic log-demons, but is also potentially extended to speed up many other intensity-based approaches. Our implementation is open-source on GitHub at https://bit.ly/2PYZxQz. Springer International Publishing 2021-01-23 2021 /pmc/articles/PMC7946684/ /pubmed/33484431 http://dx.doi.org/10.1007/s11548-020-02303-y Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Original Article Leong, Martin C. W. Lee, Kit-Hang Kwan, Bowen P. Y. Ng, Yui-Lun Liu, Zhiyu Navab, Nassir Luk, Wayne Kwok, Ka-Wai Performance-aware programming for intraoperative intensity-based image registration on graphics processing units |
title | Performance-aware programming for intraoperative intensity-based image registration on graphics processing units |
title_full | Performance-aware programming for intraoperative intensity-based image registration on graphics processing units |
title_fullStr | Performance-aware programming for intraoperative intensity-based image registration on graphics processing units |
title_full_unstemmed | Performance-aware programming for intraoperative intensity-based image registration on graphics processing units |
title_short | Performance-aware programming for intraoperative intensity-based image registration on graphics processing units |
title_sort | performance-aware programming for intraoperative intensity-based image registration on graphics processing units |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946684/ https://www.ncbi.nlm.nih.gov/pubmed/33484431 http://dx.doi.org/10.1007/s11548-020-02303-y |
work_keys_str_mv | AT leongmartincw performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits AT leekithang performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits AT kwanbowenpy performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits AT ngyuilun performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits AT liuzhiyu performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits AT navabnassir performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits AT lukwayne performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits AT kwokkawai performanceawareprogrammingforintraoperativeintensitybasedimageregistrationongraphicsprocessingunits |