Cargando…
Trace: a high-throughput tomographic reconstruction engine for large-scale datasets
BACKGROUND: Modern synchrotron light sources and detectors produce data at such scale and complexity that large-scale computation is required to unleash their full power. One of the widely used imaging techniques that generates data at tens of gigabytes per second is computed tomography (CT). Althou...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5313579/ https://www.ncbi.nlm.nih.gov/pubmed/28261544 http://dx.doi.org/10.1186/s40679-017-0040-7 |
_version_ | 1782508365313736704 |
---|---|
author | Bicer, Tekin Gürsoy, Doğa Andrade, Vincent De Kettimuthu, Rajkumar Scullin, William Carlo, Francesco De Foster, Ian T. |
author_facet | Bicer, Tekin Gürsoy, Doğa Andrade, Vincent De Kettimuthu, Rajkumar Scullin, William Carlo, Francesco De Foster, Ian T. |
author_sort | Bicer, Tekin |
collection | PubMed |
description | BACKGROUND: Modern synchrotron light sources and detectors produce data at such scale and complexity that large-scale computation is required to unleash their full power. One of the widely used imaging techniques that generates data at tens of gigabytes per second is computed tomography (CT). Although CT experiments result in rapid data generation, the analysis and reconstruction of the collected data may require hours or even days of computation time with a medium-sized workstation, which hinders the scientific progress that relies on the results of analysis. METHODS: We present Trace, a data-intensive computing engine that we have developed to enable high-performance implementation of iterative tomographic reconstruction algorithms for parallel computers. Trace provides fine-grained reconstruction of tomography datasets using both (thread-level) shared memory and (process-level) distributed memory parallelization. Trace utilizes a special data structure called replicated reconstruction object to maximize application performance. We also present the optimizations that we apply to the replicated reconstruction objects and evaluate them using tomography datasets collected at the Advanced Photon Source. RESULTS: Our experimental evaluations show that our optimizations and parallelization techniques can provide 158× speedup using 32 compute nodes (384 cores) over a single-core configuration and decrease the end-to-end processing time of a large sinogram (with 4501 × 1 × 22,400 dimensions) from 12.5 h to <5 min per iteration. CONCLUSION: The proposed tomographic reconstruction engine can efficiently process large-scale tomographic data using many compute nodes and minimize reconstruction times. |
format | Online Article Text |
id | pubmed-5313579 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-53135792017-03-01 Trace: a high-throughput tomographic reconstruction engine for large-scale datasets Bicer, Tekin Gürsoy, Doğa Andrade, Vincent De Kettimuthu, Rajkumar Scullin, William Carlo, Francesco De Foster, Ian T. Adv Struct Chem Imaging Research BACKGROUND: Modern synchrotron light sources and detectors produce data at such scale and complexity that large-scale computation is required to unleash their full power. One of the widely used imaging techniques that generates data at tens of gigabytes per second is computed tomography (CT). Although CT experiments result in rapid data generation, the analysis and reconstruction of the collected data may require hours or even days of computation time with a medium-sized workstation, which hinders the scientific progress that relies on the results of analysis. METHODS: We present Trace, a data-intensive computing engine that we have developed to enable high-performance implementation of iterative tomographic reconstruction algorithms for parallel computers. Trace provides fine-grained reconstruction of tomography datasets using both (thread-level) shared memory and (process-level) distributed memory parallelization. Trace utilizes a special data structure called replicated reconstruction object to maximize application performance. We also present the optimizations that we apply to the replicated reconstruction objects and evaluate them using tomography datasets collected at the Advanced Photon Source. RESULTS: Our experimental evaluations show that our optimizations and parallelization techniques can provide 158× speedup using 32 compute nodes (384 cores) over a single-core configuration and decrease the end-to-end processing time of a large sinogram (with 4501 × 1 × 22,400 dimensions) from 12.5 h to <5 min per iteration. CONCLUSION: The proposed tomographic reconstruction engine can efficiently process large-scale tomographic data using many compute nodes and minimize reconstruction times. Springer International Publishing 2017-01-28 2017 /pmc/articles/PMC5313579/ /pubmed/28261544 http://dx.doi.org/10.1186/s40679-017-0040-7 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Research Bicer, Tekin Gürsoy, Doğa Andrade, Vincent De Kettimuthu, Rajkumar Scullin, William Carlo, Francesco De Foster, Ian T. Trace: a high-throughput tomographic reconstruction engine for large-scale datasets |
title | Trace: a high-throughput tomographic reconstruction engine for large-scale datasets |
title_full | Trace: a high-throughput tomographic reconstruction engine for large-scale datasets |
title_fullStr | Trace: a high-throughput tomographic reconstruction engine for large-scale datasets |
title_full_unstemmed | Trace: a high-throughput tomographic reconstruction engine for large-scale datasets |
title_short | Trace: a high-throughput tomographic reconstruction engine for large-scale datasets |
title_sort | trace: a high-throughput tomographic reconstruction engine for large-scale datasets |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5313579/ https://www.ncbi.nlm.nih.gov/pubmed/28261544 http://dx.doi.org/10.1186/s40679-017-0040-7 |
work_keys_str_mv | AT bicertekin traceahighthroughputtomographicreconstructionengineforlargescaledatasets AT gursoydoga traceahighthroughputtomographicreconstructionengineforlargescaledatasets AT andradevincentde traceahighthroughputtomographicreconstructionengineforlargescaledatasets AT kettimuthurajkumar traceahighthroughputtomographicreconstructionengineforlargescaledatasets AT scullinwilliam traceahighthroughputtomographicreconstructionengineforlargescaledatasets AT carlofrancescode traceahighthroughputtomographicreconstructionengineforlargescaledatasets AT fosteriant traceahighthroughputtomographicreconstructionengineforlargescaledatasets |