Cargando…

Trace: a high-throughput tomographic reconstruction engine for large-scale datasets

BACKGROUND: Modern synchrotron light sources and detectors produce data at such scale and complexity that large-scale computation is required to unleash their full power. One of the widely used imaging techniques that generates data at tens of gigabytes per second is computed tomography (CT). Althou...

Descripción completa

Detalles Bibliográficos
Autores principales: Bicer, Tekin, Gürsoy, Doğa, Andrade, Vincent De, Kettimuthu, Rajkumar, Scullin, William, Carlo, Francesco De, Foster, Ian T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5313579/
https://www.ncbi.nlm.nih.gov/pubmed/28261544
http://dx.doi.org/10.1186/s40679-017-0040-7
_version_ 1782508365313736704
author Bicer, Tekin
Gürsoy, Doğa
Andrade, Vincent De
Kettimuthu, Rajkumar
Scullin, William
Carlo, Francesco De
Foster, Ian T.
author_facet Bicer, Tekin
Gürsoy, Doğa
Andrade, Vincent De
Kettimuthu, Rajkumar
Scullin, William
Carlo, Francesco De
Foster, Ian T.
author_sort Bicer, Tekin
collection PubMed
description BACKGROUND: Modern synchrotron light sources and detectors produce data at such scale and complexity that large-scale computation is required to unleash their full power. One of the widely used imaging techniques that generates data at tens of gigabytes per second is computed tomography (CT). Although CT experiments result in rapid data generation, the analysis and reconstruction of the collected data may require hours or even days of computation time with a medium-sized workstation, which hinders the scientific progress that relies on the results of analysis. METHODS: We present Trace, a data-intensive computing engine that we have developed to enable high-performance implementation of iterative tomographic reconstruction algorithms for parallel computers. Trace provides fine-grained reconstruction of tomography datasets using both (thread-level) shared memory and (process-level) distributed memory parallelization. Trace utilizes a special data structure called replicated reconstruction object to maximize application performance. We also present the optimizations that we apply to the replicated reconstruction objects and evaluate them using tomography datasets collected at the Advanced Photon Source. RESULTS: Our experimental evaluations show that our optimizations and parallelization techniques can provide 158× speedup using 32 compute nodes (384 cores) over a single-core configuration and decrease the end-to-end processing time of a large sinogram (with 4501 × 1 × 22,400 dimensions) from 12.5 h to <5 min per iteration. CONCLUSION: The proposed tomographic reconstruction engine can efficiently process large-scale tomographic data using many compute nodes and minimize reconstruction times.
format Online
Article
Text
id pubmed-5313579
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-53135792017-03-01 Trace: a high-throughput tomographic reconstruction engine for large-scale datasets Bicer, Tekin Gürsoy, Doğa Andrade, Vincent De Kettimuthu, Rajkumar Scullin, William Carlo, Francesco De Foster, Ian T. Adv Struct Chem Imaging Research BACKGROUND: Modern synchrotron light sources and detectors produce data at such scale and complexity that large-scale computation is required to unleash their full power. One of the widely used imaging techniques that generates data at tens of gigabytes per second is computed tomography (CT). Although CT experiments result in rapid data generation, the analysis and reconstruction of the collected data may require hours or even days of computation time with a medium-sized workstation, which hinders the scientific progress that relies on the results of analysis. METHODS: We present Trace, a data-intensive computing engine that we have developed to enable high-performance implementation of iterative tomographic reconstruction algorithms for parallel computers. Trace provides fine-grained reconstruction of tomography datasets using both (thread-level) shared memory and (process-level) distributed memory parallelization. Trace utilizes a special data structure called replicated reconstruction object to maximize application performance. We also present the optimizations that we apply to the replicated reconstruction objects and evaluate them using tomography datasets collected at the Advanced Photon Source. RESULTS: Our experimental evaluations show that our optimizations and parallelization techniques can provide 158× speedup using 32 compute nodes (384 cores) over a single-core configuration and decrease the end-to-end processing time of a large sinogram (with 4501 × 1 × 22,400 dimensions) from 12.5 h to <5 min per iteration. CONCLUSION: The proposed tomographic reconstruction engine can efficiently process large-scale tomographic data using many compute nodes and minimize reconstruction times. Springer International Publishing 2017-01-28 2017 /pmc/articles/PMC5313579/ /pubmed/28261544 http://dx.doi.org/10.1186/s40679-017-0040-7 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research
Bicer, Tekin
Gürsoy, Doğa
Andrade, Vincent De
Kettimuthu, Rajkumar
Scullin, William
Carlo, Francesco De
Foster, Ian T.
Trace: a high-throughput tomographic reconstruction engine for large-scale datasets
title Trace: a high-throughput tomographic reconstruction engine for large-scale datasets
title_full Trace: a high-throughput tomographic reconstruction engine for large-scale datasets
title_fullStr Trace: a high-throughput tomographic reconstruction engine for large-scale datasets
title_full_unstemmed Trace: a high-throughput tomographic reconstruction engine for large-scale datasets
title_short Trace: a high-throughput tomographic reconstruction engine for large-scale datasets
title_sort trace: a high-throughput tomographic reconstruction engine for large-scale datasets
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5313579/
https://www.ncbi.nlm.nih.gov/pubmed/28261544
http://dx.doi.org/10.1186/s40679-017-0040-7
work_keys_str_mv AT bicertekin traceahighthroughputtomographicreconstructionengineforlargescaledatasets
AT gursoydoga traceahighthroughputtomographicreconstructionengineforlargescaledatasets
AT andradevincentde traceahighthroughputtomographicreconstructionengineforlargescaledatasets
AT kettimuthurajkumar traceahighthroughputtomographicreconstructionengineforlargescaledatasets
AT scullinwilliam traceahighthroughputtomographicreconstructionengineforlargescaledatasets
AT carlofrancescode traceahighthroughputtomographicreconstructionengineforlargescaledatasets
AT fosteriant traceahighthroughputtomographicreconstructionengineforlargescaledatasets