Cargando…
In situ and in-transit analysis of cosmological simulations
Modern cosmological simulations have reached the trillion-element scale, rendering data storage and subsequent analysis formidable tasks. To address this circumstance, we present a new MPI-parallel approach for analysis of simulation data while the simulation runs, as an alternative to the tradition...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511997/ https://www.ncbi.nlm.nih.gov/pubmed/31149559 http://dx.doi.org/10.1186/s40668-016-0017-2 |
_version_ | 1783417627624865792 |
---|---|
author | Friesen, Brian Almgren, Ann Lukić, Zarija Weber, Gunther Morozov, Dmitriy Beckner, Vincent Day, Marcus |
author_facet | Friesen, Brian Almgren, Ann Lukić, Zarija Weber, Gunther Morozov, Dmitriy Beckner, Vincent Day, Marcus |
author_sort | Friesen, Brian |
collection | PubMed |
description | Modern cosmological simulations have reached the trillion-element scale, rendering data storage and subsequent analysis formidable tasks. To address this circumstance, we present a new MPI-parallel approach for analysis of simulation data while the simulation runs, as an alternative to the traditional workflow consisting of periodically saving large data sets to disk for subsequent ‘offline’ analysis. We demonstrate this approach in the compressible gasdynamics/N-body code Nyx, a hybrid [Formula: see text] code based on the BoxLib framework, used for large-scale cosmological simulations. We have enabled on-the-fly workflows in two different ways: one is a straightforward approach consisting of all MPI processes periodically halting the main simulation and analyzing each component of data that they own (‘in situ’). The other consists of partitioning processes into disjoint MPI groups, with one performing the simulation and periodically sending data to the other ‘sidecar’ group, which post-processes it while the simulation continues (‘in-transit’). The two groups execute their tasks asynchronously, stopping only to synchronize when a new set of simulation data needs to be analyzed. For both the in situ and in-transit approaches, we experiment with two different analysis suites with distinct performance behavior: one which finds dark matter halos in the simulation using merge trees to calculate the mass contained within iso-density contours, and another which calculates probability distribution functions and power spectra of various fields in the simulation. Both are common analysis tasks for cosmology, and both result in summary statistics significantly smaller than the original data set. We study the behavior of each type of analysis in each workflow in order to determine the optimal configuration for the different data analysis algorithms. |
format | Online Article Text |
id | pubmed-6511997 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-65119972019-05-28 In situ and in-transit analysis of cosmological simulations Friesen, Brian Almgren, Ann Lukić, Zarija Weber, Gunther Morozov, Dmitriy Beckner, Vincent Day, Marcus Comput Astrophys Cosmol Research Modern cosmological simulations have reached the trillion-element scale, rendering data storage and subsequent analysis formidable tasks. To address this circumstance, we present a new MPI-parallel approach for analysis of simulation data while the simulation runs, as an alternative to the traditional workflow consisting of periodically saving large data sets to disk for subsequent ‘offline’ analysis. We demonstrate this approach in the compressible gasdynamics/N-body code Nyx, a hybrid [Formula: see text] code based on the BoxLib framework, used for large-scale cosmological simulations. We have enabled on-the-fly workflows in two different ways: one is a straightforward approach consisting of all MPI processes periodically halting the main simulation and analyzing each component of data that they own (‘in situ’). The other consists of partitioning processes into disjoint MPI groups, with one performing the simulation and periodically sending data to the other ‘sidecar’ group, which post-processes it while the simulation continues (‘in-transit’). The two groups execute their tasks asynchronously, stopping only to synchronize when a new set of simulation data needs to be analyzed. For both the in situ and in-transit approaches, we experiment with two different analysis suites with distinct performance behavior: one which finds dark matter halos in the simulation using merge trees to calculate the mass contained within iso-density contours, and another which calculates probability distribution functions and power spectra of various fields in the simulation. Both are common analysis tasks for cosmology, and both result in summary statistics significantly smaller than the original data set. We study the behavior of each type of analysis in each workflow in order to determine the optimal configuration for the different data analysis algorithms. Springer International Publishing 2016-08-24 2016 /pmc/articles/PMC6511997/ /pubmed/31149559 http://dx.doi.org/10.1186/s40668-016-0017-2 Text en © Friesen et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Research Friesen, Brian Almgren, Ann Lukić, Zarija Weber, Gunther Morozov, Dmitriy Beckner, Vincent Day, Marcus In situ and in-transit analysis of cosmological simulations |
title | In situ and in-transit analysis of cosmological simulations |
title_full | In situ and in-transit analysis of cosmological simulations |
title_fullStr | In situ and in-transit analysis of cosmological simulations |
title_full_unstemmed | In situ and in-transit analysis of cosmological simulations |
title_short | In situ and in-transit analysis of cosmological simulations |
title_sort | in situ and in-transit analysis of cosmological simulations |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511997/ https://www.ncbi.nlm.nih.gov/pubmed/31149559 http://dx.doi.org/10.1186/s40668-016-0017-2 |
work_keys_str_mv | AT friesenbrian insituandintransitanalysisofcosmologicalsimulations AT almgrenann insituandintransitanalysisofcosmologicalsimulations AT lukiczarija insituandintransitanalysisofcosmologicalsimulations AT webergunther insituandintransitanalysisofcosmologicalsimulations AT morozovdmitriy insituandintransitanalysisofcosmologicalsimulations AT becknervincent insituandintransitanalysisofcosmologicalsimulations AT daymarcus insituandintransitanalysisofcosmologicalsimulations |