Cargando…

In situ and in-transit analysis of cosmological simulations

Modern cosmological simulations have reached the trillion-element scale, rendering data storage and subsequent analysis formidable tasks. To address this circumstance, we present a new MPI-parallel approach for analysis of simulation data while the simulation runs, as an alternative to the tradition...

Descripción completa

Detalles Bibliográficos
Autores principales: Friesen, Brian, Almgren, Ann, Lukić, Zarija, Weber, Gunther, Morozov, Dmitriy, Beckner, Vincent, Day, Marcus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511997/
https://www.ncbi.nlm.nih.gov/pubmed/31149559
http://dx.doi.org/10.1186/s40668-016-0017-2
_version_ 1783417627624865792
author Friesen, Brian
Almgren, Ann
Lukić, Zarija
Weber, Gunther
Morozov, Dmitriy
Beckner, Vincent
Day, Marcus
author_facet Friesen, Brian
Almgren, Ann
Lukić, Zarija
Weber, Gunther
Morozov, Dmitriy
Beckner, Vincent
Day, Marcus
author_sort Friesen, Brian
collection PubMed
description Modern cosmological simulations have reached the trillion-element scale, rendering data storage and subsequent analysis formidable tasks. To address this circumstance, we present a new MPI-parallel approach for analysis of simulation data while the simulation runs, as an alternative to the traditional workflow consisting of periodically saving large data sets to disk for subsequent ‘offline’ analysis. We demonstrate this approach in the compressible gasdynamics/N-body code Nyx, a hybrid [Formula: see text] code based on the BoxLib framework, used for large-scale cosmological simulations. We have enabled on-the-fly workflows in two different ways: one is a straightforward approach consisting of all MPI processes periodically halting the main simulation and analyzing each component of data that they own (‘in situ’). The other consists of partitioning processes into disjoint MPI groups, with one performing the simulation and periodically sending data to the other ‘sidecar’ group, which post-processes it while the simulation continues (‘in-transit’). The two groups execute their tasks asynchronously, stopping only to synchronize when a new set of simulation data needs to be analyzed. For both the in situ and in-transit approaches, we experiment with two different analysis suites with distinct performance behavior: one which finds dark matter halos in the simulation using merge trees to calculate the mass contained within iso-density contours, and another which calculates probability distribution functions and power spectra of various fields in the simulation. Both are common analysis tasks for cosmology, and both result in summary statistics significantly smaller than the original data set. We study the behavior of each type of analysis in each workflow in order to determine the optimal configuration for the different data analysis algorithms.
format Online
Article
Text
id pubmed-6511997
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-65119972019-05-28 In situ and in-transit analysis of cosmological simulations Friesen, Brian Almgren, Ann Lukić, Zarija Weber, Gunther Morozov, Dmitriy Beckner, Vincent Day, Marcus Comput Astrophys Cosmol Research Modern cosmological simulations have reached the trillion-element scale, rendering data storage and subsequent analysis formidable tasks. To address this circumstance, we present a new MPI-parallel approach for analysis of simulation data while the simulation runs, as an alternative to the traditional workflow consisting of periodically saving large data sets to disk for subsequent ‘offline’ analysis. We demonstrate this approach in the compressible gasdynamics/N-body code Nyx, a hybrid [Formula: see text] code based on the BoxLib framework, used for large-scale cosmological simulations. We have enabled on-the-fly workflows in two different ways: one is a straightforward approach consisting of all MPI processes periodically halting the main simulation and analyzing each component of data that they own (‘in situ’). The other consists of partitioning processes into disjoint MPI groups, with one performing the simulation and periodically sending data to the other ‘sidecar’ group, which post-processes it while the simulation continues (‘in-transit’). The two groups execute their tasks asynchronously, stopping only to synchronize when a new set of simulation data needs to be analyzed. For both the in situ and in-transit approaches, we experiment with two different analysis suites with distinct performance behavior: one which finds dark matter halos in the simulation using merge trees to calculate the mass contained within iso-density contours, and another which calculates probability distribution functions and power spectra of various fields in the simulation. Both are common analysis tasks for cosmology, and both result in summary statistics significantly smaller than the original data set. We study the behavior of each type of analysis in each workflow in order to determine the optimal configuration for the different data analysis algorithms. Springer International Publishing 2016-08-24 2016 /pmc/articles/PMC6511997/ /pubmed/31149559 http://dx.doi.org/10.1186/s40668-016-0017-2 Text en © Friesen et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research
Friesen, Brian
Almgren, Ann
Lukić, Zarija
Weber, Gunther
Morozov, Dmitriy
Beckner, Vincent
Day, Marcus
In situ and in-transit analysis of cosmological simulations
title In situ and in-transit analysis of cosmological simulations
title_full In situ and in-transit analysis of cosmological simulations
title_fullStr In situ and in-transit analysis of cosmological simulations
title_full_unstemmed In situ and in-transit analysis of cosmological simulations
title_short In situ and in-transit analysis of cosmological simulations
title_sort in situ and in-transit analysis of cosmological simulations
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511997/
https://www.ncbi.nlm.nih.gov/pubmed/31149559
http://dx.doi.org/10.1186/s40668-016-0017-2
work_keys_str_mv AT friesenbrian insituandintransitanalysisofcosmologicalsimulations
AT almgrenann insituandintransitanalysisofcosmologicalsimulations
AT lukiczarija insituandintransitanalysisofcosmologicalsimulations
AT webergunther insituandintransitanalysisofcosmologicalsimulations
AT morozovdmitriy insituandintransitanalysisofcosmologicalsimulations
AT becknervincent insituandintransitanalysisofcosmologicalsimulations
AT daymarcus insituandintransitanalysisofcosmologicalsimulations