Cargando…

Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers

Here we present Singularity Hub, a framework to build and deploy Singularity containers for mobility of compute, and the singularity-python software with novel metrics for assessing reproducibility of such containers. Singularity containers make it possible for scientists and developers to package r...

Descripción completa

Detalles Bibliográficos
Autores principales: Sochat, Vanessa V., Prybol, Cameron J., Kurtzer, Gregory M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5706697/
https://www.ncbi.nlm.nih.gov/pubmed/29186161
http://dx.doi.org/10.1371/journal.pone.0188511
_version_ 1783282270196465664
author Sochat, Vanessa V.
Prybol, Cameron J.
Kurtzer, Gregory M.
author_facet Sochat, Vanessa V.
Prybol, Cameron J.
Kurtzer, Gregory M.
author_sort Sochat, Vanessa V.
collection PubMed
description Here we present Singularity Hub, a framework to build and deploy Singularity containers for mobility of compute, and the singularity-python software with novel metrics for assessing reproducibility of such containers. Singularity containers make it possible for scientists and developers to package reproducible software, and Singularity Hub adds automation to this workflow by building, capturing metadata for, visualizing, and serving containers programmatically. Our novel metrics, based on custom filters of content hashes of container contents, allow for comparison of an entire container, including operating system, custom software, and metadata. First we will review Singularity Hub’s primary use cases and how the infrastructure has been designed to support modern, common workflows. Next, we conduct three analyses to demonstrate build consistency, reproducibility metric and performance and interpretability, and potential for discovery. This is the first effort to demonstrate a rigorous assessment of measurable similarity between containers and operating systems. We provide these capabilities within Singularity Hub, as well as the source software singularity-python that provides the underlying functionality. Singularity Hub is available at https://singularity-hub.org, and we are excited to provide it as an openly available platform for building, and deploying scientific containers.
format Online
Article
Text
id pubmed-5706697
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-57066972017-12-08 Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers Sochat, Vanessa V. Prybol, Cameron J. Kurtzer, Gregory M. PLoS One Research Article Here we present Singularity Hub, a framework to build and deploy Singularity containers for mobility of compute, and the singularity-python software with novel metrics for assessing reproducibility of such containers. Singularity containers make it possible for scientists and developers to package reproducible software, and Singularity Hub adds automation to this workflow by building, capturing metadata for, visualizing, and serving containers programmatically. Our novel metrics, based on custom filters of content hashes of container contents, allow for comparison of an entire container, including operating system, custom software, and metadata. First we will review Singularity Hub’s primary use cases and how the infrastructure has been designed to support modern, common workflows. Next, we conduct three analyses to demonstrate build consistency, reproducibility metric and performance and interpretability, and potential for discovery. This is the first effort to demonstrate a rigorous assessment of measurable similarity between containers and operating systems. We provide these capabilities within Singularity Hub, as well as the source software singularity-python that provides the underlying functionality. Singularity Hub is available at https://singularity-hub.org, and we are excited to provide it as an openly available platform for building, and deploying scientific containers. Public Library of Science 2017-11-29 /pmc/articles/PMC5706697/ /pubmed/29186161 http://dx.doi.org/10.1371/journal.pone.0188511 Text en © 2017 Sochat et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sochat, Vanessa V.
Prybol, Cameron J.
Kurtzer, Gregory M.
Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers
title Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers
title_full Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers
title_fullStr Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers
title_full_unstemmed Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers
title_short Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers
title_sort enhancing reproducibility in scientific computing: metrics and registry for singularity containers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5706697/
https://www.ncbi.nlm.nih.gov/pubmed/29186161
http://dx.doi.org/10.1371/journal.pone.0188511
work_keys_str_mv AT sochatvanessav enhancingreproducibilityinscientificcomputingmetricsandregistryforsingularitycontainers
AT prybolcameronj enhancingreproducibilityinscientificcomputingmetricsandregistryforsingularitycontainers
AT kurtzergregorym enhancingreproducibilityinscientificcomputingmetricsandregistryforsingularitycontainers