Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space

The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) was developed to address a widespread community need for a unified computing environment for genomics data storage, management, and analysis. In this perspective, we present AnVIL, des...

Bibliographic Details
Main Authors: Schatz, Michael C., Philippakis, Anthony A., Afgan, Enis, Banks, Eric, Carey, Vincent J., Carroll, Robert J., Culotti, Alessandro, Ellrott, Kyle, Goecks, Jeremy, Grossman, Robert L., Hall, Ira M., Hansen, Kasper D., Lawson, Jonathan, Leek, Jeffrey T., Luria, Anne O’Donnell, Mosher, Stephen, Morgan, Martin, Nekrutenko, Anton, O’Connor, Brian D., Osborn, Kevin, Paten, Benedict, Patterson, Candace, Tan, Frederick J., Taylor, Casey Overby, Vessio, Jennifer, Waldron, Levi, Wang, Ting, Wuichet, Kristin
Format: Online Article Text
Language: English
Published: Elsevier 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8863334/
https://www.ncbi.nlm.nih.gov/pubmed/35199087
http://dx.doi.org/10.1016/j.xgen.2021.100085
_version_ 1784655219131416576
author Schatz, Michael C.
Philippakis, Anthony A.
Afgan, Enis
Banks, Eric
Carey, Vincent J.
Carroll, Robert J.
Culotti, Alessandro
Ellrott, Kyle
Goecks, Jeremy
Grossman, Robert L.
Hall, Ira M.
Hansen, Kasper D.
Lawson, Jonathan
Leek, Jeffrey T.
Luria, Anne O’Donnell
Mosher, Stephen
Morgan, Martin
Nekrutenko, Anton
O’Connor, Brian D.
Osborn, Kevin
Paten, Benedict
Patterson, Candace
Tan, Frederick J.
Taylor, Casey Overby
Vessio, Jennifer
Waldron, Levi
Wang, Ting
Wuichet, Kristin
author_facet Schatz, Michael C.
Philippakis, Anthony A.
Afgan, Enis
Banks, Eric
Carey, Vincent J.
Carroll, Robert J.
Culotti, Alessandro
Ellrott, Kyle
Goecks, Jeremy
Grossman, Robert L.
Hall, Ira M.
Hansen, Kasper D.
Lawson, Jonathan
Leek, Jeffrey T.
Luria, Anne O’Donnell
Mosher, Stephen
Morgan, Martin
Nekrutenko, Anton
O’Connor, Brian D.
Osborn, Kevin
Paten, Benedict
Patterson, Candace
Tan, Frederick J.
Taylor, Casey Overby
Vessio, Jennifer
Waldron, Levi
Wang, Ting
Wuichet, Kristin
author_sort Schatz, Michael C.
collection PubMed
description The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) was developed to address a widespread community need for a unified computing environment for genomics data storage, management, and analysis. In this perspective, we present AnVIL, describe its ecosystem and interoperability with other platforms, and highlight how this platform and associated initiatives contribute to improved genomic data sharing efforts. The AnVIL is a federated cloud platform designed to manage and store genomics and related data, enable population-scale analysis, and facilitate collaboration through the sharing of data, code, and analysis results. By inverting the traditional model of data sharing, the AnVIL eliminates the need for data movement while also adding security measures for active threat detection and monitoring and provides scalable, shared computing resources for any researcher. We describe the core data management and analysis components of the AnVIL, which currently consists of Terra, Gen3, Galaxy, RStudio/Bioconductor, Dockstore, and Jupyter, and describe several flagship genomics datasets available within the AnVIL. We continue to extend and innovate the AnVIL ecosystem by implementing new capabilities, including mechanisms for interoperability and responsible data sharing, while streamlining access management. The AnVIL opens many new opportunities for analysis, collaboration, and data sharing that are needed to drive research and to make discoveries through the joint analysis of hundreds of thousands to millions of genomes along with associated clinical and molecular data types.
format Online
Article
Text
id pubmed-8863334
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-88633342022-02-22 Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space Schatz, Michael C. Philippakis, Anthony A. Afgan, Enis Banks, Eric Carey, Vincent J. Carroll, Robert J. Culotti, Alessandro Ellrott, Kyle Goecks, Jeremy Grossman, Robert L. Hall, Ira M. Hansen, Kasper D. Lawson, Jonathan Leek, Jeffrey T. Luria, Anne O’Donnell Mosher, Stephen Morgan, Martin Nekrutenko, Anton O’Connor, Brian D. Osborn, Kevin Paten, Benedict Patterson, Candace Tan, Frederick J. Taylor, Casey Overby Vessio, Jennifer Waldron, Levi Wang, Ting Wuichet, Kristin Cell Genom Perspective The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) was developed to address a widespread community need for a unified computing environment for genomics data storage, management, and analysis. In this perspective, we present AnVIL, describe its ecosystem and interoperability with other platforms, and highlight how this platform and associated initiatives contribute to improved genomic data sharing efforts. The AnVIL is a federated cloud platform designed to manage and store genomics and related data, enable population-scale analysis, and facilitate collaboration through the sharing of data, code, and analysis results. By inverting the traditional model of data sharing, the AnVIL eliminates the need for data movement while also adding security measures for active threat detection and monitoring and provides scalable, shared computing resources for any researcher. We describe the core data management and analysis components of the AnVIL, which currently consists of Terra, Gen3, Galaxy, RStudio/Bioconductor, Dockstore, and Jupyter, and describe several flagship genomics datasets available within the AnVIL. 
We continue to extend and innovate the AnVIL ecosystem by implementing new capabilities, including mechanisms for interoperability and responsible data sharing, while streamlining access management. The AnVIL opens many new opportunities for analysis, collaboration, and data sharing that are needed to drive research and to make discoveries through the joint analysis of hundreds of thousands to millions of genomes along with associated clinical and molecular data types. Elsevier 2022-01-13 /pmc/articles/PMC8863334/ /pubmed/35199087 http://dx.doi.org/10.1016/j.xgen.2021.100085 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Perspective
Schatz, Michael C.
Philippakis, Anthony A.
Afgan, Enis
Banks, Eric
Carey, Vincent J.
Carroll, Robert J.
Culotti, Alessandro
Ellrott, Kyle
Goecks, Jeremy
Grossman, Robert L.
Hall, Ira M.
Hansen, Kasper D.
Lawson, Jonathan
Leek, Jeffrey T.
Luria, Anne O’Donnell
Mosher, Stephen
Morgan, Martin
Nekrutenko, Anton
O’Connor, Brian D.
Osborn, Kevin
Paten, Benedict
Patterson, Candace
Tan, Frederick J.
Taylor, Casey Overby
Vessio, Jennifer
Waldron, Levi
Wang, Ting
Wuichet, Kristin
Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space
title Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space
title_full Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space
title_fullStr Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space
title_full_unstemmed Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space
title_short Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space
title_sort inverting the model of genomics data sharing with the nhgri genomic data science analysis, visualization, and informatics lab-space
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8863334/
https://www.ncbi.nlm.nih.gov/pubmed/35199087
http://dx.doi.org/10.1016/j.xgen.2021.100085