Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space
The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) was developed to address a widespread community need for a unified computing environment for genomics data storage, management, and analysis. In this perspective, we present AnVIL, des...
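Main authors: | Schatz, Michael C.; Philippakis, Anthony A.; Afgan, Enis; Banks, Eric; Carey, Vincent J.; Carroll, Robert J.; Culotti, Alessandro; Ellrott, Kyle; Goecks, Jeremy; Grossman, Robert L.; Hall, Ira M.; Hansen, Kasper D.; Lawson, Jonathan; Leek, Jeffrey T.; Luria, Anne O’Donnell; Mosher, Stephen; Morgan, Martin; Nekrutenko, Anton; O’Connor, Brian D.; Osborn, Kevin; Paten, Benedict; Patterson, Candace; Tan, Frederick J.; Taylor, Casey Overby; Vessio, Jennifer; Waldron, Levi; Wang, Ting; Wuichet, Kristin |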
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8863334/ https://www.ncbi.nlm.nih.gov/pubmed/35199087 http://dx.doi.org/10.1016/j.xgen.2021.100085 |
_version_ | 1784655219131416576 |
---|---|
author | Schatz, Michael C.; Philippakis, Anthony A.; Afgan, Enis; Banks, Eric; Carey, Vincent J.; Carroll, Robert J.; Culotti, Alessandro; Ellrott, Kyle; Goecks, Jeremy; Grossman, Robert L.; Hall, Ira M.; Hansen, Kasper D.; Lawson, Jonathan; Leek, Jeffrey T.; Luria, Anne O’Donnell; Mosher, Stephen; Morgan, Martin; Nekrutenko, Anton; O’Connor, Brian D.; Osborn, Kevin; Paten, Benedict; Patterson, Candace; Tan, Frederick J.; Taylor, Casey Overby; Vessio, Jennifer; Waldron, Levi; Wang, Ting; Wuichet, Kristin |
author_sort | Schatz, Michael C. |
collection | PubMed |
description | The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) was developed to address a widespread community need for a unified computing environment for genomics data storage, management, and analysis. In this perspective, we present AnVIL, describe its ecosystem and interoperability with other platforms, and highlight how this platform and associated initiatives contribute to improved genomic data sharing efforts. The AnVIL is a federated cloud platform designed to manage and store genomics and related data, enable population-scale analysis, and facilitate collaboration through the sharing of data, code, and analysis results. By inverting the traditional model of data sharing, the AnVIL eliminates the need for data movement while also adding security measures for active threat detection and monitoring and provides scalable, shared computing resources for any researcher. We describe the core data management and analysis components of the AnVIL, which currently consists of Terra, Gen3, Galaxy, RStudio/Bioconductor, Dockstore, and Jupyter, and describe several flagship genomics datasets available within the AnVIL. We continue to extend and innovate the AnVIL ecosystem by implementing new capabilities, including mechanisms for interoperability and responsible data sharing, while streamlining access management. The AnVIL opens many new opportunities for analysis, collaboration, and data sharing that are needed to drive research and to make discoveries through the joint analysis of hundreds of thousands to millions of genomes along with associated clinical and molecular data types. |
format | Online Article Text |
id | pubmed-8863334 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-8863334 2022-02-22 Cell Genom Perspective Elsevier 2022-01-13 /pmc/articles/PMC8863334/ /pubmed/35199087 http://dx.doi.org/10.1016/j.xgen.2021.100085 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
title | Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space |
topic | Perspective |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8863334/ https://www.ncbi.nlm.nih.gov/pubmed/35199087 http://dx.doi.org/10.1016/j.xgen.2021.100085 |