Go Get Data (GGD) is a framework that facilitates reproducible access to genomic data

Bibliographic Details
Main authors: Cormier, Michael J.; Belyeu, Jonathan R.; Pedersen, Brent S.; Brown, Joseph; Köster, Johannes; Quinlan, Aaron R.
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8041854/
https://www.ncbi.nlm.nih.gov/pubmed/33846313
http://dx.doi.org/10.1038/s41467-021-22381-z
Description
Summary: The rapid increase in the amount of genomic data provides researchers with an opportunity to integrate diverse datasets and annotations when addressing a wide range of biological questions. However, genomic datasets are deposited on different platforms and are stored in numerous formats from multiple genome builds, which complicates the task of collecting, annotating, transforming, and integrating data as needed. Here, we developed Go Get Data (GGD) as a fast, reproducible approach to installing standardized data recipes. GGD is available on GitHub (https://gogetdata.github.io/), is extendable to other data types, and can streamline the complexities typically associated with data integration, saving researchers time and improving research reproducibility.