
SnoVault and encodeD: A novel object-based storage system and applications to ENCODE metadata

The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a comprehensive catalog of functional elements initiated shortly after the completion of the Human Genome Project. The current database exceeds 6500 experiments across more than 450 cell lines and tissues...


Bibliographic Details
Main Authors: Hitz, Benjamin C., Rowe, Laurence D., Podduturi, Nikhil R., Glick, David I., Baymuradov, Ulugbek K., Malladi, Venkat S., Chan, Esther T., Davidson, Jean M., Gabdank, Idan, Narayana, Aditi K., Onate, Kathrina C., Hilton, Jason, Ho, Marcus C., Lee, Brian T., Miyasato, Stuart R., Dreszer, Timothy R., Sloan, Cricket A., Strattan, J. Seth, Tanaka, Forrest Y., Hong, Eurie L., Cherry, J. Michael
Format: Online Article Text
Language: English
Published: Public Library of Science 2017
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5389787/
https://www.ncbi.nlm.nih.gov/pubmed/28403240
http://dx.doi.org/10.1371/journal.pone.0175310
_version_ 1782521334978314240
author Hitz, Benjamin C.
Rowe, Laurence D.
Podduturi, Nikhil R.
Glick, David I.
Baymuradov, Ulugbek K.
Malladi, Venkat S.
Chan, Esther T.
Davidson, Jean M.
Gabdank, Idan
Narayana, Aditi K.
Onate, Kathrina C.
Hilton, Jason
Ho, Marcus C.
Lee, Brian T.
Miyasato, Stuart R.
Dreszer, Timothy R.
Sloan, Cricket A.
Strattan, J. Seth
Tanaka, Forrest Y.
Hong, Eurie L.
Cherry, J. Michael
author_sort Hitz, Benjamin C.
collection PubMed
description The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a comprehensive catalog of functional elements, initiated shortly after the completion of the Human Genome Project. The current database exceeds 6500 experiments across more than 450 cell lines and tissues, using a wide array of experimental techniques to study the chromatin structure and the regulatory and transcriptional landscape of the H. sapiens and M. musculus genomes. All ENCODE experimental data, metadata, and associated computational analyses are submitted to the ENCODE Data Coordination Center (DCC) for validation, tracking, storage, unified processing, and distribution to community resources and the scientific community. As the volume of data increases, the identification and organization of experimental details become increasingly intricate and demand careful curation. The ENCODE DCC has created a general-purpose software system, known as SnoVault, that supports metadata and file submission, a database used for metadata storage, web pages for displaying the metadata, and a robust API for querying the metadata. The software is fully open-source; code and installation instructions can be found at http://github.com/ENCODE-DCC/snovault/ (for the generic database) and http://github.com/ENCODE-DCC/encoded/ (for storing genomic data in the manner of ENCODE). The core database engine, SnoVault (which is completely independent of ENCODE, genomic data, or bioinformatic data), has been released as a separate Python package.
format Online
Article
Text
id pubmed-5389787
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-5389787 2017-05-03 SnoVault and encodeD: A novel object-based storage system and applications to ENCODE metadata. PLoS One. Research Article. Public Library of Science 2017-04-12 /pmc/articles/PMC5389787/ /pubmed/28403240 http://dx.doi.org/10.1371/journal.pone.0175310 Text en © 2017 Hitz et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title SnoVault and encodeD: A novel object-based storage system and applications to ENCODE metadata
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5389787/
https://www.ncbi.nlm.nih.gov/pubmed/28403240
http://dx.doi.org/10.1371/journal.pone.0175310