SnoVault and encodeD: A novel object-based storage system and applications to ENCODE metadata
The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a comprehensive catalog of functional elements initiated shortly after the completion of the Human Genome Project. The current database exceeds 6500 experiments across more than 450 cell lines and tissues...
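Main authors: | Hitz, Benjamin C.; Rowe, Laurence D.; Podduturi, Nikhil R.; Glick, David I.; Baymuradov, Ulugbek K.; Malladi, Venkat S.; Chan, Esther T.; Davidson, Jean M.; Gabdank, Idan; Narayana, Aditi K.; Onate, Kathrina C.; Hilton, Jason; Ho, Marcus C.; Lee, Brian T.; Miyasato, Stuart R.; Dreszer, Timothy R.; Sloan, Cricket A.; Strattan, J. Seth; Tanaka, Forrest Y.; Hong, Eurie L.; Cherry, J. Michael |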
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science, 2017 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5389787/ https://www.ncbi.nlm.nih.gov/pubmed/28403240 http://dx.doi.org/10.1371/journal.pone.0175310 |
_version_ | 1782521334978314240 |
---|---|
author | Hitz, Benjamin C.; Rowe, Laurence D.; Podduturi, Nikhil R.; Glick, David I.; Baymuradov, Ulugbek K.; Malladi, Venkat S.; Chan, Esther T.; Davidson, Jean M.; Gabdank, Idan; Narayana, Aditi K.; Onate, Kathrina C.; Hilton, Jason; Ho, Marcus C.; Lee, Brian T.; Miyasato, Stuart R.; Dreszer, Timothy R.; Sloan, Cricket A.; Strattan, J. Seth; Tanaka, Forrest Y.; Hong, Eurie L.; Cherry, J. Michael |
author_sort | Hitz, Benjamin C. |
collection | PubMed |
description | The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a comprehensive catalog of functional elements initiated shortly after the completion of the Human Genome Project. The current database exceeds 6500 experiments across more than 450 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the H. sapiens and M. musculus genomes. All ENCODE experimental data, metadata, and associated computational analyses are submitted to the ENCODE Data Coordination Center (DCC) for validation, tracking, storage, unified processing, and distribution to community resources and the scientific community. As the volume of data increases, the identification and organization of experimental details becomes increasingly intricate and demands careful curation. The ENCODE DCC has created a general purpose software system, known as SnoVault, that supports metadata and file submission, a database used for metadata storage, web pages for displaying the metadata and a robust API for querying the metadata. The software is fully open-source; code and installation instructions can be found at: http://github.com/ENCODE-DCC/snovault/ (for the generic database) and http://github.com/ENCODE-DCC/encoded/ to store genomic data in the manner of ENCODE. The core database engine, SnoVault (which is completely independent of ENCODE, genomic data, or bioinformatic data) has been released as a separate Python package. |
format | Online Article Text |
id | pubmed-5389787 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-5389787 2017-05-03 SnoVault and encodeD: A novel object-based storage system and applications to ENCODE metadata PLoS One Research Article Public Library of Science 2017-04-12 /pmc/articles/PMC5389787/ /pubmed/28403240 http://dx.doi.org/10.1371/journal.pone.0175310 Text en © 2017 Hitz et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
title | SnoVault and encodeD: A novel object-based storage system and applications to ENCODE metadata |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5389787/ https://www.ncbi.nlm.nih.gov/pubmed/28403240 http://dx.doi.org/10.1371/journal.pone.0175310 |