The ENCODE Uniform Analysis Pipelines
The Encyclopedia of DNA elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of...
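Main Authors: | Hitz, Benjamin C.; Jin-Wook, Lee; Jolanki, Otto; Kagda, Meenakshi S.; Graham, Keenan; Sud, Paul; Gabdank, Idan; Strattan, J. Seth; Sloan, Cricket A.; Dreszer, Timothy; Rowe, Laurence D.; Podduturi, Nikhil R.; Malladi, Venkat S.; Chan, Esther T.; Davidson, Jean M.; Ho, Marcus; Miyasato, Stuart; Simison, Matt; Tanaka, Forrest; Luo, Yunhai; Whaling, Ian; Hong, Eurie L.; Lee, Brian T.; Sandstrom, Richard; Rynes, Eric; Nelson, Jemma; Nishida, Andrew; Ingersoll, Alyssa; Buckley, Michael; Frerker, Mark; Kim, Daniel S; Boley, Nathan; Trout, Diane; Dobin, Alex; Rahmanian, Sorena; Wyman, Dana; Balderrama-Gutierrez, Gabriela; Reese, Fairlie; Durand, Neva C.; Dudchenko, Olga; Weisz, David; Rao, Suhas S. P.; Blackburn, Alyssa; Gkountaroulis, Dimos; Sadr, Mahdi; Olshansky, Moshe; Eliaz, Yossi; Nguyen, Dat; Bochkov, Ivan; Shamim, Muhammad Saad; Mahajan, Ragini; Aiden, Erez; Gingeras, Tom; Heath, Simon; Hirst, Martin; Kent, W. James; Kundaje, Anshul; Mortazavi, Ali; Wold, Barbara; Cherry, J. Michael |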
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Cold Spring Harbor Laboratory 2023 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104020/ https://www.ncbi.nlm.nih.gov/pubmed/37066421 http://dx.doi.org/10.1101/2023.04.04.535623 |
_version_ | 1785025956723818496 |
---|---|
author | Hitz, Benjamin C. Jin-Wook, Lee Jolanki, Otto Kagda, Meenakshi S. Graham, Keenan Sud, Paul Gabdank, Idan Strattan, J. Seth Sloan, Cricket A. Dreszer, Timothy Rowe, Laurence D. Podduturi, Nikhil R. Malladi, Venkat S. Chan, Esther T. Davidson, Jean M. Ho, Marcus Miyasato, Stuart Simison, Matt Tanaka, Forrest Luo, Yunhai Whaling, Ian Hong, Eurie L. Lee, Brian T. Sandstrom, Richard Rynes, Eric Nelson, Jemma Nishida, Andrew Ingersoll, Alyssa Buckley, Michael Frerker, Mark Kim, Daniel S Boley, Nathan Trout, Diane Dobin, Alex Rahmanian, Sorena Wyman, Dana Balderrama-Gutierrez, Gabriela Reese, Fairlie Durand, Neva C. Dudchenko, Olga Weisz, David Rao, Suhas S. P. Blackburn, Alyssa Gkountaroulis, Dimos Sadr, Mahdi Olshansky, Moshe Eliaz, Yossi Nguyen, Dat Bochkov, Ivan Shamim, Muhammad Saad Mahajan, Ragini Aiden, Erez Gingeras, Tom Heath, Simon Hirst, Martin Kent, W. James Kundaje, Anshul Mortazavi, Ali Wold, Barbara Cherry, J. Michael |
author_facet | Hitz, Benjamin C. Jin-Wook, Lee Jolanki, Otto Kagda, Meenakshi S. Graham, Keenan Sud, Paul Gabdank, Idan Strattan, J. Seth Sloan, Cricket A. Dreszer, Timothy Rowe, Laurence D. Podduturi, Nikhil R. Malladi, Venkat S. Chan, Esther T. Davidson, Jean M. Ho, Marcus Miyasato, Stuart Simison, Matt Tanaka, Forrest Luo, Yunhai Whaling, Ian Hong, Eurie L. Lee, Brian T. Sandstrom, Richard Rynes, Eric Nelson, Jemma Nishida, Andrew Ingersoll, Alyssa Buckley, Michael Frerker, Mark Kim, Daniel S Boley, Nathan Trout, Diane Dobin, Alex Rahmanian, Sorena Wyman, Dana Balderrama-Gutierrez, Gabriela Reese, Fairlie Durand, Neva C. Dudchenko, Olga Weisz, David Rao, Suhas S. P. Blackburn, Alyssa Gkountaroulis, Dimos Sadr, Mahdi Olshansky, Moshe Eliaz, Yossi Nguyen, Dat Bochkov, Ivan Shamim, Muhammad Saad Mahajan, Ragini Aiden, Erez Gingeras, Tom Heath, Simon Hirst, Martin Kent, W. James Kundaje, Anshul Mortazavi, Ali Wold, Barbara Cherry, J. Michael |
author_sort | Hitz, Benjamin C. |
collection | PubMed |
description | The Encyclopedia of DNA elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community. The ENCODE project has engineered and distributed uniform processing pipelines in order to promote data provenance and reproducibility as well as allow interoperability between genomic resources and other consortia. All data files, reference genome versions, software versions, and parameters used by the pipelines are captured and available via the ENCODE Portal. The pipeline code, developed using Docker and Workflow Description Language (WDL; https://openwdl.org/) is publicly available in GitHub, with images available on Dockerhub (https://hub.docker.com), enabling access to a diverse range of biomedical researchers. ENCODE pipelines maintained and used by the DCC can be installed to run on personal computers, local HPC clusters, or in cloud computing environments via Cromwell. Access to the pipelines and data via the cloud allows small labs the ability to use the data or software without access to institutional compute clusters. Standardization of the computational methodologies for analysis and quality control leads to comparable results from different ENCODE collections - a prerequisite for successful integrative analyses. |
format | Online Article Text |
id | pubmed-10104020 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-101040202023-04-15 The ENCODE Uniform Analysis Pipelines Hitz, Benjamin C. Jin-Wook, Lee Jolanki, Otto Kagda, Meenakshi S. Graham, Keenan Sud, Paul Gabdank, Idan Strattan, J. Seth Sloan, Cricket A. Dreszer, Timothy Rowe, Laurence D. Podduturi, Nikhil R. Malladi, Venkat S. Chan, Esther T. Davidson, Jean M. Ho, Marcus Miyasato, Stuart Simison, Matt Tanaka, Forrest Luo, Yunhai Whaling, Ian Hong, Eurie L. Lee, Brian T. Sandstrom, Richard Rynes, Eric Nelson, Jemma Nishida, Andrew Ingersoll, Alyssa Buckley, Michael Frerker, Mark Kim, Daniel S Boley, Nathan Trout, Diane Dobin, Alex Rahmanian, Sorena Wyman, Dana Balderrama-Gutierrez, Gabriela Reese, Fairlie Durand, Neva C. Dudchenko, Olga Weisz, David Rao, Suhas S. P. Blackburn, Alyssa Gkountaroulis, Dimos Sadr, Mahdi Olshansky, Moshe Eliaz, Yossi Nguyen, Dat Bochkov, Ivan Shamim, Muhammad Saad Mahajan, Ragini Aiden, Erez Gingeras, Tom Heath, Simon Hirst, Martin Kent, W. James Kundaje, Anshul Mortazavi, Ali Wold, Barbara Cherry, J. Michael bioRxiv Article The Encyclopedia of DNA elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community. The ENCODE project has engineered and distributed uniform processing pipelines in order to promote data provenance and reproducibility as well as allow interoperability between genomic resources and other consortia. 
All data files, reference genome versions, software versions, and parameters used by the pipelines are captured and available via the ENCODE Portal. The pipeline code, developed using Docker and Workflow Description Language (WDL; https://openwdl.org/) is publicly available in GitHub, with images available on Dockerhub (https://hub.docker.com), enabling access to a diverse range of biomedical researchers. ENCODE pipelines maintained and used by the DCC can be installed to run on personal computers, local HPC clusters, or in cloud computing environments via Cromwell. Access to the pipelines and data via the cloud allows small labs the ability to use the data or software without access to institutional compute clusters. Standardization of the computational methodologies for analysis and quality control leads to comparable results from different ENCODE collections - a prerequisite for successful integrative analyses. Cold Spring Harbor Laboratory 2023-04-06 /pmc/articles/PMC10104020/ /pubmed/37066421 http://dx.doi.org/10.1101/2023.04.04.535623 Text en https://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Hitz, Benjamin C. Jin-Wook, Lee Jolanki, Otto Kagda, Meenakshi S. Graham, Keenan Sud, Paul Gabdank, Idan Strattan, J. Seth Sloan, Cricket A. Dreszer, Timothy Rowe, Laurence D. Podduturi, Nikhil R. Malladi, Venkat S. Chan, Esther T. Davidson, Jean M. Ho, Marcus Miyasato, Stuart Simison, Matt Tanaka, Forrest Luo, Yunhai Whaling, Ian Hong, Eurie L. Lee, Brian T. Sandstrom, Richard Rynes, Eric Nelson, Jemma Nishida, Andrew Ingersoll, Alyssa Buckley, Michael Frerker, Mark Kim, Daniel S Boley, Nathan Trout, Diane Dobin, Alex Rahmanian, Sorena Wyman, Dana Balderrama-Gutierrez, Gabriela Reese, Fairlie Durand, Neva C. Dudchenko, Olga Weisz, David Rao, Suhas S. P. Blackburn, Alyssa Gkountaroulis, Dimos Sadr, Mahdi Olshansky, Moshe Eliaz, Yossi Nguyen, Dat Bochkov, Ivan Shamim, Muhammad Saad Mahajan, Ragini Aiden, Erez Gingeras, Tom Heath, Simon Hirst, Martin Kent, W. James Kundaje, Anshul Mortazavi, Ali Wold, Barbara Cherry, J. Michael The ENCODE Uniform Analysis Pipelines |
title | The ENCODE Uniform Analysis Pipelines |
title_full | The ENCODE Uniform Analysis Pipelines |
title_fullStr | The ENCODE Uniform Analysis Pipelines |
title_full_unstemmed | The ENCODE Uniform Analysis Pipelines |
title_short | The ENCODE Uniform Analysis Pipelines |
title_sort | encode uniform analysis pipelines |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104020/ https://www.ncbi.nlm.nih.gov/pubmed/37066421 http://dx.doi.org/10.1101/2023.04.04.535623 |