Cargando…
RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data
Chromatin is a tightly packaged structure of DNA and protein within the nucleus of a cell. The arrangement of different protein complexes along the DNA modulates and is modulated by gene expression. Measuring the binding locations and occupancy levels of different transcription factors (TFs) and nuc...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8373080/ https://www.ncbi.nlm.nih.gov/pubmed/34255854 http://dx.doi.org/10.1093/nar/gkab553 |
_version_ | 1783739881816588288 |
---|---|
author | Mitra, Sneha Zhong, Jianling Tran, Trung Q MacAlpine, David M Hartemink, Alexander J |
author_facet | Mitra, Sneha Zhong, Jianling Tran, Trung Q MacAlpine, David M Hartemink, Alexander J |
author_sort | Mitra, Sneha |
collection | PubMed |
description | Chromatin is a tightly packaged structure of DNA and protein within the nucleus of a cell. The arrangement of different protein complexes along the DNA modulates and is modulated by gene expression. Measuring the binding locations and occupancy levels of different transcription factors (TFs) and nucleosomes is therefore crucial to understanding gene regulation. Antibody-based methods for assaying chromatin occupancy are capable of identifying the binding sites of specific DNA binding factors, but only one factor at a time. In contrast, epigenomic accessibility data like MNase-seq, DNase-seq, and ATAC-seq provide insight into the chromatin landscape of all factors bound along the genome, but with little insight into the identities of those factors. Here, we present RoboCOP, a multivariate state space model that integrates chromatin accessibility data with nucleotide sequence to jointly compute genome-wide probabilistic scores of nucleosome and TF occupancy, for hundreds of different factors. We apply RoboCOP to MNase-seq and ATAC-seq data to elucidate the protein-binding landscape of nucleosomes and 150 TFs across the yeast genome, and show that our model makes better predictions than existing methods. We also compute a chromatin occupancy profile of the yeast genome under cadmium stress, revealing chromatin dynamics associated with transcriptional regulation. |
format | Online Article Text |
id | pubmed-8373080 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-83730802021-08-19 RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data Mitra, Sneha Zhong, Jianling Tran, Trung Q MacAlpine, David M Hartemink, Alexander J Nucleic Acids Res Computational Biology Chromatin is a tightly packaged structure of DNA and protein within the nucleus of a cell. The arrangement of different protein complexes along the DNA modulates and is modulated by gene expression. Measuring the binding locations and occupancy levels of different transcription factors (TFs) and nucleosomes is therefore crucial to understanding gene regulation. Antibody-based methods for assaying chromatin occupancy are capable of identifying the binding sites of specific DNA binding factors, but only one factor at a time. In contrast, epigenomic accessibility data like MNase-seq, DNase-seq, and ATAC-seq provide insight into the chromatin landscape of all factors bound along the genome, but with little insight into the identities of those factors. Here, we present RoboCOP, a multivariate state space model that integrates chromatin accessibility data with nucleotide sequence to jointly compute genome-wide probabilistic scores of nucleosome and TF occupancy, for hundreds of different factors. We apply RoboCOP to MNase-seq and ATAC-seq data to elucidate the protein-binding landscape of nucleosomes and 150 TFs across the yeast genome, and show that our model makes better predictions than existing methods. We also compute a chromatin occupancy profile of the yeast genome under cadmium stress, revealing chromatin dynamics associated with transcriptional regulation. Oxford University Press 2021-07-13 /pmc/articles/PMC8373080/ /pubmed/34255854 http://dx.doi.org/10.1093/nar/gkab553 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Mitra, Sneha Zhong, Jianling Tran, Trung Q MacAlpine, David M Hartemink, Alexander J RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data |
title | RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data |
title_full | RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data |
title_fullStr | RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data |
title_full_unstemmed | RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data |
title_short | RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data |
title_sort | robocop: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8373080/ https://www.ncbi.nlm.nih.gov/pubmed/34255854 http://dx.doi.org/10.1093/nar/gkab553 |
work_keys_str_mv | AT mitrasneha robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata AT zhongjianling robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata AT trantrungq robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata AT macalpinedavidm robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata AT harteminkalexanderj robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata |