Cargando…

RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data

Chromatin is a tightly packaged structure of DNA and protein within the nucleus of a cell. The arrangement of different protein complexes along the DNA modulates and is modulated by gene expression. Measuring the binding locations and occupancy levels of different transcription factors (TFs) and nuc...

Descripción completa

Detalles Bibliográficos
Autores principales: Mitra, Sneha, Zhong, Jianling, Tran, Trung Q, MacAlpine, David M, Hartemink, Alexander J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8373080/
https://www.ncbi.nlm.nih.gov/pubmed/34255854
http://dx.doi.org/10.1093/nar/gkab553
_version_ 1783739881816588288
author Mitra, Sneha
Zhong, Jianling
Tran, Trung Q
MacAlpine, David M
Hartemink, Alexander J
author_facet Mitra, Sneha
Zhong, Jianling
Tran, Trung Q
MacAlpine, David M
Hartemink, Alexander J
author_sort Mitra, Sneha
collection PubMed
description Chromatin is a tightly packaged structure of DNA and protein within the nucleus of a cell. The arrangement of different protein complexes along the DNA modulates and is modulated by gene expression. Measuring the binding locations and occupancy levels of different transcription factors (TFs) and nucleosomes is therefore crucial to understanding gene regulation. Antibody-based methods for assaying chromatin occupancy are capable of identifying the binding sites of specific DNA binding factors, but only one factor at a time. In contrast, epigenomic accessibility data like MNase-seq, DNase-seq, and ATAC-seq provide insight into the chromatin landscape of all factors bound along the genome, but with little insight into the identities of those factors. Here, we present RoboCOP, a multivariate state space model that integrates chromatin accessibility data with nucleotide sequence to jointly compute genome-wide probabilistic scores of nucleosome and TF occupancy, for hundreds of different factors. We apply RoboCOP to MNase-seq and ATAC-seq data to elucidate the protein-binding landscape of nucleosomes and 150 TFs across the yeast genome, and show that our model makes better predictions than existing methods. We also compute a chromatin occupancy profile of the yeast genome under cadmium stress, revealing chromatin dynamics associated with transcriptional regulation.
format Online
Article
Text
id pubmed-8373080
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-83730802021-08-19 RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data Mitra, Sneha Zhong, Jianling Tran, Trung Q MacAlpine, David M Hartemink, Alexander J Nucleic Acids Res Computational Biology Chromatin is a tightly packaged structure of DNA and protein within the nucleus of a cell. The arrangement of different protein complexes along the DNA modulates and is modulated by gene expression. Measuring the binding locations and occupancy levels of different transcription factors (TFs) and nucleosomes is therefore crucial to understanding gene regulation. Antibody-based methods for assaying chromatin occupancy are capable of identifying the binding sites of specific DNA binding factors, but only one factor at a time. In contrast, epigenomic accessibility data like MNase-seq, DNase-seq, and ATAC-seq provide insight into the chromatin landscape of all factors bound along the genome, but with little insight into the identities of those factors. Here, we present RoboCOP, a multivariate state space model that integrates chromatin accessibility data with nucleotide sequence to jointly compute genome-wide probabilistic scores of nucleosome and TF occupancy, for hundreds of different factors. We apply RoboCOP to MNase-seq and ATAC-seq data to elucidate the protein-binding landscape of nucleosomes and 150 TFs across the yeast genome, and show that our model makes better predictions than existing methods. We also compute a chromatin occupancy profile of the yeast genome under cadmium stress, revealing chromatin dynamics associated with transcriptional regulation. Oxford University Press 2021-07-13 /pmc/articles/PMC8373080/ /pubmed/34255854 http://dx.doi.org/10.1093/nar/gkab553 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Mitra, Sneha
Zhong, Jianling
Tran, Trung Q
MacAlpine, David M
Hartemink, Alexander J
RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data
title RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data
title_full RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data
title_fullStr RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data
title_full_unstemmed RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data
title_short RoboCOP: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data
title_sort robocop: jointly computing chromatin occupancy profiles for numerous factors from chromatin accessibility data
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8373080/
https://www.ncbi.nlm.nih.gov/pubmed/34255854
http://dx.doi.org/10.1093/nar/gkab553
work_keys_str_mv AT mitrasneha robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata
AT zhongjianling robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata
AT trantrungq robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata
AT macalpinedavidm robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata
AT harteminkalexanderj robocopjointlycomputingchromatinoccupancyprofilesfornumerousfactorsfromchromatinaccessibilitydata