Cargando…
Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data
Gene transcription mediated by RNA polymerase II (pol-II) is a key step in gene expression. The dynamics of pol-II moving along the transcribed region influence the rate and timing of gene expression. In this work, we present a probabilistic model of transcription dynamics which is fitted to pol-II...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4022483/ https://www.ncbi.nlm.nih.gov/pubmed/24830797 http://dx.doi.org/10.1371/journal.pcbi.1003598 |
_version_ | 1782316415783534592 |
---|---|
author | wa Maina, Ciira Honkela, Antti Matarese, Filomena Grote, Korbinian Stunnenberg, Hendrik G. Reid, George Lawrence, Neil D. Rattray, Magnus |
author_facet | wa Maina, Ciira Honkela, Antti Matarese, Filomena Grote, Korbinian Stunnenberg, Hendrik G. Reid, George Lawrence, Neil D. Rattray, Magnus |
author_sort | wa Maina, Ciira |
collection | PubMed |
description | Gene transcription mediated by RNA polymerase II (pol-II) is a key step in gene expression. The dynamics of pol-II moving along the transcribed region influence the rate and timing of gene expression. In this work, we present a probabilistic model of transcription dynamics which is fitted to pol-II occupancy time course data measured using ChIP-Seq. The model can be used to estimate transcription speed and to infer the temporal pol-II activity profile at the gene promoter. Model parameters are estimated using either maximum likelihood estimation or via Bayesian inference using Markov chain Monte Carlo sampling. The Bayesian approach provides confidence intervals for parameter estimates and allows the use of priors that capture domain knowledge, e.g. the expected range of transcription speeds, based on previous experiments. The model describes the movement of pol-II down the gene body and can be used to identify the time of induction for transcriptionally engaged genes. By clustering the inferred promoter activity time profiles, we are able to determine which genes respond quickly to stimuli and group genes that share activity profiles and may therefore be co-regulated. We apply our methodology to biological data obtained using ChIP-seq to measure pol-II occupancy genome-wide when MCF-7 human breast cancer cells are treated with estradiol (E2). The transcription speeds we obtain agree with those obtained previously for smaller numbers of genes with the advantage that our approach can be applied genome-wide. We validate the biological significance of the pol-II promoter activity clusters by investigating cluster-specific transcription factor binding patterns and determining canonical pathway enrichment. We find that rapidly induced genes are enriched for both estrogen receptor alpha (ER[Image: see text]) and FOXA1 binding in their proximal promoter regions. |
format | Online Article Text |
id | pubmed-4022483 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-40224832014-05-21 Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data wa Maina, Ciira Honkela, Antti Matarese, Filomena Grote, Korbinian Stunnenberg, Hendrik G. Reid, George Lawrence, Neil D. Rattray, Magnus PLoS Comput Biol Research Article Gene transcription mediated by RNA polymerase II (pol-II) is a key step in gene expression. The dynamics of pol-II moving along the transcribed region influence the rate and timing of gene expression. In this work, we present a probabilistic model of transcription dynamics which is fitted to pol-II occupancy time course data measured using ChIP-Seq. The model can be used to estimate transcription speed and to infer the temporal pol-II activity profile at the gene promoter. Model parameters are estimated using either maximum likelihood estimation or via Bayesian inference using Markov chain Monte Carlo sampling. The Bayesian approach provides confidence intervals for parameter estimates and allows the use of priors that capture domain knowledge, e.g. the expected range of transcription speeds, based on previous experiments. The model describes the movement of pol-II down the gene body and can be used to identify the time of induction for transcriptionally engaged genes. By clustering the inferred promoter activity time profiles, we are able to determine which genes respond quickly to stimuli and group genes that share activity profiles and may therefore be co-regulated. We apply our methodology to biological data obtained using ChIP-seq to measure pol-II occupancy genome-wide when MCF-7 human breast cancer cells are treated with estradiol (E2). The transcription speeds we obtain agree with those obtained previously for smaller numbers of genes with the advantage that our approach can be applied genome-wide. We validate the biological significance of the pol-II promoter activity clusters by investigating cluster-specific transcription factor binding patterns and determining canonical pathway enrichment. We find that rapidly induced genes are enriched for both estrogen receptor alpha (ER[Image: see text]) and FOXA1 binding in their proximal promoter regions. Public Library of Science 2014-05-15 /pmc/articles/PMC4022483/ /pubmed/24830797 http://dx.doi.org/10.1371/journal.pcbi.1003598 Text en © 2014 wa Maina et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article wa Maina, Ciira Honkela, Antti Matarese, Filomena Grote, Korbinian Stunnenberg, Hendrik G. Reid, George Lawrence, Neil D. Rattray, Magnus Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data |
title | Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data |
title_full | Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data |
title_fullStr | Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data |
title_full_unstemmed | Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data |
title_short | Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data |
title_sort | inference of rna polymerase ii transcription dynamics from chromatin immunoprecipitation time course data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4022483/ https://www.ncbi.nlm.nih.gov/pubmed/24830797 http://dx.doi.org/10.1371/journal.pcbi.1003598 |
work_keys_str_mv | AT wamainaciira inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata AT honkelaantti inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata AT mataresefilomena inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata AT grotekorbinian inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata AT stunnenberghendrikg inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata AT reidgeorge inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata AT lawrenceneild inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata AT rattraymagnus inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata |