Cargando…

Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data

Gene transcription mediated by RNA polymerase II (pol-II) is a key step in gene expression. The dynamics of pol-II moving along the transcribed region influence the rate and timing of gene expression. In this work, we present a probabilistic model of transcription dynamics which is fitted to pol-II...

Descripción completa

Detalles Bibliográficos
Autores principales: wa Maina, Ciira, Honkela, Antti, Matarese, Filomena, Grote, Korbinian, Stunnenberg, Hendrik G., Reid, George, Lawrence, Neil D., Rattray, Magnus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4022483/
https://www.ncbi.nlm.nih.gov/pubmed/24830797
http://dx.doi.org/10.1371/journal.pcbi.1003598
_version_ 1782316415783534592
author wa Maina, Ciira
Honkela, Antti
Matarese, Filomena
Grote, Korbinian
Stunnenberg, Hendrik G.
Reid, George
Lawrence, Neil D.
Rattray, Magnus
author_facet wa Maina, Ciira
Honkela, Antti
Matarese, Filomena
Grote, Korbinian
Stunnenberg, Hendrik G.
Reid, George
Lawrence, Neil D.
Rattray, Magnus
author_sort wa Maina, Ciira
collection PubMed
description Gene transcription mediated by RNA polymerase II (pol-II) is a key step in gene expression. The dynamics of pol-II moving along the transcribed region influence the rate and timing of gene expression. In this work, we present a probabilistic model of transcription dynamics which is fitted to pol-II occupancy time course data measured using ChIP-Seq. The model can be used to estimate transcription speed and to infer the temporal pol-II activity profile at the gene promoter. Model parameters are estimated using either maximum likelihood estimation or via Bayesian inference using Markov chain Monte Carlo sampling. The Bayesian approach provides confidence intervals for parameter estimates and allows the use of priors that capture domain knowledge, e.g. the expected range of transcription speeds, based on previous experiments. The model describes the movement of pol-II down the gene body and can be used to identify the time of induction for transcriptionally engaged genes. By clustering the inferred promoter activity time profiles, we are able to determine which genes respond quickly to stimuli and group genes that share activity profiles and may therefore be co-regulated. We apply our methodology to biological data obtained using ChIP-seq to measure pol-II occupancy genome-wide when MCF-7 human breast cancer cells are treated with estradiol (E2). The transcription speeds we obtain agree with those obtained previously for smaller numbers of genes with the advantage that our approach can be applied genome-wide. We validate the biological significance of the pol-II promoter activity clusters by investigating cluster-specific transcription factor binding patterns and determining canonical pathway enrichment. We find that rapidly induced genes are enriched for both estrogen receptor alpha (ER[Image: see text]) and FOXA1 binding in their proximal promoter regions.
format Online
Article
Text
id pubmed-4022483
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40224832014-05-21 Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data wa Maina, Ciira Honkela, Antti Matarese, Filomena Grote, Korbinian Stunnenberg, Hendrik G. Reid, George Lawrence, Neil D. Rattray, Magnus PLoS Comput Biol Research Article Gene transcription mediated by RNA polymerase II (pol-II) is a key step in gene expression. The dynamics of pol-II moving along the transcribed region influence the rate and timing of gene expression. In this work, we present a probabilistic model of transcription dynamics which is fitted to pol-II occupancy time course data measured using ChIP-Seq. The model can be used to estimate transcription speed and to infer the temporal pol-II activity profile at the gene promoter. Model parameters are estimated using either maximum likelihood estimation or via Bayesian inference using Markov chain Monte Carlo sampling. The Bayesian approach provides confidence intervals for parameter estimates and allows the use of priors that capture domain knowledge, e.g. the expected range of transcription speeds, based on previous experiments. The model describes the movement of pol-II down the gene body and can be used to identify the time of induction for transcriptionally engaged genes. By clustering the inferred promoter activity time profiles, we are able to determine which genes respond quickly to stimuli and group genes that share activity profiles and may therefore be co-regulated. We apply our methodology to biological data obtained using ChIP-seq to measure pol-II occupancy genome-wide when MCF-7 human breast cancer cells are treated with estradiol (E2). The transcription speeds we obtain agree with those obtained previously for smaller numbers of genes with the advantage that our approach can be applied genome-wide. We validate the biological significance of the pol-II promoter activity clusters by investigating cluster-specific transcription factor binding patterns and determining canonical pathway enrichment. We find that rapidly induced genes are enriched for both estrogen receptor alpha (ER[Image: see text]) and FOXA1 binding in their proximal promoter regions. Public Library of Science 2014-05-15 /pmc/articles/PMC4022483/ /pubmed/24830797 http://dx.doi.org/10.1371/journal.pcbi.1003598 Text en © 2014 wa Maina et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
wa Maina, Ciira
Honkela, Antti
Matarese, Filomena
Grote, Korbinian
Stunnenberg, Hendrik G.
Reid, George
Lawrence, Neil D.
Rattray, Magnus
Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data
title Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data
title_full Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data
title_fullStr Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data
title_full_unstemmed Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data
title_short Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data
title_sort inference of rna polymerase ii transcription dynamics from chromatin immunoprecipitation time course data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4022483/
https://www.ncbi.nlm.nih.gov/pubmed/24830797
http://dx.doi.org/10.1371/journal.pcbi.1003598
work_keys_str_mv AT wamainaciira inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata
AT honkelaantti inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata
AT mataresefilomena inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata
AT grotekorbinian inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata
AT stunnenberghendrikg inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata
AT reidgeorge inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata
AT lawrenceneild inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata
AT rattraymagnus inferenceofrnapolymeraseiitranscriptiondynamicsfromchromatinimmunoprecipitationtimecoursedata