Cargando…
Detecting sequence dependent transcriptional pauses from RNA and protein number time series
BACKGROUND: Evidence suggests that in prokaryotes sequence-dependent transcriptional pauses affect the dynamics of transcription and translation, as well as of small genetic circuits. So far, a few pause-prone sequences have been identified from in vitro measurements of transcription elongation kine...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534578/ https://www.ncbi.nlm.nih.gov/pubmed/22741547 http://dx.doi.org/10.1186/1471-2105-13-152 |
_version_ | 1782475359130746880 |
---|---|
author | Emmert-Streib, Frank Häkkinen, Antti Ribeiro, Andre S |
author_facet | Emmert-Streib, Frank Häkkinen, Antti Ribeiro, Andre S |
author_sort | Emmert-Streib, Frank |
collection | PubMed |
description | BACKGROUND: Evidence suggests that in prokaryotes sequence-dependent transcriptional pauses affect the dynamics of transcription and translation, as well as of small genetic circuits. So far, a few pause-prone sequences have been identified from in vitro measurements of transcription elongation kinetics. RESULTS: Using a stochastic model of gene expression at the nucleotide and codon levels with realistic parameter values, we investigate three different but related questions and present statistical methods for their analysis. First, we show that information from in vivo RNA and protein temporal numbers is sufficient to discriminate between models with and without a pause site in their coding sequence. Second, we demonstrate that it is possible to separate a large variety of models from each other with pauses of various durations and locations in the template by means of a hierarchical clustering and a random forest classifier. Third, we introduce an approximate likelihood function that allows to estimate the location of a pause site. CONCLUSIONS: This method can aid in detecting unknown pause-prone sequences from temporal measurements of RNA and protein numbers at a genome-wide scale and thus elucidate possible roles that these sequences play in the dynamics of genetic networks and phenotype. |
format | Online Article Text |
id | pubmed-3534578 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35345782013-01-03 Detecting sequence dependent transcriptional pauses from RNA and protein number time series Emmert-Streib, Frank Häkkinen, Antti Ribeiro, Andre S BMC Bioinformatics Research Article BACKGROUND: Evidence suggests that in prokaryotes sequence-dependent transcriptional pauses affect the dynamics of transcription and translation, as well as of small genetic circuits. So far, a few pause-prone sequences have been identified from in vitro measurements of transcription elongation kinetics. RESULTS: Using a stochastic model of gene expression at the nucleotide and codon levels with realistic parameter values, we investigate three different but related questions and present statistical methods for their analysis. First, we show that information from in vivo RNA and protein temporal numbers is sufficient to discriminate between models with and without a pause site in their coding sequence. Second, we demonstrate that it is possible to separate a large variety of models from each other with pauses of various durations and locations in the template by means of a hierarchical clustering and a random forest classifier. Third, we introduce an approximate likelihood function that allows to estimate the location of a pause site. CONCLUSIONS: This method can aid in detecting unknown pause-prone sequences from temporal measurements of RNA and protein numbers at a genome-wide scale and thus elucidate possible roles that these sequences play in the dynamics of genetic networks and phenotype. BioMed Central 2012-06-28 /pmc/articles/PMC3534578/ /pubmed/22741547 http://dx.doi.org/10.1186/1471-2105-13-152 Text en Copyright ©2012 Emmert-Streib et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Emmert-Streib, Frank Häkkinen, Antti Ribeiro, Andre S Detecting sequence dependent transcriptional pauses from RNA and protein number time series |
title | Detecting sequence dependent transcriptional pauses from RNA and protein number time series |
title_full | Detecting sequence dependent transcriptional pauses from RNA and protein number time series |
title_fullStr | Detecting sequence dependent transcriptional pauses from RNA and protein number time series |
title_full_unstemmed | Detecting sequence dependent transcriptional pauses from RNA and protein number time series |
title_short | Detecting sequence dependent transcriptional pauses from RNA and protein number time series |
title_sort | detecting sequence dependent transcriptional pauses from rna and protein number time series |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534578/ https://www.ncbi.nlm.nih.gov/pubmed/22741547 http://dx.doi.org/10.1186/1471-2105-13-152 |
work_keys_str_mv | AT emmertstreibfrank detectingsequencedependenttranscriptionalpausesfromrnaandproteinnumbertimeseries AT hakkinenantti detectingsequencedependenttranscriptionalpausesfromrnaandproteinnumbertimeseries AT ribeiroandres detectingsequencedependenttranscriptionalpausesfromrnaandproteinnumbertimeseries |