Cargando…
Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures
Promoter-proximal Polymerase II (Pol II) pausing is a key rate-limiting step for gene expression. DNA and RNA-binding trans-acting factors regulating the extent of pausing have been identified. However, we lack a quantitative model of how interactions of these factors determine pausing, therefore th...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9976927/ https://www.ncbi.nlm.nih.gov/pubmed/36727445 http://dx.doi.org/10.1093/nar/gkac1272 |
_version_ | 1784899182965817344 |
---|---|
author | Akcan, Toray S Vilov, Sergey Heinig, Matthias |
author_facet | Akcan, Toray S Vilov, Sergey Heinig, Matthias |
author_sort | Akcan, Toray S |
collection | PubMed |
description | Promoter-proximal Polymerase II (Pol II) pausing is a key rate-limiting step for gene expression. DNA and RNA-binding trans-acting factors regulating the extent of pausing have been identified. However, we lack a quantitative model of how interactions of these factors determine pausing, therefore the relative importance of implicated factors is unknown. Moreover, previously unknown regulators might exist. Here we address this gap with a machine learning model that accurately predicts the extent of promoter-proximal Pol II pausing from large-scale genome and transcriptome binding maps and gene annotation and sequence composition features. We demonstrate high accuracy and generalizability of the model by validation on an independent cell line which reveals the model's cell line agnostic character. Model interpretation in light of prior knowledge about molecular functions of regulatory factors confirms the interconnection of pausing with other RNA processing steps. Harnessing underlying feature contributions, we assess the relative importance of each factor, quantify their predictive effects and systematically identify previously unknown regulators of pausing. We additionally identify 16 previously unknown 7SK ncRNA interacting RNA-binding proteins predictive of pausing. Our work provides a framework to further our understanding of the regulation of the critical early steps in transcriptional elongation. |
format | Online Article Text |
id | pubmed-9976927 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-99769272023-03-02 Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures Akcan, Toray S Vilov, Sergey Heinig, Matthias Nucleic Acids Res Computational Biology Promoter-proximal Polymerase II (Pol II) pausing is a key rate-limiting step for gene expression. DNA and RNA-binding trans-acting factors regulating the extent of pausing have been identified. However, we lack a quantitative model of how interactions of these factors determine pausing, therefore the relative importance of implicated factors is unknown. Moreover, previously unknown regulators might exist. Here we address this gap with a machine learning model that accurately predicts the extent of promoter-proximal Pol II pausing from large-scale genome and transcriptome binding maps and gene annotation and sequence composition features. We demonstrate high accuracy and generalizability of the model by validation on an independent cell line which reveals the model's cell line agnostic character. Model interpretation in light of prior knowledge about molecular functions of regulatory factors confirms the interconnection of pausing with other RNA processing steps. Harnessing underlying feature contributions, we assess the relative importance of each factor, quantify their predictive effects and systematically identify previously unknown regulators of pausing. We additionally identify 16 previously unknown 7SK ncRNA interacting RNA-binding proteins predictive of pausing. Our work provides a framework to further our understanding of the regulation of the critical early steps in transcriptional elongation. Oxford University Press 2023-02-02 /pmc/articles/PMC9976927/ /pubmed/36727445 http://dx.doi.org/10.1093/nar/gkac1272 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Akcan, Toray S Vilov, Sergey Heinig, Matthias Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures |
title | Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures |
title_full | Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures |
title_fullStr | Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures |
title_full_unstemmed | Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures |
title_short | Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures |
title_sort | predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9976927/ https://www.ncbi.nlm.nih.gov/pubmed/36727445 http://dx.doi.org/10.1093/nar/gkac1272 |
work_keys_str_mv | AT akcantorays predictivemodeloftranscriptionalelongationcontrolidentifiestransregulatoryfactorsfromchromatinsignatures AT vilovsergey predictivemodeloftranscriptionalelongationcontrolidentifiestransregulatoryfactorsfromchromatinsignatures AT heinigmatthias predictivemodeloftranscriptionalelongationcontrolidentifiestransregulatoryfactorsfromchromatinsignatures |