Cargando…

Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures

Promoter-proximal Polymerase II (Pol II) pausing is a key rate-limiting step for gene expression. DNA and RNA-binding trans-acting factors regulating the extent of pausing have been identified. However, we lack a quantitative model of how interactions of these factors determine pausing, therefore th...

Descripción completa

Detalles Bibliográficos
Autores principales: Akcan, Toray S, Vilov, Sergey, Heinig, Matthias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9976927/
https://www.ncbi.nlm.nih.gov/pubmed/36727445
http://dx.doi.org/10.1093/nar/gkac1272
_version_ 1784899182965817344
author Akcan, Toray S
Vilov, Sergey
Heinig, Matthias
author_facet Akcan, Toray S
Vilov, Sergey
Heinig, Matthias
author_sort Akcan, Toray S
collection PubMed
description Promoter-proximal Polymerase II (Pol II) pausing is a key rate-limiting step for gene expression. DNA and RNA-binding trans-acting factors regulating the extent of pausing have been identified. However, we lack a quantitative model of how interactions of these factors determine pausing, therefore the relative importance of implicated factors is unknown. Moreover, previously unknown regulators might exist. Here we address this gap with a machine learning model that accurately predicts the extent of promoter-proximal Pol II pausing from large-scale genome and transcriptome binding maps and gene annotation and sequence composition features. We demonstrate high accuracy and generalizability of the model by validation on an independent cell line which reveals the model's cell line agnostic character. Model interpretation in light of prior knowledge about molecular functions of regulatory factors confirms the interconnection of pausing with other RNA processing steps. Harnessing underlying feature contributions, we assess the relative importance of each factor, quantify their predictive effects and systematically identify previously unknown regulators of pausing. We additionally identify 16 previously unknown 7SK ncRNA interacting RNA-binding proteins predictive of pausing. Our work provides a framework to further our understanding of the regulation of the critical early steps in transcriptional elongation.
format Online
Article
Text
id pubmed-9976927
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99769272023-03-02 Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures Akcan, Toray S Vilov, Sergey Heinig, Matthias Nucleic Acids Res Computational Biology Promoter-proximal Polymerase II (Pol II) pausing is a key rate-limiting step for gene expression. DNA and RNA-binding trans-acting factors regulating the extent of pausing have been identified. However, we lack a quantitative model of how interactions of these factors determine pausing, therefore the relative importance of implicated factors is unknown. Moreover, previously unknown regulators might exist. Here we address this gap with a machine learning model that accurately predicts the extent of promoter-proximal Pol II pausing from large-scale genome and transcriptome binding maps and gene annotation and sequence composition features. We demonstrate high accuracy and generalizability of the model by validation on an independent cell line which reveals the model's cell line agnostic character. Model interpretation in light of prior knowledge about molecular functions of regulatory factors confirms the interconnection of pausing with other RNA processing steps. Harnessing underlying feature contributions, we assess the relative importance of each factor, quantify their predictive effects and systematically identify previously unknown regulators of pausing. We additionally identify 16 previously unknown 7SK ncRNA interacting RNA-binding proteins predictive of pausing. Our work provides a framework to further our understanding of the regulation of the critical early steps in transcriptional elongation. Oxford University Press 2023-02-02 /pmc/articles/PMC9976927/ /pubmed/36727445 http://dx.doi.org/10.1093/nar/gkac1272 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Akcan, Toray S
Vilov, Sergey
Heinig, Matthias
Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures
title Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures
title_full Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures
title_fullStr Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures
title_full_unstemmed Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures
title_short Predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures
title_sort predictive model of transcriptional elongation control identifies trans regulatory factors from chromatin signatures
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9976927/
https://www.ncbi.nlm.nih.gov/pubmed/36727445
http://dx.doi.org/10.1093/nar/gkac1272
work_keys_str_mv AT akcantorays predictivemodeloftranscriptionalelongationcontrolidentifiestransregulatoryfactorsfromchromatinsignatures
AT vilovsergey predictivemodeloftranscriptionalelongationcontrolidentifiestransregulatoryfactorsfromchromatinsignatures
AT heinigmatthias predictivemodeloftranscriptionalelongationcontrolidentifiestransregulatoryfactorsfromchromatinsignatures