Cargando…

Sequence basis of transcription initiation in human genome

Transcription initiation is an essential process for ensuring proper function of any gene, however, a unified understanding of sequence patterns and rules that determine transcription initiation sites in human genome remains elusive. By explaining transcription initiation at basepair resolution from...

Descripción completa

Detalles Bibliográficos
Autores principales: Dudnyk, Kseniia, Shi, Chenlai, Zhou, Jian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327147/
https://www.ncbi.nlm.nih.gov/pubmed/37425823
http://dx.doi.org/10.1101/2023.06.27.546584
_version_ 1785069565896556544
author Dudnyk, Kseniia
Shi, Chenlai
Zhou, Jian
author_facet Dudnyk, Kseniia
Shi, Chenlai
Zhou, Jian
author_sort Dudnyk, Kseniia
collection PubMed
description Transcription initiation is an essential process for ensuring proper function of any gene, however, a unified understanding of sequence patterns and rules that determine transcription initiation sites in human genome remains elusive. By explaining transcription initiation at basepair resolution from sequence with a deep learning-inspired explainable modeling approach, here we show that simple rules can explain the vast majority of human promoters. We identified key sequence patterns that contribute to human promoter function, each activating transcription with a distinct position-specific effect curve that likely reflects its mechanism of promoting transcription initiation. Most of these position-specific effects have not been previously characterized, and we verified them using experimental perturbations of transcription factors and sequences. We revealed the sequence basis of bidirectional transcription at promoters and links between promoter selectivity and gene expression variation across cell types. Additionally, by analyzing 241 mammalian genomes and mouse transcription initiation site data, we showed that the sequence determinants are conserved across mammalian species. Taken together, we provide a unified model of the sequence basis of transcription initiation at the basepair level that is broadly applicable across mammalian species, and shed new light on basic questions related to promoter sequence and function.
format Online
Article
Text
id pubmed-10327147
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-103271472023-07-08 Sequence basis of transcription initiation in human genome Dudnyk, Kseniia Shi, Chenlai Zhou, Jian bioRxiv Article Transcription initiation is an essential process for ensuring proper function of any gene, however, a unified understanding of sequence patterns and rules that determine transcription initiation sites in human genome remains elusive. By explaining transcription initiation at basepair resolution from sequence with a deep learning-inspired explainable modeling approach, here we show that simple rules can explain the vast majority of human promoters. We identified key sequence patterns that contribute to human promoter function, each activating transcription with a distinct position-specific effect curve that likely reflects its mechanism of promoting transcription initiation. Most of these position-specific effects have not been previously characterized, and we verified them using experimental perturbations of transcription factors and sequences. We revealed the sequence basis of bidirectional transcription at promoters and links between promoter selectivity and gene expression variation across cell types. Additionally, by analyzing 241 mammalian genomes and mouse transcription initiation site data, we showed that the sequence determinants are conserved across mammalian species. Taken together, we provide a unified model of the sequence basis of transcription initiation at the basepair level that is broadly applicable across mammalian species, and shed new light on basic questions related to promoter sequence and function. Cold Spring Harbor Laboratory 2023-06-29 /pmc/articles/PMC10327147/ /pubmed/37425823 http://dx.doi.org/10.1101/2023.06.27.546584 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Dudnyk, Kseniia
Shi, Chenlai
Zhou, Jian
Sequence basis of transcription initiation in human genome
title Sequence basis of transcription initiation in human genome
title_full Sequence basis of transcription initiation in human genome
title_fullStr Sequence basis of transcription initiation in human genome
title_full_unstemmed Sequence basis of transcription initiation in human genome
title_short Sequence basis of transcription initiation in human genome
title_sort sequence basis of transcription initiation in human genome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10327147/
https://www.ncbi.nlm.nih.gov/pubmed/37425823
http://dx.doi.org/10.1101/2023.06.27.546584
work_keys_str_mv AT dudnykkseniia sequencebasisoftranscriptioninitiationinhumangenome
AT shichenlai sequencebasisoftranscriptioninitiationinhumangenome
AT zhoujian sequencebasisoftranscriptioninitiationinhumangenome