Cargando…

Explaining the correlations among properties of mammalian promoters

Proximal promoters are fundamental genomic elements for gene expression. They vary in terms of GC percentage, CpG abundance, presence of TATA signal, evolutionary conservation, chromosomal spread of transcription start sites and breadth of expression across cell types. These properties are correlate...

Descripción completa

Detalles Bibliográficos
Autor principal: Frith, Martin C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005656/
https://www.ncbi.nlm.nih.gov/pubmed/24682821
http://dx.doi.org/10.1093/nar/gku115
_version_ 1782314134928359424
author Frith, Martin C.
author_facet Frith, Martin C.
author_sort Frith, Martin C.
collection PubMed
description Proximal promoters are fundamental genomic elements for gene expression. They vary in terms of GC percentage, CpG abundance, presence of TATA signal, evolutionary conservation, chromosomal spread of transcription start sites and breadth of expression across cell types. These properties are correlated, and it has been suggested that there are two classes of promoters: one class with high CpG, widely spread transcription start sites and broad expression, and another with TATA signals, narrow spread and restricted expression. However, it has been unclear why these properties are correlated in this way. We reexamined these features using the deep FANTOM5 CAGE data from hundreds of cell types. First, we point out subtle but important biases in previous definitions of promoters and of expression breadth. Second, we show that most promoters are rather nonspecifically expressed across many cell types. Third, promoters’ expression breadth is independent of maximum expression level, and therefore correlates with average expression level. Fourth, the data show a more complex picture than two classes, with a network of direct and indirect correlations among promoter properties. By tentatively distinguishing the direct from the indirect correlations, we reveal simple explanations for them.
format Online
Article
Text
id pubmed-4005656
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40056562014-05-01 Explaining the correlations among properties of mammalian promoters Frith, Martin C. Nucleic Acids Res Computational Biology Proximal promoters are fundamental genomic elements for gene expression. They vary in terms of GC percentage, CpG abundance, presence of TATA signal, evolutionary conservation, chromosomal spread of transcription start sites and breadth of expression across cell types. These properties are correlated, and it has been suggested that there are two classes of promoters: one class with high CpG, widely spread transcription start sites and broad expression, and another with TATA signals, narrow spread and restricted expression. However, it has been unclear why these properties are correlated in this way. We reexamined these features using the deep FANTOM5 CAGE data from hundreds of cell types. First, we point out subtle but important biases in previous definitions of promoters and of expression breadth. Second, we show that most promoters are rather nonspecifically expressed across many cell types. Third, promoters’ expression breadth is independent of maximum expression level, and therefore correlates with average expression level. Fourth, the data show a more complex picture than two classes, with a network of direct and indirect correlations among promoter properties. By tentatively distinguishing the direct from the indirect correlations, we reveal simple explanations for them. Oxford University Press 2014-04 2014-03-27 /pmc/articles/PMC4005656/ /pubmed/24682821 http://dx.doi.org/10.1093/nar/gku115 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Frith, Martin C.
Explaining the correlations among properties of mammalian promoters
title Explaining the correlations among properties of mammalian promoters
title_full Explaining the correlations among properties of mammalian promoters
title_fullStr Explaining the correlations among properties of mammalian promoters
title_full_unstemmed Explaining the correlations among properties of mammalian promoters
title_short Explaining the correlations among properties of mammalian promoters
title_sort explaining the correlations among properties of mammalian promoters
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005656/
https://www.ncbi.nlm.nih.gov/pubmed/24682821
http://dx.doi.org/10.1093/nar/gku115
work_keys_str_mv AT frithmartinc explainingthecorrelationsamongpropertiesofmammalianpromoters
AT explainingthecorrelationsamongpropertiesofmammalianpromoters