Cargando…

Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors

Understanding how regulatory elements control mammalian gene expression is a challenge of post-genomic era. We previously reported that size of proximal promoter architecture predicted the breadth of expression (fraction of tissues in which a gene is expressed). Herein, the contributions of individu...

Descripción completa

Detalles Bibliográficos
Autor principal: Huminiecki, Lukasz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6013173/
https://www.ncbi.nlm.nih.gov/pubmed/29928029
http://dx.doi.org/10.1371/journal.pone.0198961
_version_ 1783333977460834304
author Huminiecki, Lukasz
author_facet Huminiecki, Lukasz
author_sort Huminiecki, Lukasz
collection PubMed
description Understanding how regulatory elements control mammalian gene expression is a challenge of post-genomic era. We previously reported that size of proximal promoter architecture predicted the breadth of expression (fraction of tissues in which a gene is expressed). Herein, the contributions of individual transcription factors (TFs) were quantified. Several technologies of statistical modelling were utilized and compared: tree models, generalized linear models (GLMs, without and with regularization), Bayesian GLMs and random forest. Both linear and non-linear modelling strategies were explored. Encouragingly, different models led to similar statistical conclusions and biological interpretations. The majority of ENCODE TFs correlated positively with housekeeping expression, a minority correlated negatively. Thus, housekeeping expression can be understood as a cumulative effect of many types of TF binding sites. This is accompanied by the exclusion of fewer types of binding sites for TFs which are repressors, or support cell lineage commitment or temporarily inducible or spatially-restricted expression.
format Online
Article
Text
id pubmed-6013173
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60131732018-07-06 Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors Huminiecki, Lukasz PLoS One Research Article Understanding how regulatory elements control mammalian gene expression is a challenge of post-genomic era. We previously reported that size of proximal promoter architecture predicted the breadth of expression (fraction of tissues in which a gene is expressed). Herein, the contributions of individual transcription factors (TFs) were quantified. Several technologies of statistical modelling were utilized and compared: tree models, generalized linear models (GLMs, without and with regularization), Bayesian GLMs and random forest. Both linear and non-linear modelling strategies were explored. Encouragingly, different models led to similar statistical conclusions and biological interpretations. The majority of ENCODE TFs correlated positively with housekeeping expression, a minority correlated negatively. Thus, housekeeping expression can be understood as a cumulative effect of many types of TF binding sites. This is accompanied by the exclusion of fewer types of binding sites for TFs which are repressors, or support cell lineage commitment or temporarily inducible or spatially-restricted expression. Public Library of Science 2018-06-21 /pmc/articles/PMC6013173/ /pubmed/29928029 http://dx.doi.org/10.1371/journal.pone.0198961 Text en © 2018 Lukasz Huminiecki http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Huminiecki, Lukasz
Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors
title Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors
title_full Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors
title_fullStr Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors
title_full_unstemmed Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors
title_short Modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors
title_sort modelling of the breadth of expression from promoter architectures identifies pro-housekeeping transcription factors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6013173/
https://www.ncbi.nlm.nih.gov/pubmed/29928029
http://dx.doi.org/10.1371/journal.pone.0198961
work_keys_str_mv AT huminieckilukasz modellingofthebreadthofexpressionfrompromoterarchitecturesidentifiesprohousekeepingtranscriptionfactors