Cargando…

Quantitative modeling of gene expression using DNA shape features of binding sites

Prediction of gene expression levels driven by regulatory sequences is pivotal in genomic biology. A major focus in transcriptional regulation is sequence-to-expression modeling, which interprets the enhancer sequence based on transcription factor concentrations and DNA binding specificities and pre...

Descripción completa

Detalles Bibliográficos
Autores principales: Peng, Pei-Chen, Sinha, Saurabh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5291265/
https://www.ncbi.nlm.nih.gov/pubmed/27257066
http://dx.doi.org/10.1093/nar/gkw446
_version_ 1782504751914549248
author Peng, Pei-Chen
Sinha, Saurabh
author_facet Peng, Pei-Chen
Sinha, Saurabh
author_sort Peng, Pei-Chen
collection PubMed
description Prediction of gene expression levels driven by regulatory sequences is pivotal in genomic biology. A major focus in transcriptional regulation is sequence-to-expression modeling, which interprets the enhancer sequence based on transcription factor concentrations and DNA binding specificities and predicts precise gene expression levels in varying cellular contexts. Such models largely rely on the position weight matrix (PWM) model for DNA binding, and the effect of alternative models based on DNA shape remains unexplored. Here, we propose a statistical thermodynamics model of gene expression using DNA shape features of binding sites. We used rigorous methods to evaluate the fits of expression readouts of 37 enhancers regulating spatial gene expression patterns in Drosophila embryo, and show that DNA shape-based models perform arguably better than PWM-based models. We also observed DNA shape captures information complimentary to the PWM, in a way that is useful for expression modeling. Furthermore, we tested if combining shape and PWM-based features provides better predictions than using either binding model alone. Our work demonstrates that the increasingly popular DNA-binding models based on local DNA shape can be useful in sequence-to-expression modeling. It also provides a framework for future studies to predict gene expression better than with PWM models alone.
format Online
Article
Text
id pubmed-5291265
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-52912652017-02-10 Quantitative modeling of gene expression using DNA shape features of binding sites Peng, Pei-Chen Sinha, Saurabh Nucleic Acids Res Method Online Prediction of gene expression levels driven by regulatory sequences is pivotal in genomic biology. A major focus in transcriptional regulation is sequence-to-expression modeling, which interprets the enhancer sequence based on transcription factor concentrations and DNA binding specificities and predicts precise gene expression levels in varying cellular contexts. Such models largely rely on the position weight matrix (PWM) model for DNA binding, and the effect of alternative models based on DNA shape remains unexplored. Here, we propose a statistical thermodynamics model of gene expression using DNA shape features of binding sites. We used rigorous methods to evaluate the fits of expression readouts of 37 enhancers regulating spatial gene expression patterns in Drosophila embryo, and show that DNA shape-based models perform arguably better than PWM-based models. We also observed DNA shape captures information complimentary to the PWM, in a way that is useful for expression modeling. Furthermore, we tested if combining shape and PWM-based features provides better predictions than using either binding model alone. Our work demonstrates that the increasingly popular DNA-binding models based on local DNA shape can be useful in sequence-to-expression modeling. It also provides a framework for future studies to predict gene expression better than with PWM models alone. Oxford University Press 2016-07-27 2016-06-01 /pmc/articles/PMC5291265/ /pubmed/27257066 http://dx.doi.org/10.1093/nar/gkw446 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Method Online
Peng, Pei-Chen
Sinha, Saurabh
Quantitative modeling of gene expression using DNA shape features of binding sites
title Quantitative modeling of gene expression using DNA shape features of binding sites
title_full Quantitative modeling of gene expression using DNA shape features of binding sites
title_fullStr Quantitative modeling of gene expression using DNA shape features of binding sites
title_full_unstemmed Quantitative modeling of gene expression using DNA shape features of binding sites
title_short Quantitative modeling of gene expression using DNA shape features of binding sites
title_sort quantitative modeling of gene expression using dna shape features of binding sites
topic Method Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5291265/
https://www.ncbi.nlm.nih.gov/pubmed/27257066
http://dx.doi.org/10.1093/nar/gkw446
work_keys_str_mv AT pengpeichen quantitativemodelingofgeneexpressionusingdnashapefeaturesofbindingsites
AT sinhasaurabh quantitativemodelingofgeneexpressionusingdnashapefeaturesofbindingsites