Cargando…
Quantitative modeling of gene expression using DNA shape features of binding sites
Prediction of gene expression levels driven by regulatory sequences is pivotal in genomic biology. A major focus in transcriptional regulation is sequence-to-expression modeling, which interprets the enhancer sequence based on transcription factor concentrations and DNA binding specificities and pre...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5291265/ https://www.ncbi.nlm.nih.gov/pubmed/27257066 http://dx.doi.org/10.1093/nar/gkw446 |
_version_ | 1782504751914549248 |
---|---|
author | Peng, Pei-Chen Sinha, Saurabh |
author_facet | Peng, Pei-Chen Sinha, Saurabh |
author_sort | Peng, Pei-Chen |
collection | PubMed |
description | Prediction of gene expression levels driven by regulatory sequences is pivotal in genomic biology. A major focus in transcriptional regulation is sequence-to-expression modeling, which interprets the enhancer sequence based on transcription factor concentrations and DNA binding specificities and predicts precise gene expression levels in varying cellular contexts. Such models largely rely on the position weight matrix (PWM) model for DNA binding, and the effect of alternative models based on DNA shape remains unexplored. Here, we propose a statistical thermodynamics model of gene expression using DNA shape features of binding sites. We used rigorous methods to evaluate the fits of expression readouts of 37 enhancers regulating spatial gene expression patterns in Drosophila embryo, and show that DNA shape-based models perform arguably better than PWM-based models. We also observed DNA shape captures information complimentary to the PWM, in a way that is useful for expression modeling. Furthermore, we tested if combining shape and PWM-based features provides better predictions than using either binding model alone. Our work demonstrates that the increasingly popular DNA-binding models based on local DNA shape can be useful in sequence-to-expression modeling. It also provides a framework for future studies to predict gene expression better than with PWM models alone. |
format | Online Article Text |
id | pubmed-5291265 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-52912652017-02-10 Quantitative modeling of gene expression using DNA shape features of binding sites Peng, Pei-Chen Sinha, Saurabh Nucleic Acids Res Method Online Prediction of gene expression levels driven by regulatory sequences is pivotal in genomic biology. A major focus in transcriptional regulation is sequence-to-expression modeling, which interprets the enhancer sequence based on transcription factor concentrations and DNA binding specificities and predicts precise gene expression levels in varying cellular contexts. Such models largely rely on the position weight matrix (PWM) model for DNA binding, and the effect of alternative models based on DNA shape remains unexplored. Here, we propose a statistical thermodynamics model of gene expression using DNA shape features of binding sites. We used rigorous methods to evaluate the fits of expression readouts of 37 enhancers regulating spatial gene expression patterns in Drosophila embryo, and show that DNA shape-based models perform arguably better than PWM-based models. We also observed DNA shape captures information complimentary to the PWM, in a way that is useful for expression modeling. Furthermore, we tested if combining shape and PWM-based features provides better predictions than using either binding model alone. Our work demonstrates that the increasingly popular DNA-binding models based on local DNA shape can be useful in sequence-to-expression modeling. It also provides a framework for future studies to predict gene expression better than with PWM models alone. Oxford University Press 2016-07-27 2016-06-01 /pmc/articles/PMC5291265/ /pubmed/27257066 http://dx.doi.org/10.1093/nar/gkw446 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Method Online Peng, Pei-Chen Sinha, Saurabh Quantitative modeling of gene expression using DNA shape features of binding sites |
title | Quantitative modeling of gene expression using DNA shape features of binding sites |
title_full | Quantitative modeling of gene expression using DNA shape features of binding sites |
title_fullStr | Quantitative modeling of gene expression using DNA shape features of binding sites |
title_full_unstemmed | Quantitative modeling of gene expression using DNA shape features of binding sites |
title_short | Quantitative modeling of gene expression using DNA shape features of binding sites |
title_sort | quantitative modeling of gene expression using dna shape features of binding sites |
topic | Method Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5291265/ https://www.ncbi.nlm.nih.gov/pubmed/27257066 http://dx.doi.org/10.1093/nar/gkw446 |
work_keys_str_mv | AT pengpeichen quantitativemodelingofgeneexpressionusingdnashapefeaturesofbindingsites AT sinhasaurabh quantitativemodelingofgeneexpressionusingdnashapefeaturesofbindingsites |