Cargando…

TherMos: Estimating protein–DNA binding energies from in vivo binding profiles

Accurately characterizing transcription factor (TF)-DNA affinity is a central goal of regulatory genomics. Although thermodynamics provides the most natural language for describing the continuous range of TF-DNA affinity, traditional motif discovery algorithms focus instead on classification paradig...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Wenjie, Hu, Xiaoming, Lim, Michael H. K., Ng, Calista K. L., Choo, Siew Hua, Castro, Diogo S., Drechsel, Daniela, Guillemot, François, Kolatkar, Prasanna R., Jauch, Ralf, Prabhakar, Shyam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3675472/
https://www.ncbi.nlm.nih.gov/pubmed/23595148
http://dx.doi.org/10.1093/nar/gkt250
_version_ 1782476098047574016
author Sun, Wenjie
Hu, Xiaoming
Lim, Michael H. K.
Ng, Calista K. L.
Choo, Siew Hua
Castro, Diogo S.
Drechsel, Daniela
Guillemot, François
Kolatkar, Prasanna R.
Jauch, Ralf
Prabhakar, Shyam
author_facet Sun, Wenjie
Hu, Xiaoming
Lim, Michael H. K.
Ng, Calista K. L.
Choo, Siew Hua
Castro, Diogo S.
Drechsel, Daniela
Guillemot, François
Kolatkar, Prasanna R.
Jauch, Ralf
Prabhakar, Shyam
author_sort Sun, Wenjie
collection PubMed
description Accurately characterizing transcription factor (TF)-DNA affinity is a central goal of regulatory genomics. Although thermodynamics provides the most natural language for describing the continuous range of TF-DNA affinity, traditional motif discovery algorithms focus instead on classification paradigms that aim to discriminate ‘bound’ and ‘unbound’ sequences. Moreover, these algorithms do not directly model the distribution of tags in ChIP-seq data. Here, we present a new algorithm named Thermodynamic Modeling of ChIP-seq (TherMos), which directly estimates a position-specific binding energy matrix (PSEM) from ChIP-seq/exo tag profiles. In cross-validation tests on seven genome-wide TF-DNA binding profiles, one of which we generated via ChIP-seq on a complex developing tissue, TherMos predicted quantitative TF-DNA binding with greater accuracy than five well-known algorithms. We experimentally validated TherMos binding energy models for Klf4 and Esrrb, using a novel protocol to measure PSEMs in vitro. Strikingly, our measurements revealed strong non-additivity at multiple positions within the two PSEMs. Among the algorithms tested, only TherMos was able to model the entire binding energy landscape of Klf4 and Esrrb. Our study reveals new insights into the energetics of TF-DNA binding in vivo and provides an accurate first-principles approach to binding energy inference from ChIP-seq and ChIP-exo data.
format Online
Article
Text
id pubmed-3675472
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36754722013-06-07 TherMos: Estimating protein–DNA binding energies from in vivo binding profiles Sun, Wenjie Hu, Xiaoming Lim, Michael H. K. Ng, Calista K. L. Choo, Siew Hua Castro, Diogo S. Drechsel, Daniela Guillemot, François Kolatkar, Prasanna R. Jauch, Ralf Prabhakar, Shyam Nucleic Acids Res Computational Biology Accurately characterizing transcription factor (TF)-DNA affinity is a central goal of regulatory genomics. Although thermodynamics provides the most natural language for describing the continuous range of TF-DNA affinity, traditional motif discovery algorithms focus instead on classification paradigms that aim to discriminate ‘bound’ and ‘unbound’ sequences. Moreover, these algorithms do not directly model the distribution of tags in ChIP-seq data. Here, we present a new algorithm named Thermodynamic Modeling of ChIP-seq (TherMos), which directly estimates a position-specific binding energy matrix (PSEM) from ChIP-seq/exo tag profiles. In cross-validation tests on seven genome-wide TF-DNA binding profiles, one of which we generated via ChIP-seq on a complex developing tissue, TherMos predicted quantitative TF-DNA binding with greater accuracy than five well-known algorithms. We experimentally validated TherMos binding energy models for Klf4 and Esrrb, using a novel protocol to measure PSEMs in vitro. Strikingly, our measurements revealed strong non-additivity at multiple positions within the two PSEMs. Among the algorithms tested, only TherMos was able to model the entire binding energy landscape of Klf4 and Esrrb. Our study reveals new insights into the energetics of TF-DNA binding in vivo and provides an accurate first-principles approach to binding energy inference from ChIP-seq and ChIP-exo data. Oxford University Press 2013-06 2013-04-16 /pmc/articles/PMC3675472/ /pubmed/23595148 http://dx.doi.org/10.1093/nar/gkt250 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Sun, Wenjie
Hu, Xiaoming
Lim, Michael H. K.
Ng, Calista K. L.
Choo, Siew Hua
Castro, Diogo S.
Drechsel, Daniela
Guillemot, François
Kolatkar, Prasanna R.
Jauch, Ralf
Prabhakar, Shyam
TherMos: Estimating protein–DNA binding energies from in vivo binding profiles
title TherMos: Estimating protein–DNA binding energies from in vivo binding profiles
title_full TherMos: Estimating protein–DNA binding energies from in vivo binding profiles
title_fullStr TherMos: Estimating protein–DNA binding energies from in vivo binding profiles
title_full_unstemmed TherMos: Estimating protein–DNA binding energies from in vivo binding profiles
title_short TherMos: Estimating protein–DNA binding energies from in vivo binding profiles
title_sort thermos: estimating protein–dna binding energies from in vivo binding profiles
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3675472/
https://www.ncbi.nlm.nih.gov/pubmed/23595148
http://dx.doi.org/10.1093/nar/gkt250
work_keys_str_mv AT sunwenjie thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT huxiaoming thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT limmichaelhk thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT ngcalistakl thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT choosiewhua thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT castrodiogos thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT drechseldaniela thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT guillemotfrancois thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT kolatkarprasannar thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT jauchralf thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles
AT prabhakarshyam thermosestimatingproteindnabindingenergiesfrominvivobindingprofiles