Cargando…

Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions

Recent experimental and theoretical efforts have highlighted the fact that binding of transcription factors to DNA can be more accurately described by continuous measures of their binding affinities, rather than a discrete description in terms of binding sites. While the binding affinities can be pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Manke, Thomas, Roider, Helge G., Vingron, Martin
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2266803/
https://www.ncbi.nlm.nih.gov/pubmed/18369429
http://dx.doi.org/10.1371/journal.pcbi.1000039
_version_ 1782151565515161600
author Manke, Thomas
Roider, Helge G.
Vingron, Martin
author_facet Manke, Thomas
Roider, Helge G.
Vingron, Martin
author_sort Manke, Thomas
collection PubMed
description Recent experimental and theoretical efforts have highlighted the fact that binding of transcription factors to DNA can be more accurately described by continuous measures of their binding affinities, rather than a discrete description in terms of binding sites. While the binding affinities can be predicted from a physical model, it is often desirable to know the distribution of binding affinities for specific sequence backgrounds. In this paper, we present a statistical approach to derive the exact distribution for sequence models with fixed GC content. We demonstrate that the affinity distribution of almost all known transcription factors can be effectively parametrized by a class of generalized extreme value distributions. Moreover, this parameterization also describes the affinity distribution for sequence backgrounds with variable GC content, such as human promoter sequences. Our approach is applicable to arbitrary sequences and all transcription factors with known binding preferences that can be described in terms of a motif matrix. The statistical treatment also provides a proper framework to directly compare transcription factors with very different affinity distributions. This is illustrated by our analysis of human promoters with known binding sites, for many of which we could identify the known regulators as those with the highest affinity. The combination of physical model and statistical normalization provides a quantitative measure which ranks transcription factors for a given sequence, and which can be compared directly with large-scale binding data. Its successful application to human promoter sequences serves as an encouraging example of how the method can be applied to other sequences.
format Text
id pubmed-2266803
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-22668032008-03-21 Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions Manke, Thomas Roider, Helge G. Vingron, Martin PLoS Comput Biol Research Article Recent experimental and theoretical efforts have highlighted the fact that binding of transcription factors to DNA can be more accurately described by continuous measures of their binding affinities, rather than a discrete description in terms of binding sites. While the binding affinities can be predicted from a physical model, it is often desirable to know the distribution of binding affinities for specific sequence backgrounds. In this paper, we present a statistical approach to derive the exact distribution for sequence models with fixed GC content. We demonstrate that the affinity distribution of almost all known transcription factors can be effectively parametrized by a class of generalized extreme value distributions. Moreover, this parameterization also describes the affinity distribution for sequence backgrounds with variable GC content, such as human promoter sequences. Our approach is applicable to arbitrary sequences and all transcription factors with known binding preferences that can be described in terms of a motif matrix. The statistical treatment also provides a proper framework to directly compare transcription factors with very different affinity distributions. This is illustrated by our analysis of human promoters with known binding sites, for many of which we could identify the known regulators as those with the highest affinity. The combination of physical model and statistical normalization provides a quantitative measure which ranks transcription factors for a given sequence, and which can be compared directly with large-scale binding data. Its successful application to human promoter sequences serves as an encouraging example of how the method can be applied to other sequences. Public Library of Science 2008-03-21 /pmc/articles/PMC2266803/ /pubmed/18369429 http://dx.doi.org/10.1371/journal.pcbi.1000039 Text en Manke et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Manke, Thomas
Roider, Helge G.
Vingron, Martin
Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions
title Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions
title_full Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions
title_fullStr Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions
title_full_unstemmed Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions
title_short Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions
title_sort statistical modeling of transcription factor binding affinities predicts regulatory interactions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2266803/
https://www.ncbi.nlm.nih.gov/pubmed/18369429
http://dx.doi.org/10.1371/journal.pcbi.1000039
work_keys_str_mv AT mankethomas statisticalmodelingoftranscriptionfactorbindingaffinitiespredictsregulatoryinteractions
AT roiderhelgeg statisticalmodelingoftranscriptionfactorbindingaffinitiespredictsregulatoryinteractions
AT vingronmartin statisticalmodelingoftranscriptionfactorbindingaffinitiespredictsregulatoryinteractions