Inherent limitations of probabilistic models for protein-DNA binding specificity

The specificities of transcription factors are most commonly represented with probabilistic models. These models provide a probability for each base occurring at each position within the binding site and the positions are assumed to contribute independently. The model is simple and intuitive and is...

Bibliographic Details
Main authors: Ruan, Shuxiang; Stormo, Gary D.
Format: Online Article Text
Language: English
Published: Public Library of Science 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5521849/
https://www.ncbi.nlm.nih.gov/pubmed/28686588
http://dx.doi.org/10.1371/journal.pcbi.1005638
_version_ 1783252049077469184
author Ruan, Shuxiang
Stormo, Gary D.
author_facet Ruan, Shuxiang
Stormo, Gary D.
author_sort Ruan, Shuxiang
collection PubMed
description The specificities of transcription factors are most commonly represented with probabilistic models. These models provide a probability for each base occurring at each position within the binding site and the positions are assumed to contribute independently. The model is simple and intuitive and is the basis for many motif discovery algorithms. However, the model also has inherent limitations that prevent it from accurately representing true binding probabilities, especially for the highest affinity sites under conditions of high protein concentration. The limitations are not due to the assumption of independence between positions but rather are caused by the non-linear relationship between binding affinity and binding probability and the fact that independent normalization at each position skews the site probabilities. Generally probabilistic models are reasonably good approximations, but new high-throughput methods allow for biophysical models with increased accuracy that should be used whenever possible.
format Online
Article
Text
id pubmed-5521849
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-5521849 2017-08-07 Inherent limitations of probabilistic models for protein-DNA binding specificity Ruan, Shuxiang Stormo, Gary D. PLoS Comput Biol Research Article The specificities of transcription factors are most commonly represented with probabilistic models. These models provide a probability for each base occurring at each position within the binding site and the positions are assumed to contribute independently. The model is simple and intuitive and is the basis for many motif discovery algorithms. However, the model also has inherent limitations that prevent it from accurately representing true binding probabilities, especially for the highest affinity sites under conditions of high protein concentration. The limitations are not due to the assumption of independence between positions but rather are caused by the non-linear relationship between binding affinity and binding probability and the fact that independent normalization at each position skews the site probabilities. Generally probabilistic models are reasonably good approximations, but new high-throughput methods allow for biophysical models with increased accuracy that should be used whenever possible. Public Library of Science 2017-07-07 /pmc/articles/PMC5521849/ /pubmed/28686588 http://dx.doi.org/10.1371/journal.pcbi.1005638 Text en © 2017 Ruan, Stormo http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ruan, Shuxiang
Stormo, Gary D.
Inherent limitations of probabilistic models for protein-DNA binding specificity
title Inherent limitations of probabilistic models for protein-DNA binding specificity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5521849/
https://www.ncbi.nlm.nih.gov/pubmed/28686588
http://dx.doi.org/10.1371/journal.pcbi.1005638
work_keys_str_mv AT ruanshuxiang inherentlimitationsofprobabilisticmodelsforproteindnabindingspecificity
AT stormogaryd inherentlimitationsofprobabilisticmodelsforproteindnabindingspecificity
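
The description field contrasts a position-independent probabilistic model with the non-linear relationship between binding affinity and binding probability. The following is a minimal Python sketch of that contrast, not the authors' implementation. The energy matrix `ENERGY`, the three-base site length, the function names, and the parameter `mu` (standing in for the log of the free protein concentration) are all hypothetical and chosen only for illustration. The occupancy formula is the standard equilibrium form for a single site, used here as an assumption.

```python
import itertools
import math

BASES = "ACGT"

# Hypothetical additive energy matrix in units of kT (an assumption for
# illustration). Lower energy means tighter binding; 0 marks the preferred base.
ENERGY = [
    {"A": 0.0, "C": 2.0, "G": 2.5, "T": 3.0},
    {"A": 1.5, "C": 0.0, "G": 2.0, "T": 2.5},
    {"A": 3.0, "C": 2.0, "G": 0.0, "T": 1.0},
]


def site_energy(site):
    """Total binding energy: positions contribute independently (additively)."""
    return sum(ENERGY[i][base] for i, base in enumerate(site))


def position_probabilities():
    """Per-position base probabilities, normalized separately at each position."""
    pwm = []
    for column in ENERGY:
        weights = {base: math.exp(-energy) for base, energy in column.items()}
        total = sum(weights.values())
        pwm.append({base: w / total for base, w in weights.items()})
    return pwm


def pwm_probability(site, pwm):
    """Probabilistic-model score: product of the per-position probabilities."""
    return math.prod(pwm[i][base] for i, base in enumerate(site))


def occupancy(site, mu):
    """Equilibrium probability that the site is bound (assumed logistic form).

    mu plays the role of ln(free protein concentration); a larger mu means
    more protein, and strong sites approach an occupancy of 1.
    """
    return 1.0 / (1.0 + math.exp(site_energy(site) - mu))


if __name__ == "__main__":
    pwm = position_probabilities()
    best, weaker = "ACG", "ACT"  # the two sites differ by 1 kT at the last position

    # The probabilistic model gives a fixed ratio, whatever the concentration.
    print("PWM probability ratio:", pwm_probability(best, pwm) / pwm_probability(weaker, pwm))

    # The occupancy ratio shrinks toward 1 as protein concentration rises.
    for mu in (-10.0, 0.0, 5.0):
        ratio = occupancy(best, mu) / occupancy(weaker, mu)
        print(f"mu={mu:+.1f}  occupancy ratio: {ratio:.4f}")
```

In this sketch the two quantities agree closely when `mu` is very negative (low protein concentration) and diverge as `mu` grows, because both sites approach saturation while the probabilistic ratio stays fixed. This is consistent with the limitation the description attributes to the highest affinity sites at high protein concentration.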