Cargando…

Reliable scaling of position weight matrices for binding strength comparisons between transcription factors

BACKGROUND: Scoring DNA sequences against Position Weight Matrices (PWMs) is a widely adopted method to identify putative transcription factor binding sites. While common bioinformatics tools produce scores that can reflect the binding strength between a specific transcription factor and the DNA, th...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Xiaoyan, Ezer, Daphne, Navarro, Carmen, Adryan, Boris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4545934/
https://www.ncbi.nlm.nih.gov/pubmed/26289072
http://dx.doi.org/10.1186/s12859-015-0666-1
_version_ 1782386817000013824
author Ma, Xiaoyan
Ezer, Daphne
Navarro, Carmen
Adryan, Boris
author_facet Ma, Xiaoyan
Ezer, Daphne
Navarro, Carmen
Adryan, Boris
author_sort Ma, Xiaoyan
collection PubMed
description BACKGROUND: Scoring DNA sequences against Position Weight Matrices (PWMs) is a widely adopted method to identify putative transcription factor binding sites. While common bioinformatics tools produce scores that can reflect the binding strength between a specific transcription factor and the DNA, these scores are not directly comparable between different transcription factors. Other methods, including p-value associated approaches (Touzet H, Varré J-S. Efficient and accurate p-value computation for position weight matrices. Algorithms Mol Biol. 2007;2(1510.1186):1748–7188), provide more rigorous ways to identify potential binding sites, but their results are difficult to interpret in terms of binding energy, which is essential for the modeling of transcription factor binding dynamics and enhancer activities. RESULTS: Here, we provide two different ways to find the scaling parameter λ that allows us to infer binding energy from a PWM score. The first approach uses a PWM and background genomic sequence as input to estimate λ for a specific transcription factor, which we applied to show that λ distributions for different transcription factor families correspond with their DNA binding properties. Our second method can reliably convert λ between different PWMs of the same transcription factor, which allows us to directly compare PWMs that were generated by different approaches. CONCLUSION: These two approaches provide computationally efficient ways to scale PWM scores and estimate the strength of transcription factor binding sites in quantitative studies of binding dynamics. Their results are consistent with each other and previous reports in most of cases. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0666-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4545934
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45459342015-08-23 Reliable scaling of position weight matrices for binding strength comparisons between transcription factors Ma, Xiaoyan Ezer, Daphne Navarro, Carmen Adryan, Boris BMC Bioinformatics Research Article BACKGROUND: Scoring DNA sequences against Position Weight Matrices (PWMs) is a widely adopted method to identify putative transcription factor binding sites. While common bioinformatics tools produce scores that can reflect the binding strength between a specific transcription factor and the DNA, these scores are not directly comparable between different transcription factors. Other methods, including p-value associated approaches (Touzet H, Varré J-S. Efficient and accurate p-value computation for position weight matrices. Algorithms Mol Biol. 2007;2(1510.1186):1748–7188), provide more rigorous ways to identify potential binding sites, but their results are difficult to interpret in terms of binding energy, which is essential for the modeling of transcription factor binding dynamics and enhancer activities. RESULTS: Here, we provide two different ways to find the scaling parameter λ that allows us to infer binding energy from a PWM score. The first approach uses a PWM and background genomic sequence as input to estimate λ for a specific transcription factor, which we applied to show that λ distributions for different transcription factor families correspond with their DNA binding properties. Our second method can reliably convert λ between different PWMs of the same transcription factor, which allows us to directly compare PWMs that were generated by different approaches. CONCLUSION: These two approaches provide computationally efficient ways to scale PWM scores and estimate the strength of transcription factor binding sites in quantitative studies of binding dynamics. Their results are consistent with each other and previous reports in most of cases. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0666-1) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-20 /pmc/articles/PMC4545934/ /pubmed/26289072 http://dx.doi.org/10.1186/s12859-015-0666-1 Text en © Ma et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ma, Xiaoyan
Ezer, Daphne
Navarro, Carmen
Adryan, Boris
Reliable scaling of position weight matrices for binding strength comparisons between transcription factors
title Reliable scaling of position weight matrices for binding strength comparisons between transcription factors
title_full Reliable scaling of position weight matrices for binding strength comparisons between transcription factors
title_fullStr Reliable scaling of position weight matrices for binding strength comparisons between transcription factors
title_full_unstemmed Reliable scaling of position weight matrices for binding strength comparisons between transcription factors
title_short Reliable scaling of position weight matrices for binding strength comparisons between transcription factors
title_sort reliable scaling of position weight matrices for binding strength comparisons between transcription factors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4545934/
https://www.ncbi.nlm.nih.gov/pubmed/26289072
http://dx.doi.org/10.1186/s12859-015-0666-1
work_keys_str_mv AT maxiaoyan reliablescalingofpositionweightmatricesforbindingstrengthcomparisonsbetweentranscriptionfactors
AT ezerdaphne reliablescalingofpositionweightmatricesforbindingstrengthcomparisonsbetweentranscriptionfactors
AT navarrocarmen reliablescalingofpositionweightmatricesforbindingstrengthcomparisonsbetweentranscriptionfactors
AT adryanboris reliablescalingofpositionweightmatricesforbindingstrengthcomparisonsbetweentranscriptionfactors