Cargando…

Erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules

BACKGROUND: Cis-regulatory modules are bound by transcription factors to regulate gene expression. Characterizing these DNA sequences is central to understanding gene regulatory networks and gaining insight into mechanisms of transcriptional regulation, but genome-scale regulatory module discovery r...

Descripción completa

Detalles Bibliográficos
Autores principales: Halfon, Marc S, Zhu, Qianqian, Brennan, Elizabeth R, Zhou, Yiyun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3235160/
https://www.ncbi.nlm.nih.gov/pubmed/22115527
http://dx.doi.org/10.1186/1471-2164-12-578
_version_ 1782218575688237056
author Halfon, Marc S
Zhu, Qianqian
Brennan, Elizabeth R
Zhou, Yiyun
author_facet Halfon, Marc S
Zhu, Qianqian
Brennan, Elizabeth R
Zhou, Yiyun
author_sort Halfon, Marc S
collection PubMed
description BACKGROUND: Cis-regulatory modules are bound by transcription factors to regulate gene expression. Characterizing these DNA sequences is central to understanding gene regulatory networks and gaining insight into mechanisms of transcriptional regulation, but genome-scale regulatory module discovery remains a challenge. One popular approach is to scan the genome for clusters of transcription factor binding sites, especially those conserved in related species. When such approaches are successful, it is typically assumed that the activity of the modules is mediated by the identified binding sites and their cognate transcription factors. However, the validity of this assumption is often not assessed. RESULTS: We successfully predicted five new cis-regulatory modules by combining binding site identification with sequence conservation and compared these to unsuccessful predictions from a related approach not utilizing sequence conservation. Despite greatly improved predictive success, the positive set had similar degrees of sequence and binding site conservation as the negative set. We explored the reasons for this by mutagenizing putative binding sites in three cis-regulatory modules. A large proportion of the tested sites had little or no demonstrable role in mediating regulatory element activity. Examination of loss-of-function mutants also showed that some transcription factors supposedly binding to the modules are not required for their function. CONCLUSIONS: Our results raise important questions about interpreting regulatory module predictions obtained by finding clusters of conserved binding sites. Attribution of function to these sites and their cognate transcription factors may be incorrect even when modules are successfully identified. Our study underscores the importance of empirical validation of computational results even when these results are in line with expectation.
format Online
Article
Text
id pubmed-3235160
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32351602011-12-10 Erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules Halfon, Marc S Zhu, Qianqian Brennan, Elizabeth R Zhou, Yiyun BMC Genomics Research Article BACKGROUND: Cis-regulatory modules are bound by transcription factors to regulate gene expression. Characterizing these DNA sequences is central to understanding gene regulatory networks and gaining insight into mechanisms of transcriptional regulation, but genome-scale regulatory module discovery remains a challenge. One popular approach is to scan the genome for clusters of transcription factor binding sites, especially those conserved in related species. When such approaches are successful, it is typically assumed that the activity of the modules is mediated by the identified binding sites and their cognate transcription factors. However, the validity of this assumption is often not assessed. RESULTS: We successfully predicted five new cis-regulatory modules by combining binding site identification with sequence conservation and compared these to unsuccessful predictions from a related approach not utilizing sequence conservation. Despite greatly improved predictive success, the positive set had similar degrees of sequence and binding site conservation as the negative set. We explored the reasons for this by mutagenizing putative binding sites in three cis-regulatory modules. A large proportion of the tested sites had little or no demonstrable role in mediating regulatory element activity. Examination of loss-of-function mutants also showed that some transcription factors supposedly binding to the modules are not required for their function. CONCLUSIONS: Our results raise important questions about interpreting regulatory module predictions obtained by finding clusters of conserved binding sites. Attribution of function to these sites and their cognate transcription factors may be incorrect even when modules are successfully identified. Our study underscores the importance of empirical validation of computational results even when these results are in line with expectation. BioMed Central 2011-11-25 /pmc/articles/PMC3235160/ /pubmed/22115527 http://dx.doi.org/10.1186/1471-2164-12-578 Text en Copyright ©2011 Halfon et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Halfon, Marc S
Zhu, Qianqian
Brennan, Elizabeth R
Zhou, Yiyun
Erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules
title Erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules
title_full Erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules
title_fullStr Erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules
title_full_unstemmed Erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules
title_short Erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules
title_sort erroneous attribution of relevant transcription factor binding sites despite successful prediction of cis-regulatory modules
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3235160/
https://www.ncbi.nlm.nih.gov/pubmed/22115527
http://dx.doi.org/10.1186/1471-2164-12-578
work_keys_str_mv AT halfonmarcs erroneousattributionofrelevanttranscriptionfactorbindingsitesdespitesuccessfulpredictionofcisregulatorymodules
AT zhuqianqian erroneousattributionofrelevanttranscriptionfactorbindingsitesdespitesuccessfulpredictionofcisregulatorymodules
AT brennanelizabethr erroneousattributionofrelevanttranscriptionfactorbindingsitesdespitesuccessfulpredictionofcisregulatorymodules
AT zhouyiyun erroneousattributionofrelevanttranscriptionfactorbindingsitesdespitesuccessfulpredictionofcisregulatorymodules