Cargando…

Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems

Mapping genome-wide binding sites of all transcription factors (TFs) in all biological contexts is a critical step toward understanding gene regulation. The state-of-the-art technologies for mapping transcription factor binding sites (TFBSs) couple chromatin immunoprecipitation (ChIP) with high-thro...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Yingying, Wu, George, Ji, Hongkai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer-Verlag 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3677239/
https://www.ncbi.nlm.nih.gov/pubmed/23762209
http://dx.doi.org/10.1007/s12561-012-9066-5
_version_ 1782272715236835328
author Wei, Yingying
Wu, George
Ji, Hongkai
author_facet Wei, Yingying
Wu, George
Ji, Hongkai
author_sort Wei, Yingying
collection PubMed
description Mapping genome-wide binding sites of all transcription factors (TFs) in all biological contexts is a critical step toward understanding gene regulation. The state-of-the-art technologies for mapping transcription factor binding sites (TFBSs) couple chromatin immunoprecipitation (ChIP) with high-throughput sequencing (ChIP-seq) or tiling array hybridization (ChIP-chip). These technologies have limitations: they are low-throughput with respect to surveying many TFs. Recent advances in genome-wide chromatin profiling, including development of technologies such as DNase-seq, FAIRE-seq and ChIP-seq for histone modifications, make it possible to predict in vivo TFBSs by analyzing chromatin features at computationally determined DNA motif sites. This promising new approach may allow researchers to monitor the genome-wide binding sites of many TFs simultaneously. In this article, we discuss various experimental design and data analysis issues that arise when applying this approach. Through a systematic analysis of the data from the Encyclopedia Of DNA Elements (ENCODE) project, we compare the predictive power of individual and combinations of chromatin marks using supervised and unsupervised learning methods, and evaluate the value of integrating information from public ChIP and gene expression data. We also highlight the challenges and opportunities for developing novel analytical methods, such as resolving the one-motif-multiple-TF ambiguity and distinguishing functional and non-functional TF binding targets from the predicted binding sites. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s12561-012-9066-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-3677239
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Springer-Verlag
record_format MEDLINE/PubMed
spelling pubmed-36772392013-06-10 Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems Wei, Yingying Wu, George Ji, Hongkai Stat Biosci Article Mapping genome-wide binding sites of all transcription factors (TFs) in all biological contexts is a critical step toward understanding gene regulation. The state-of-the-art technologies for mapping transcription factor binding sites (TFBSs) couple chromatin immunoprecipitation (ChIP) with high-throughput sequencing (ChIP-seq) or tiling array hybridization (ChIP-chip). These technologies have limitations: they are low-throughput with respect to surveying many TFs. Recent advances in genome-wide chromatin profiling, including development of technologies such as DNase-seq, FAIRE-seq and ChIP-seq for histone modifications, make it possible to predict in vivo TFBSs by analyzing chromatin features at computationally determined DNA motif sites. This promising new approach may allow researchers to monitor the genome-wide binding sites of many TFs simultaneously. In this article, we discuss various experimental design and data analysis issues that arise when applying this approach. Through a systematic analysis of the data from the Encyclopedia Of DNA Elements (ENCODE) project, we compare the predictive power of individual and combinations of chromatin marks using supervised and unsupervised learning methods, and evaluate the value of integrating information from public ChIP and gene expression data. We also highlight the challenges and opportunities for developing novel analytical methods, such as resolving the one-motif-multiple-TF ambiguity and distinguishing functional and non-functional TF binding targets from the predicted binding sites. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s12561-012-9066-5) contains supplementary material, which is available to authorized users. Springer-Verlag 2012-05-23 2013 /pmc/articles/PMC3677239/ /pubmed/23762209 http://dx.doi.org/10.1007/s12561-012-9066-5 Text en © The Author(s) 2012 https://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
spellingShingle Article
Wei, Yingying
Wu, George
Ji, Hongkai
Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems
title Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems
title_full Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems
title_fullStr Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems
title_full_unstemmed Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems
title_short Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems
title_sort global mapping of transcription factor binding sites by sequencing chromatin surrogates: a perspective on experimental design, data analysis, and open problems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3677239/
https://www.ncbi.nlm.nih.gov/pubmed/23762209
http://dx.doi.org/10.1007/s12561-012-9066-5
work_keys_str_mv AT weiyingying globalmappingoftranscriptionfactorbindingsitesbysequencingchromatinsurrogatesaperspectiveonexperimentaldesigndataanalysisandopenproblems
AT wugeorge globalmappingoftranscriptionfactorbindingsitesbysequencingchromatinsurrogatesaperspectiveonexperimentaldesigndataanalysisandopenproblems
AT jihongkai globalmappingoftranscriptionfactorbindingsitesbysequencingchromatinsurrogatesaperspectiveonexperimentaldesigndataanalysisandopenproblems