Cargando…
Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems
Mapping genome-wide binding sites of all transcription factors (TFs) in all biological contexts is a critical step toward understanding gene regulation. The state-of-the-art technologies for mapping transcription factor binding sites (TFBSs) couple chromatin immunoprecipitation (ChIP) with high-thro...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer-Verlag
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3677239/ https://www.ncbi.nlm.nih.gov/pubmed/23762209 http://dx.doi.org/10.1007/s12561-012-9066-5 |
_version_ | 1782272715236835328 |
---|---|
author | Wei, Yingying Wu, George Ji, Hongkai |
author_facet | Wei, Yingying Wu, George Ji, Hongkai |
author_sort | Wei, Yingying |
collection | PubMed |
description | Mapping genome-wide binding sites of all transcription factors (TFs) in all biological contexts is a critical step toward understanding gene regulation. The state-of-the-art technologies for mapping transcription factor binding sites (TFBSs) couple chromatin immunoprecipitation (ChIP) with high-throughput sequencing (ChIP-seq) or tiling array hybridization (ChIP-chip). These technologies have limitations: they are low-throughput with respect to surveying many TFs. Recent advances in genome-wide chromatin profiling, including development of technologies such as DNase-seq, FAIRE-seq and ChIP-seq for histone modifications, make it possible to predict in vivo TFBSs by analyzing chromatin features at computationally determined DNA motif sites. This promising new approach may allow researchers to monitor the genome-wide binding sites of many TFs simultaneously. In this article, we discuss various experimental design and data analysis issues that arise when applying this approach. Through a systematic analysis of the data from the Encyclopedia Of DNA Elements (ENCODE) project, we compare the predictive power of individual and combinations of chromatin marks using supervised and unsupervised learning methods, and evaluate the value of integrating information from public ChIP and gene expression data. We also highlight the challenges and opportunities for developing novel analytical methods, such as resolving the one-motif-multiple-TF ambiguity and distinguishing functional and non-functional TF binding targets from the predicted binding sites. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s12561-012-9066-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-3677239 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Springer-Verlag |
record_format | MEDLINE/PubMed |
spelling | pubmed-36772392013-06-10 Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems Wei, Yingying Wu, George Ji, Hongkai Stat Biosci Article Mapping genome-wide binding sites of all transcription factors (TFs) in all biological contexts is a critical step toward understanding gene regulation. The state-of-the-art technologies for mapping transcription factor binding sites (TFBSs) couple chromatin immunoprecipitation (ChIP) with high-throughput sequencing (ChIP-seq) or tiling array hybridization (ChIP-chip). These technologies have limitations: they are low-throughput with respect to surveying many TFs. Recent advances in genome-wide chromatin profiling, including development of technologies such as DNase-seq, FAIRE-seq and ChIP-seq for histone modifications, make it possible to predict in vivo TFBSs by analyzing chromatin features at computationally determined DNA motif sites. This promising new approach may allow researchers to monitor the genome-wide binding sites of many TFs simultaneously. In this article, we discuss various experimental design and data analysis issues that arise when applying this approach. Through a systematic analysis of the data from the Encyclopedia Of DNA Elements (ENCODE) project, we compare the predictive power of individual and combinations of chromatin marks using supervised and unsupervised learning methods, and evaluate the value of integrating information from public ChIP and gene expression data. We also highlight the challenges and opportunities for developing novel analytical methods, such as resolving the one-motif-multiple-TF ambiguity and distinguishing functional and non-functional TF binding targets from the predicted binding sites. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s12561-012-9066-5) contains supplementary material, which is available to authorized users. Springer-Verlag 2012-05-23 2013 /pmc/articles/PMC3677239/ /pubmed/23762209 http://dx.doi.org/10.1007/s12561-012-9066-5 Text en © The Author(s) 2012 https://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited. |
spellingShingle | Article Wei, Yingying Wu, George Ji, Hongkai Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems |
title | Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems |
title_full | Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems |
title_fullStr | Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems |
title_full_unstemmed | Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems |
title_short | Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems |
title_sort | global mapping of transcription factor binding sites by sequencing chromatin surrogates: a perspective on experimental design, data analysis, and open problems |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3677239/ https://www.ncbi.nlm.nih.gov/pubmed/23762209 http://dx.doi.org/10.1007/s12561-012-9066-5 |
work_keys_str_mv | AT weiyingying globalmappingoftranscriptionfactorbindingsitesbysequencingchromatinsurrogatesaperspectiveonexperimentaldesigndataanalysisandopenproblems AT wugeorge globalmappingoftranscriptionfactorbindingsitesbysequencingchromatinsurrogatesaperspectiveonexperimentaldesigndataanalysisandopenproblems AT jihongkai globalmappingoftranscriptionfactorbindingsitesbysequencingchromatinsurrogatesaperspectiveonexperimentaldesigndataanalysisandopenproblems |