Cargando…
On the detection and refinement of transcription factor binding sites using ChIP-Seq data
Coupling chromatin immunoprecipitation (ChIP) with recently developed massively parallel sequencing technologies has enabled genome-wide detection of protein–DNA interactions with unprecedented sensitivity and specificity. This new technology, ChIP-Seq, presents opportunities for in-depth analysis o...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2853110/ https://www.ncbi.nlm.nih.gov/pubmed/20056654 http://dx.doi.org/10.1093/nar/gkp1180 |
_version_ | 1782180010615898112 |
---|---|
author | Hu, Ming Yu, Jindan Taylor, Jeremy M. G. Chinnaiyan, Arul M. Qin, Zhaohui S. |
author_facet | Hu, Ming Yu, Jindan Taylor, Jeremy M. G. Chinnaiyan, Arul M. Qin, Zhaohui S. |
author_sort | Hu, Ming |
collection | PubMed |
description | Coupling chromatin immunoprecipitation (ChIP) with recently developed massively parallel sequencing technologies has enabled genome-wide detection of protein–DNA interactions with unprecedented sensitivity and specificity. This new technology, ChIP-Seq, presents opportunities for in-depth analysis of transcription regulation. In this study, we explore the value of using ChIP-Seq data to better detect and refine transcription factor binding sites (TFBS). We introduce a novel computational algorithm named Hybrid Motif Sampler (HMS), specifically designed for TFBS motif discovery in ChIP-Seq data. We propose a Bayesian model that incorporates sequencing depth information to aid motif identification. Our model also allows intra-motif dependency to describe more accurately the underlying motif pattern. Our algorithm combines stochastic sampling and deterministic ‘greedy’ search steps into a novel hybrid iterative scheme. This combination accelerates the computation process. Simulation studies demonstrate favorable performance of HMS compared to other existing methods. When applying HMS to real ChIP-Seq datasets, we find that (i) the accuracy of existing TFBS motif patterns can be significantly improved; and (ii) there is significant intra-motif dependency inside all the TFBS motifs we tested; modeling these dependencies further improves the accuracy of these TFBS motif patterns. These findings may offer new biological insights into the mechanisms of transcription factor regulation. |
format | Text |
id | pubmed-2853110 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-28531102010-04-12 On the detection and refinement of transcription factor binding sites using ChIP-Seq data Hu, Ming Yu, Jindan Taylor, Jeremy M. G. Chinnaiyan, Arul M. Qin, Zhaohui S. Nucleic Acids Res Computational Biology Coupling chromatin immunoprecipitation (ChIP) with recently developed massively parallel sequencing technologies has enabled genome-wide detection of protein–DNA interactions with unprecedented sensitivity and specificity. This new technology, ChIP-Seq, presents opportunities for in-depth analysis of transcription regulation. In this study, we explore the value of using ChIP-Seq data to better detect and refine transcription factor binding sites (TFBS). We introduce a novel computational algorithm named Hybrid Motif Sampler (HMS), specifically designed for TFBS motif discovery in ChIP-Seq data. We propose a Bayesian model that incorporates sequencing depth information to aid motif identification. Our model also allows intra-motif dependency to describe more accurately the underlying motif pattern. Our algorithm combines stochastic sampling and deterministic ‘greedy’ search steps into a novel hybrid iterative scheme. This combination accelerates the computation process. Simulation studies demonstrate favorable performance of HMS compared to other existing methods. When applying HMS to real ChIP-Seq datasets, we find that (i) the accuracy of existing TFBS motif patterns can be significantly improved; and (ii) there is significant intra-motif dependency inside all the TFBS motifs we tested; modeling these dependencies further improves the accuracy of these TFBS motif patterns. These findings may offer new biological insights into the mechanisms of transcription factor regulation. Oxford University Press 2010-04 2010-01-06 /pmc/articles/PMC2853110/ /pubmed/20056654 http://dx.doi.org/10.1093/nar/gkp1180 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Hu, Ming Yu, Jindan Taylor, Jeremy M. G. Chinnaiyan, Arul M. Qin, Zhaohui S. On the detection and refinement of transcription factor binding sites using ChIP-Seq data |
title | On the detection and refinement of transcription factor binding sites using ChIP-Seq data |
title_full | On the detection and refinement of transcription factor binding sites using ChIP-Seq data |
title_fullStr | On the detection and refinement of transcription factor binding sites using ChIP-Seq data |
title_full_unstemmed | On the detection and refinement of transcription factor binding sites using ChIP-Seq data |
title_short | On the detection and refinement of transcription factor binding sites using ChIP-Seq data |
title_sort | on the detection and refinement of transcription factor binding sites using chip-seq data |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2853110/ https://www.ncbi.nlm.nih.gov/pubmed/20056654 http://dx.doi.org/10.1093/nar/gkp1180 |
work_keys_str_mv | AT huming onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata AT yujindan onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata AT taylorjeremymg onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata AT chinnaiyanarulm onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata AT qinzhaohuis onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata |