Cargando…

On the detection and refinement of transcription factor binding sites using ChIP-Seq data

Coupling chromatin immunoprecipitation (ChIP) with recently developed massively parallel sequencing technologies has enabled genome-wide detection of protein–DNA interactions with unprecedented sensitivity and specificity. This new technology, ChIP-Seq, presents opportunities for in-depth analysis o...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Ming, Yu, Jindan, Taylor, Jeremy M. G., Chinnaiyan, Arul M., Qin, Zhaohui S.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2853110/
https://www.ncbi.nlm.nih.gov/pubmed/20056654
http://dx.doi.org/10.1093/nar/gkp1180
_version_ 1782180010615898112
author Hu, Ming
Yu, Jindan
Taylor, Jeremy M. G.
Chinnaiyan, Arul M.
Qin, Zhaohui S.
author_facet Hu, Ming
Yu, Jindan
Taylor, Jeremy M. G.
Chinnaiyan, Arul M.
Qin, Zhaohui S.
author_sort Hu, Ming
collection PubMed
description Coupling chromatin immunoprecipitation (ChIP) with recently developed massively parallel sequencing technologies has enabled genome-wide detection of protein–DNA interactions with unprecedented sensitivity and specificity. This new technology, ChIP-Seq, presents opportunities for in-depth analysis of transcription regulation. In this study, we explore the value of using ChIP-Seq data to better detect and refine transcription factor binding sites (TFBS). We introduce a novel computational algorithm named Hybrid Motif Sampler (HMS), specifically designed for TFBS motif discovery in ChIP-Seq data. We propose a Bayesian model that incorporates sequencing depth information to aid motif identification. Our model also allows intra-motif dependency to describe more accurately the underlying motif pattern. Our algorithm combines stochastic sampling and deterministic ‘greedy’ search steps into a novel hybrid iterative scheme. This combination accelerates the computation process. Simulation studies demonstrate favorable performance of HMS compared to other existing methods. When applying HMS to real ChIP-Seq datasets, we find that (i) the accuracy of existing TFBS motif patterns can be significantly improved; and (ii) there is significant intra-motif dependency inside all the TFBS motifs we tested; modeling these dependencies further improves the accuracy of these TFBS motif patterns. These findings may offer new biological insights into the mechanisms of transcription factor regulation.
format Text
id pubmed-2853110
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28531102010-04-12 On the detection and refinement of transcription factor binding sites using ChIP-Seq data Hu, Ming Yu, Jindan Taylor, Jeremy M. G. Chinnaiyan, Arul M. Qin, Zhaohui S. Nucleic Acids Res Computational Biology Coupling chromatin immunoprecipitation (ChIP) with recently developed massively parallel sequencing technologies has enabled genome-wide detection of protein–DNA interactions with unprecedented sensitivity and specificity. This new technology, ChIP-Seq, presents opportunities for in-depth analysis of transcription regulation. In this study, we explore the value of using ChIP-Seq data to better detect and refine transcription factor binding sites (TFBS). We introduce a novel computational algorithm named Hybrid Motif Sampler (HMS), specifically designed for TFBS motif discovery in ChIP-Seq data. We propose a Bayesian model that incorporates sequencing depth information to aid motif identification. Our model also allows intra-motif dependency to describe more accurately the underlying motif pattern. Our algorithm combines stochastic sampling and deterministic ‘greedy’ search steps into a novel hybrid iterative scheme. This combination accelerates the computation process. Simulation studies demonstrate favorable performance of HMS compared to other existing methods. When applying HMS to real ChIP-Seq datasets, we find that (i) the accuracy of existing TFBS motif patterns can be significantly improved; and (ii) there is significant intra-motif dependency inside all the TFBS motifs we tested; modeling these dependencies further improves the accuracy of these TFBS motif patterns. These findings may offer new biological insights into the mechanisms of transcription factor regulation. Oxford University Press 2010-04 2010-01-06 /pmc/articles/PMC2853110/ /pubmed/20056654 http://dx.doi.org/10.1093/nar/gkp1180 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Hu, Ming
Yu, Jindan
Taylor, Jeremy M. G.
Chinnaiyan, Arul M.
Qin, Zhaohui S.
On the detection and refinement of transcription factor binding sites using ChIP-Seq data
title On the detection and refinement of transcription factor binding sites using ChIP-Seq data
title_full On the detection and refinement of transcription factor binding sites using ChIP-Seq data
title_fullStr On the detection and refinement of transcription factor binding sites using ChIP-Seq data
title_full_unstemmed On the detection and refinement of transcription factor binding sites using ChIP-Seq data
title_short On the detection and refinement of transcription factor binding sites using ChIP-Seq data
title_sort on the detection and refinement of transcription factor binding sites using chip-seq data
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2853110/
https://www.ncbi.nlm.nih.gov/pubmed/20056654
http://dx.doi.org/10.1093/nar/gkp1180
work_keys_str_mv AT huming onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata
AT yujindan onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata
AT taylorjeremymg onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata
AT chinnaiyanarulm onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata
AT qinzhaohuis onthedetectionandrefinementoftranscriptionfactorbindingsitesusingchipseqdata