Cargando…

Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite

BACKGROUND: Chromatin immunoprecipitation combined with the next-generation DNA sequencing technologies (ChIP-seq) becomes a key approach for detecting genome-wide sets of genomic sites bound by proteins, such as transcription factors (TFs). Several methods and open-source tools have been developed...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xi, Zhang, Xuegong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287483/
https://www.ncbi.nlm.nih.gov/pubmed/22784574
http://dx.doi.org/10.1186/1752-0509-5-S2-S3
_version_ 1782224673482735616
author Wang, Xi
Zhang, Xuegong
author_facet Wang, Xi
Zhang, Xuegong
author_sort Wang, Xi
collection PubMed
description BACKGROUND: Chromatin immunoprecipitation combined with the next-generation DNA sequencing technologies (ChIP-seq) becomes a key approach for detecting genome-wide sets of genomic sites bound by proteins, such as transcription factors (TFs). Several methods and open-source tools have been developed to analyze ChIP-seq data. However, most of them are designed for detecting TF binding regions instead of accurately locating transcription factor binding sites (TFBSs). It is still challenging to pinpoint TFBSs directly from ChIP-seq data, especially in regions with closely spaced binding events. RESULTS: With the aim to pinpoint TFBSs at a high resolution, we propose a novel method named SeqSite, implementing a two-step strategy: detecting tag-enriched regions first and pinpointing binding sites in the detected regions. The second step is done by modeling the tag density profile, locating TFBSs on each strand with a least-squares model fitting strategy, and merging the detections from the two strands. Experiments on simulation data show that SeqSite can locate most of the binding sites more than 40-bp from each other. Applications on three human TF ChIP-seq datasets demonstrate the advantage of SeqSite for its higher resolution in pinpointing binding sites compared with existing methods. CONCLUSIONS: We have developed a computational tool named SeqSite, which can pinpoint both closely spaced and isolated binding sites, and consequently improves the resolution of TFBS detection from ChIP-seq data.
format Online
Article
Text
id pubmed-3287483
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32874832012-02-28 Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite Wang, Xi Zhang, Xuegong BMC Syst Biol Proceedings BACKGROUND: Chromatin immunoprecipitation combined with the next-generation DNA sequencing technologies (ChIP-seq) becomes a key approach for detecting genome-wide sets of genomic sites bound by proteins, such as transcription factors (TFs). Several methods and open-source tools have been developed to analyze ChIP-seq data. However, most of them are designed for detecting TF binding regions instead of accurately locating transcription factor binding sites (TFBSs). It is still challenging to pinpoint TFBSs directly from ChIP-seq data, especially in regions with closely spaced binding events. RESULTS: With the aim to pinpoint TFBSs at a high resolution, we propose a novel method named SeqSite, implementing a two-step strategy: detecting tag-enriched regions first and pinpointing binding sites in the detected regions. The second step is done by modeling the tag density profile, locating TFBSs on each strand with a least-squares model fitting strategy, and merging the detections from the two strands. Experiments on simulation data show that SeqSite can locate most of the binding sites more than 40-bp from each other. Applications on three human TF ChIP-seq datasets demonstrate the advantage of SeqSite for its higher resolution in pinpointing binding sites compared with existing methods. CONCLUSIONS: We have developed a computational tool named SeqSite, which can pinpoint both closely spaced and isolated binding sites, and consequently improves the resolution of TFBS detection from ChIP-seq data. BioMed Central 2011-12-14 /pmc/articles/PMC3287483/ /pubmed/22784574 http://dx.doi.org/10.1186/1752-0509-5-S2-S3 Text en Copyright ©2011 Wang and Zhang; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Wang, Xi
Zhang, Xuegong
Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite
title Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite
title_full Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite
title_fullStr Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite
title_full_unstemmed Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite
title_short Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite
title_sort pinpointing transcription factor binding sites from chip-seq data with seqsite
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287483/
https://www.ncbi.nlm.nih.gov/pubmed/22784574
http://dx.doi.org/10.1186/1752-0509-5-S2-S3
work_keys_str_mv AT wangxi pinpointingtranscriptionfactorbindingsitesfromchipseqdatawithseqsite
AT zhangxuegong pinpointingtranscriptionfactorbindingsitesfromchipseqdatawithseqsite