Cargando…

CTF: a CRF-based transcription factor binding sites finding system

BACKGROUND: Identifying the location of transcription factor bindings is crucial to understand transcriptional regulation. Currently, Chromatin Immunoprecipitation followed with high-throughput Sequencing (ChIP-seq) is able to locate the transcription factor binding sites (TFBSs) accurately in high...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Yupeng, Zhang, Yizhe, Zheng, Guangyong, Wei, Chaochun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3535700/
https://www.ncbi.nlm.nih.gov/pubmed/23282203
http://dx.doi.org/10.1186/1471-2164-13-S8-S18
_version_ 1782254700060475392
author He, Yupeng
Zhang, Yizhe
Zheng, Guangyong
Wei, Chaochun
author_facet He, Yupeng
Zhang, Yizhe
Zheng, Guangyong
Wei, Chaochun
author_sort He, Yupeng
collection PubMed
description BACKGROUND: Identifying the location of transcription factor bindings is crucial to understand transcriptional regulation. Currently, Chromatin Immunoprecipitation followed with high-throughput Sequencing (ChIP-seq) is able to locate the transcription factor binding sites (TFBSs) accurately in high throughput and it has become the gold-standard method for TFBS finding experimentally. However, due to its high cost, it is impractical to apply the method in a very large scale. Considering the large number of transcription factors, numerous cell types and various conditions, computational methods are still very valuable to accurate TFBS identification. RESULTS: In this paper, we proposed a novel integrated TFBS prediction system, CTF, based on Conditional Random Fields (CRFs). Integrating information from different sources, CTF was able to capture patterns of TFBSs contained in different features (sequence, chromatin and etc) and predicted the TFBS locations with a high accuracy. We compared CTF with several existing tools as well as the PWM baseline method on a dataset generated by ChIP-seq experiments (TFBSs of 13 transcription factors in mouse genome). Results showed that CTF performed significantly better than existing methods tested. CONCLUSIONS: CTF is a powerful tool to predict TFBSs by integrating high throughput data and different features. It can be a useful complement to ChIP-seq and other experimental methods for TFBS identification and thus improve our ability to investigate functional elements in post-genomic era. Availability: CTF is freely available to academic users at: http://cbb.sjtu.edu.cn/~ccwei/pub/software/CTF/CTF.php
format Online
Article
Text
id pubmed-3535700
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35357002013-01-04 CTF: a CRF-based transcription factor binding sites finding system He, Yupeng Zhang, Yizhe Zheng, Guangyong Wei, Chaochun BMC Genomics Introduction BACKGROUND: Identifying the location of transcription factor bindings is crucial to understand transcriptional regulation. Currently, Chromatin Immunoprecipitation followed with high-throughput Sequencing (ChIP-seq) is able to locate the transcription factor binding sites (TFBSs) accurately in high throughput and it has become the gold-standard method for TFBS finding experimentally. However, due to its high cost, it is impractical to apply the method in a very large scale. Considering the large number of transcription factors, numerous cell types and various conditions, computational methods are still very valuable to accurate TFBS identification. RESULTS: In this paper, we proposed a novel integrated TFBS prediction system, CTF, based on Conditional Random Fields (CRFs). Integrating information from different sources, CTF was able to capture patterns of TFBSs contained in different features (sequence, chromatin and etc) and predicted the TFBS locations with a high accuracy. We compared CTF with several existing tools as well as the PWM baseline method on a dataset generated by ChIP-seq experiments (TFBSs of 13 transcription factors in mouse genome). Results showed that CTF performed significantly better than existing methods tested. CONCLUSIONS: CTF is a powerful tool to predict TFBSs by integrating high throughput data and different features. It can be a useful complement to ChIP-seq and other experimental methods for TFBS identification and thus improve our ability to investigate functional elements in post-genomic era. Availability: CTF is freely available to academic users at: http://cbb.sjtu.edu.cn/~ccwei/pub/software/CTF/CTF.php BioMed Central 2012-12-17 /pmc/articles/PMC3535700/ /pubmed/23282203 http://dx.doi.org/10.1186/1471-2164-13-S8-S18 Text en Copyright © 2012 He et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Introduction
He, Yupeng
Zhang, Yizhe
Zheng, Guangyong
Wei, Chaochun
CTF: a CRF-based transcription factor binding sites finding system
title CTF: a CRF-based transcription factor binding sites finding system
title_full CTF: a CRF-based transcription factor binding sites finding system
title_fullStr CTF: a CRF-based transcription factor binding sites finding system
title_full_unstemmed CTF: a CRF-based transcription factor binding sites finding system
title_short CTF: a CRF-based transcription factor binding sites finding system
title_sort ctf: a crf-based transcription factor binding sites finding system
topic Introduction
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3535700/
https://www.ncbi.nlm.nih.gov/pubmed/23282203
http://dx.doi.org/10.1186/1471-2164-13-S8-S18
work_keys_str_mv AT heyupeng ctfacrfbasedtranscriptionfactorbindingsitesfindingsystem
AT zhangyizhe ctfacrfbasedtranscriptionfactorbindingsitesfindingsystem
AT zhengguangyong ctfacrfbasedtranscriptionfactorbindingsitesfindingsystem
AT weichaochun ctfacrfbasedtranscriptionfactorbindingsitesfindingsystem