Cargando…

Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes

BACKGROUND: Our current understanding of transcription factor binding sites (TFBSs) in sequenced prokaryotic genomes is very limited due to the lack of an accurate and efficient computational method for the prediction of TFBSs at a genome scale. In an attempt to change this situation, we have recent...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Shaoqiang, Li, Shan, Pham, Phuc T, Su, Zhengchang
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2920276/
https://www.ncbi.nlm.nih.gov/pubmed/20653963
http://dx.doi.org/10.1186/1471-2105-11-397
_version_ 1782185262858633216
author Zhang, Shaoqiang
Li, Shan
Pham, Phuc T
Su, Zhengchang
author_facet Zhang, Shaoqiang
Li, Shan
Pham, Phuc T
Su, Zhengchang
author_sort Zhang, Shaoqiang
collection PubMed
description BACKGROUND: Our current understanding of transcription factor binding sites (TFBSs) in sequenced prokaryotic genomes is very limited due to the lack of an accurate and efficient computational method for the prediction of TFBSs at a genome scale. In an attempt to change this situation, we have recently developed a comparative genomics based algorithm called GLECLUBS for de novo genome-wide prediction of TFBSs in a target genome. Although GLECLUBS has achieved rather high prediction accuracy of TFBSs in a target genome, it is still not efficient enough to be applied to all the sequenced prokaryotic genomes. RESULTS: Here, we designed a new algorithm based on GLECLUBS called extended GLECLUBS (eGLECLUBS) for simultaneous prediction of TFBSs in a group of related prokaryotic genomes. When tested on a group of γ-proteobacterial genomes including E. coli K12, a group of firmicutes genomes including B. subtilis and a group of cyanobacterial genomes using the same parameter settings, eGLECLUBS predicts more than 82% of known TFBSs in extracted inter-operonic sequences in both E. coli K12 and B. subtilis. Because each genome in a group is equally treated, it is highly likely that similar prediction accuracy has been achieved for each genome in the group. CONCLUSIONS: We have developed a new algorithm for genome-wide de novo prediction of TFBSs in a group of related prokaryotic genomes. The algorithm has achieved the same level of accuracy and robustness as its predecessor GLECLUBS, but can work on dozens of genomes at the same time.
format Text
id pubmed-2920276
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29202762010-08-12 Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes Zhang, Shaoqiang Li, Shan Pham, Phuc T Su, Zhengchang BMC Bioinformatics Methodology Article BACKGROUND: Our current understanding of transcription factor binding sites (TFBSs) in sequenced prokaryotic genomes is very limited due to the lack of an accurate and efficient computational method for the prediction of TFBSs at a genome scale. In an attempt to change this situation, we have recently developed a comparative genomics based algorithm called GLECLUBS for de novo genome-wide prediction of TFBSs in a target genome. Although GLECLUBS has achieved rather high prediction accuracy of TFBSs in a target genome, it is still not efficient enough to be applied to all the sequenced prokaryotic genomes. RESULTS: Here, we designed a new algorithm based on GLECLUBS called extended GLECLUBS (eGLECLUBS) for simultaneous prediction of TFBSs in a group of related prokaryotic genomes. When tested on a group of γ-proteobacterial genomes including E. coli K12, a group of firmicutes genomes including B. subtilis and a group of cyanobacterial genomes using the same parameter settings, eGLECLUBS predicts more than 82% of known TFBSs in extracted inter-operonic sequences in both E. coli K12 and B. subtilis. Because each genome in a group is equally treated, it is highly likely that similar prediction accuracy has been achieved for each genome in the group. CONCLUSIONS: We have developed a new algorithm for genome-wide de novo prediction of TFBSs in a group of related prokaryotic genomes. The algorithm has achieved the same level of accuracy and robustness as its predecessor GLECLUBS, but can work on dozens of genomes at the same time. BioMed Central 2010-07-23 /pmc/articles/PMC2920276/ /pubmed/20653963 http://dx.doi.org/10.1186/1471-2105-11-397 Text en Copyright ©2010 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Zhang, Shaoqiang
Li, Shan
Pham, Phuc T
Su, Zhengchang
Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes
title Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes
title_full Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes
title_fullStr Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes
title_full_unstemmed Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes
title_short Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes
title_sort simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2920276/
https://www.ncbi.nlm.nih.gov/pubmed/20653963
http://dx.doi.org/10.1186/1471-2105-11-397
work_keys_str_mv AT zhangshaoqiang simultaneouspredictionoftranscriptionfactorbindingsitesinagroupofprokaryoticgenomes
AT lishan simultaneouspredictionoftranscriptionfactorbindingsitesinagroupofprokaryoticgenomes
AT phamphuct simultaneouspredictionoftranscriptionfactorbindingsitesinagroupofprokaryoticgenomes
AT suzhengchang simultaneouspredictionoftranscriptionfactorbindingsitesinagroupofprokaryoticgenomes