Cargando…

Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs

BACKGROUND: In bacteria, sigma factors and other transcriptional regulatory proteins recognize DNA patterns upstream of their target genes and interact with RNA polymerase to control transcription. As a consequence of evolution, DNA sequences recognized by transcription factors are thought to be enr...

Descripción completa

Detalles Bibliográficos
Autores principales: Jacques, Pierre-Étienne, Rodrigue, Sébastien, Gaudreau, Luc, Goulet, Jean, Brzezinski, Ryszard
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1615881/
https://www.ncbi.nlm.nih.gov/pubmed/17014715
http://dx.doi.org/10.1186/1471-2105-7-423
_version_ 1782130492583182336
author Jacques, Pierre-Étienne
Rodrigue, Sébastien
Gaudreau, Luc
Goulet, Jean
Brzezinski, Ryszard
author_facet Jacques, Pierre-Étienne
Rodrigue, Sébastien
Gaudreau, Luc
Goulet, Jean
Brzezinski, Ryszard
author_sort Jacques, Pierre-Étienne
collection PubMed
description BACKGROUND: In bacteria, sigma factors and other transcriptional regulatory proteins recognize DNA patterns upstream of their target genes and interact with RNA polymerase to control transcription. As a consequence of evolution, DNA sequences recognized by transcription factors are thought to be enriched in intergenic regions (IRs) and depleted from coding regions of prokaryotic genomes. RESULTS: In this work, we report that genomic distribution of transcription factors binding sites is biased towards IRs, and that this bias is conserved amongst bacterial species. We further take advantage of this observation to develop an algorithm that can efficiently identify promoter boxes by a distribution-dependent approach rather than a direct sequence comparison approach. This strategy, which can easily be combined with other methodologies, allowed the identification of promoter sequences in ten species and can be used with any annotated bacterial genome, with results that rival with current methodologies. Experimental validations of predicted promoters also support our approach. CONCLUSION: Considering that complete genomic sequences of over 1000 bacteria will soon be available and that little transcriptional information is available for most of them, our algorithm constitutes a promising tool for the prediction of promoter sequences. Importantly, our methodology could also be adapted to identify DNA sequences recognized by other regulatory proteins.
format Text
id pubmed-1615881
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16158812006-10-18 Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs Jacques, Pierre-Étienne Rodrigue, Sébastien Gaudreau, Luc Goulet, Jean Brzezinski, Ryszard BMC Bioinformatics Research Article BACKGROUND: In bacteria, sigma factors and other transcriptional regulatory proteins recognize DNA patterns upstream of their target genes and interact with RNA polymerase to control transcription. As a consequence of evolution, DNA sequences recognized by transcription factors are thought to be enriched in intergenic regions (IRs) and depleted from coding regions of prokaryotic genomes. RESULTS: In this work, we report that genomic distribution of transcription factors binding sites is biased towards IRs, and that this bias is conserved amongst bacterial species. We further take advantage of this observation to develop an algorithm that can efficiently identify promoter boxes by a distribution-dependent approach rather than a direct sequence comparison approach. This strategy, which can easily be combined with other methodologies, allowed the identification of promoter sequences in ten species and can be used with any annotated bacterial genome, with results that rival with current methodologies. Experimental validations of predicted promoters also support our approach. CONCLUSION: Considering that complete genomic sequences of over 1000 bacteria will soon be available and that little transcriptional information is available for most of them, our algorithm constitutes a promising tool for the prediction of promoter sequences. Importantly, our methodology could also be adapted to identify DNA sequences recognized by other regulatory proteins. BioMed Central 2006-10-02 /pmc/articles/PMC1615881/ /pubmed/17014715 http://dx.doi.org/10.1186/1471-2105-7-423 Text en Copyright © 2006 Jacques et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jacques, Pierre-Étienne
Rodrigue, Sébastien
Gaudreau, Luc
Goulet, Jean
Brzezinski, Ryszard
Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs
title Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs
title_full Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs
title_fullStr Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs
title_full_unstemmed Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs
title_short Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs
title_sort detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1615881/
https://www.ncbi.nlm.nih.gov/pubmed/17014715
http://dx.doi.org/10.1186/1471-2105-7-423
work_keys_str_mv AT jacquespierreetienne detectionofprokaryoticpromotersfromthegenomicdistributionofhexanucleotidepairs
AT rodriguesebastien detectionofprokaryoticpromotersfromthegenomicdistributionofhexanucleotidepairs
AT gaudreauluc detectionofprokaryoticpromotersfromthegenomicdistributionofhexanucleotidepairs
AT gouletjean detectionofprokaryoticpromotersfromthegenomicdistributionofhexanucleotidepairs
AT brzezinskiryszard detectionofprokaryoticpromotersfromthegenomicdistributionofhexanucleotidepairs