Cargando…

An iterative strategy combining biophysical criteria and duration hidden Markov models for structural predictions of Chlamydia trachomatis σ(66 )promoters

BACKGROUND: Promoter identification is a first step in the quest to explain gene regulation in bacteria. It has been demonstrated that the initiation of bacterial transcription depends upon the stability and topology of DNA in the promoter region as well as the binding affinity between the RNA polym...

Descripción completa

Detalles Bibliográficos
Autores principales: Mallios, Ronna R, Ojcius, David M, Ardell, David H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2743672/
https://www.ncbi.nlm.nih.gov/pubmed/19715597
http://dx.doi.org/10.1186/1471-2105-10-271
_version_ 1782171873830764544
author Mallios, Ronna R
Ojcius, David M
Ardell, David H
author_facet Mallios, Ronna R
Ojcius, David M
Ardell, David H
author_sort Mallios, Ronna R
collection PubMed
description BACKGROUND: Promoter identification is a first step in the quest to explain gene regulation in bacteria. It has been demonstrated that the initiation of bacterial transcription depends upon the stability and topology of DNA in the promoter region as well as the binding affinity between the RNA polymerase σ-factor and promoter. However, promoter prediction algorithms to date have not explicitly used an ensemble of these factors as predictors. In addition, most promoter models have been trained on data from Escherichia coli. Although it has been shown that transcriptional mechanisms are similar among various bacteria, it is quite possible that the differences between Escherichia coli and Chlamydia trachomatis are large enough to recommend an organism-specific modeling effort. RESULTS: Here we present an iterative stochastic model building procedure that combines such biophysical metrics as DNA stability, curvature, twist and stress-induced DNA duplex destabilization along with duration hidden Markov model parameters to model Chlamydia trachomatis σ(66 )promoters from 29 experimentally verified sequences. Initially, iterative duration hidden Markov modeling of the training set sequences provides a scoring algorithm for Chlamydia trachomatis RNA polymerase σ(66)/DNA binding. Subsequently, an iterative application of Stepwise Binary Logistic Regression selects multiple promoter predictors and deletes/replaces training set sequences to determine an optimal training set. The resulting model predicts the final training set with a high degree of accuracy and provides insights into the structure of the promoter region. Model based genome-wide predictions are provided so that optimal promoter candidates can be experimentally evaluated, and refined models developed. Co-predictions with three other algorithms are also supplied to enhance reliability. CONCLUSION: This strategy and resulting model support the conjecture that DNA biophysical properties, along with RNA polymerase σ-factor/DNA binding collaboratively, contribute to a sequence's ability to promote transcription. This work provides a baseline model that can evolve as new Chlamydia trachomatis σ(66 )promoters are identified with assistance from the provided genome-wide predictions. The proposed methodology is ideal for organisms with few identified promoters and relatively small genomes.
format Text
id pubmed-2743672
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27436722009-09-15 An iterative strategy combining biophysical criteria and duration hidden Markov models for structural predictions of Chlamydia trachomatis σ(66 )promoters Mallios, Ronna R Ojcius, David M Ardell, David H BMC Bioinformatics Research Article BACKGROUND: Promoter identification is a first step in the quest to explain gene regulation in bacteria. It has been demonstrated that the initiation of bacterial transcription depends upon the stability and topology of DNA in the promoter region as well as the binding affinity between the RNA polymerase σ-factor and promoter. However, promoter prediction algorithms to date have not explicitly used an ensemble of these factors as predictors. In addition, most promoter models have been trained on data from Escherichia coli. Although it has been shown that transcriptional mechanisms are similar among various bacteria, it is quite possible that the differences between Escherichia coli and Chlamydia trachomatis are large enough to recommend an organism-specific modeling effort. RESULTS: Here we present an iterative stochastic model building procedure that combines such biophysical metrics as DNA stability, curvature, twist and stress-induced DNA duplex destabilization along with duration hidden Markov model parameters to model Chlamydia trachomatis σ(66 )promoters from 29 experimentally verified sequences. Initially, iterative duration hidden Markov modeling of the training set sequences provides a scoring algorithm for Chlamydia trachomatis RNA polymerase σ(66)/DNA binding. Subsequently, an iterative application of Stepwise Binary Logistic Regression selects multiple promoter predictors and deletes/replaces training set sequences to determine an optimal training set. The resulting model predicts the final training set with a high degree of accuracy and provides insights into the structure of the promoter region. Model based genome-wide predictions are provided so that optimal promoter candidates can be experimentally evaluated, and refined models developed. Co-predictions with three other algorithms are also supplied to enhance reliability. CONCLUSION: This strategy and resulting model support the conjecture that DNA biophysical properties, along with RNA polymerase σ-factor/DNA binding collaboratively, contribute to a sequence's ability to promote transcription. This work provides a baseline model that can evolve as new Chlamydia trachomatis σ(66 )promoters are identified with assistance from the provided genome-wide predictions. The proposed methodology is ideal for organisms with few identified promoters and relatively small genomes. BioMed Central 2009-08-28 /pmc/articles/PMC2743672/ /pubmed/19715597 http://dx.doi.org/10.1186/1471-2105-10-271 Text en Copyright © 2009 Mallios et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mallios, Ronna R
Ojcius, David M
Ardell, David H
An iterative strategy combining biophysical criteria and duration hidden Markov models for structural predictions of Chlamydia trachomatis σ(66 )promoters
title An iterative strategy combining biophysical criteria and duration hidden Markov models for structural predictions of Chlamydia trachomatis σ(66 )promoters
title_full An iterative strategy combining biophysical criteria and duration hidden Markov models for structural predictions of Chlamydia trachomatis σ(66 )promoters
title_fullStr An iterative strategy combining biophysical criteria and duration hidden Markov models for structural predictions of Chlamydia trachomatis σ(66 )promoters
title_full_unstemmed An iterative strategy combining biophysical criteria and duration hidden Markov models for structural predictions of Chlamydia trachomatis σ(66 )promoters
title_short An iterative strategy combining biophysical criteria and duration hidden Markov models for structural predictions of Chlamydia trachomatis σ(66 )promoters
title_sort iterative strategy combining biophysical criteria and duration hidden markov models for structural predictions of chlamydia trachomatis σ(66 )promoters
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2743672/
https://www.ncbi.nlm.nih.gov/pubmed/19715597
http://dx.doi.org/10.1186/1471-2105-10-271
work_keys_str_mv AT malliosronnar aniterativestrategycombiningbiophysicalcriteriaanddurationhiddenmarkovmodelsforstructuralpredictionsofchlamydiatrachomatiss66promoters
AT ojciusdavidm aniterativestrategycombiningbiophysicalcriteriaanddurationhiddenmarkovmodelsforstructuralpredictionsofchlamydiatrachomatiss66promoters
AT ardelldavidh aniterativestrategycombiningbiophysicalcriteriaanddurationhiddenmarkovmodelsforstructuralpredictionsofchlamydiatrachomatiss66promoters
AT malliosronnar iterativestrategycombiningbiophysicalcriteriaanddurationhiddenmarkovmodelsforstructuralpredictionsofchlamydiatrachomatiss66promoters
AT ojciusdavidm iterativestrategycombiningbiophysicalcriteriaanddurationhiddenmarkovmodelsforstructuralpredictionsofchlamydiatrachomatiss66promoters
AT ardelldavidh iterativestrategycombiningbiophysicalcriteriaanddurationhiddenmarkovmodelsforstructuralpredictionsofchlamydiatrachomatiss66promoters