Cargando…

Recognition of prokaryotic promoters based on a novel variable-window Z-curve method

Transcription is the first step in gene expression, and it is the step at which most of the regulation of expression occurs. Although sequenced prokaryotic genomes provide a wealth of information, transcriptional regulatory networks are still poorly understood using the available genomic information...

Descripción completa

Detalles Bibliográficos
Autor principal: Song, Kai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3273801/
https://www.ncbi.nlm.nih.gov/pubmed/21954440
http://dx.doi.org/10.1093/nar/gkr795
_version_ 1782222965565292544
author Song, Kai
author_facet Song, Kai
author_sort Song, Kai
collection PubMed
description Transcription is the first step in gene expression, and it is the step at which most of the regulation of expression occurs. Although sequenced prokaryotic genomes provide a wealth of information, transcriptional regulatory networks are still poorly understood using the available genomic information, largely because accurate prediction of promoters is difficult. To improve promoter recognition performance, a novel variable-window Z-curve method is developed to extract general features of prokaryotic promoters. The features are used for further classification by the partial least squares technique. To verify the prediction performance, the proposed method is applied to predict promoter fragments of two representative prokaryotic model organisms (Escherichia coli and Bacillus subtilis). Depending on the feature extraction and selection power of the proposed method, the promoter prediction accuracies are improved markedly over most existing approaches: for E. coli, the accuracies are 96.05% (σ(70) promoters, coding negative samples), 90.44% (σ(70) promoters, non-coding negative samples), 92.13% (known sigma-factor promoters, coding negative samples), 92.50% (known sigma-factor promoters, non-coding negative samples), respectively; for B. subtilis, the accuracies are 95.83% (known sigma-factor promoters, coding negative samples) and 99.09% (known sigma-factor promoters, non-coding negative samples). Additionally, being a linear technique, the computational simplicity of the proposed method makes it easy to run in a matter of minutes on ordinary personal computers or even laptops. More importantly, there is no need to optimize parameters, so it is very practical for predicting other species promoters without any prior knowledge or prior information of the statistical properties of the samples.
format Online
Article
Text
id pubmed-3273801
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-32738012012-02-07 Recognition of prokaryotic promoters based on a novel variable-window Z-curve method Song, Kai Nucleic Acids Res Computational Biology Transcription is the first step in gene expression, and it is the step at which most of the regulation of expression occurs. Although sequenced prokaryotic genomes provide a wealth of information, transcriptional regulatory networks are still poorly understood using the available genomic information, largely because accurate prediction of promoters is difficult. To improve promoter recognition performance, a novel variable-window Z-curve method is developed to extract general features of prokaryotic promoters. The features are used for further classification by the partial least squares technique. To verify the prediction performance, the proposed method is applied to predict promoter fragments of two representative prokaryotic model organisms (Escherichia coli and Bacillus subtilis). Depending on the feature extraction and selection power of the proposed method, the promoter prediction accuracies are improved markedly over most existing approaches: for E. coli, the accuracies are 96.05% (σ(70) promoters, coding negative samples), 90.44% (σ(70) promoters, non-coding negative samples), 92.13% (known sigma-factor promoters, coding negative samples), 92.50% (known sigma-factor promoters, non-coding negative samples), respectively; for B. subtilis, the accuracies are 95.83% (known sigma-factor promoters, coding negative samples) and 99.09% (known sigma-factor promoters, non-coding negative samples). Additionally, being a linear technique, the computational simplicity of the proposed method makes it easy to run in a matter of minutes on ordinary personal computers or even laptops. More importantly, there is no need to optimize parameters, so it is very practical for predicting other species promoters without any prior knowledge or prior information of the statistical properties of the samples. Oxford University Press 2012-02 2011-09-27 /pmc/articles/PMC3273801/ /pubmed/21954440 http://dx.doi.org/10.1093/nar/gkr795 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Song, Kai
Recognition of prokaryotic promoters based on a novel variable-window Z-curve method
title Recognition of prokaryotic promoters based on a novel variable-window Z-curve method
title_full Recognition of prokaryotic promoters based on a novel variable-window Z-curve method
title_fullStr Recognition of prokaryotic promoters based on a novel variable-window Z-curve method
title_full_unstemmed Recognition of prokaryotic promoters based on a novel variable-window Z-curve method
title_short Recognition of prokaryotic promoters based on a novel variable-window Z-curve method
title_sort recognition of prokaryotic promoters based on a novel variable-window z-curve method
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3273801/
https://www.ncbi.nlm.nih.gov/pubmed/21954440
http://dx.doi.org/10.1093/nar/gkr795
work_keys_str_mv AT songkai recognitionofprokaryoticpromotersbasedonanovelvariablewindowzcurvemethod