Cargando…
Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains
Sigma70 factor plays a crucial role in prokaryotes and regulates the transcription of most of the housekeeping genes. One of the major challenges is to predict the sigma70 promoter or sigma70 factor binding site with high precision. In this study, we trained and evaluate our models on a dataset cons...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9701712/ https://www.ncbi.nlm.nih.gov/pubmed/36452927 http://dx.doi.org/10.3389/fmicb.2022.1042127 |
_version_ | 1784839595586748416 |
---|---|
author | Patiyal, Sumeet Singh, Nitindeep Ali, Mohd Zartab Pundir, Dhawal Singh Raghava, Gajendra P. S. |
author_facet | Patiyal, Sumeet Singh, Nitindeep Ali, Mohd Zartab Pundir, Dhawal Singh Raghava, Gajendra P. S. |
author_sort | Patiyal, Sumeet |
collection | PubMed |
description | Sigma70 factor plays a crucial role in prokaryotes and regulates the transcription of most of the housekeeping genes. One of the major challenges is to predict the sigma70 promoter or sigma70 factor binding site with high precision. In this study, we trained and evaluate our models on a dataset consists of 741 sigma70 promoters and 1,400 non-promoters. We have generated a wide range of features around 8,000, which includes Dinucleotide Auto-Correlation, Dinucleotide Cross-Correlation, Dinucleotide Auto Cross-Correlation, Moran Auto-Correlation, Normalized Moreau-Broto Auto-Correlation, Parallel Correlation Pseudo Tri-Nucleotide Composition, etc. Our SVM based model achieved maximum accuracy 97.38% with AUROC 0.99 on training dataset, using 200 most relevant features. In order to check the robustness of the model, we have tested our model on the independent dataset made by using RegulonDB10.8, which included 1,134 sigma70 and 638 non-promoters, and able to achieve accuracy of 90.41% with AUROC of 0.95. Our model successfully predicted constitutive promoters with accuracy of 81.46% on an independent dataset. We have developed a method, Sigma70Pred, which is available as webserver and standalone packages at https://webs.iiitd.edu.in/raghava/sigma70pred/. The services are freely accessible. |
format | Online Article Text |
id | pubmed-9701712 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97017122022-11-29 Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains Patiyal, Sumeet Singh, Nitindeep Ali, Mohd Zartab Pundir, Dhawal Singh Raghava, Gajendra P. S. Front Microbiol Microbiology Sigma70 factor plays a crucial role in prokaryotes and regulates the transcription of most of the housekeeping genes. One of the major challenges is to predict the sigma70 promoter or sigma70 factor binding site with high precision. In this study, we trained and evaluate our models on a dataset consists of 741 sigma70 promoters and 1,400 non-promoters. We have generated a wide range of features around 8,000, which includes Dinucleotide Auto-Correlation, Dinucleotide Cross-Correlation, Dinucleotide Auto Cross-Correlation, Moran Auto-Correlation, Normalized Moreau-Broto Auto-Correlation, Parallel Correlation Pseudo Tri-Nucleotide Composition, etc. Our SVM based model achieved maximum accuracy 97.38% with AUROC 0.99 on training dataset, using 200 most relevant features. In order to check the robustness of the model, we have tested our model on the independent dataset made by using RegulonDB10.8, which included 1,134 sigma70 and 638 non-promoters, and able to achieve accuracy of 90.41% with AUROC of 0.95. Our model successfully predicted constitutive promoters with accuracy of 81.46% on an independent dataset. We have developed a method, Sigma70Pred, which is available as webserver and standalone packages at https://webs.iiitd.edu.in/raghava/sigma70pred/. The services are freely accessible. Frontiers Media S.A. 2022-11-14 /pmc/articles/PMC9701712/ /pubmed/36452927 http://dx.doi.org/10.3389/fmicb.2022.1042127 Text en Copyright © 2022 Patiyal, Singh, Ali, Pundir and Raghava. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology Patiyal, Sumeet Singh, Nitindeep Ali, Mohd Zartab Pundir, Dhawal Singh Raghava, Gajendra P. S. Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains |
title | Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains |
title_full | Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains |
title_fullStr | Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains |
title_full_unstemmed | Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains |
title_short | Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains |
title_sort | sigma70pred: a highly accurate method for predicting sigma70 promoter in escherichia coli k-12 strains |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9701712/ https://www.ncbi.nlm.nih.gov/pubmed/36452927 http://dx.doi.org/10.3389/fmicb.2022.1042127 |
work_keys_str_mv | AT patiyalsumeet sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains AT singhnitindeep sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains AT alimohdzartab sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains AT pundirdhawalsingh sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains AT raghavagajendraps sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains |