Cargando…

Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains

Sigma70 factor plays a crucial role in prokaryotes and regulates the transcription of most of the housekeeping genes. One of the major challenges is to predict the sigma70 promoter or sigma70 factor binding site with high precision. In this study, we trained and evaluate our models on a dataset cons...

Descripción completa

Detalles Bibliográficos
Autores principales: Patiyal, Sumeet, Singh, Nitindeep, Ali, Mohd Zartab, Pundir, Dhawal Singh, Raghava, Gajendra P. S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9701712/
https://www.ncbi.nlm.nih.gov/pubmed/36452927
http://dx.doi.org/10.3389/fmicb.2022.1042127
_version_ 1784839595586748416
author Patiyal, Sumeet
Singh, Nitindeep
Ali, Mohd Zartab
Pundir, Dhawal Singh
Raghava, Gajendra P. S.
author_facet Patiyal, Sumeet
Singh, Nitindeep
Ali, Mohd Zartab
Pundir, Dhawal Singh
Raghava, Gajendra P. S.
author_sort Patiyal, Sumeet
collection PubMed
description Sigma70 factor plays a crucial role in prokaryotes and regulates the transcription of most of the housekeeping genes. One of the major challenges is to predict the sigma70 promoter or sigma70 factor binding site with high precision. In this study, we trained and evaluate our models on a dataset consists of 741 sigma70 promoters and 1,400 non-promoters. We have generated a wide range of features around 8,000, which includes Dinucleotide Auto-Correlation, Dinucleotide Cross-Correlation, Dinucleotide Auto Cross-Correlation, Moran Auto-Correlation, Normalized Moreau-Broto Auto-Correlation, Parallel Correlation Pseudo Tri-Nucleotide Composition, etc. Our SVM based model achieved maximum accuracy 97.38% with AUROC 0.99 on training dataset, using 200 most relevant features. In order to check the robustness of the model, we have tested our model on the independent dataset made by using RegulonDB10.8, which included 1,134 sigma70 and 638 non-promoters, and able to achieve accuracy of 90.41% with AUROC of 0.95. Our model successfully predicted constitutive promoters with accuracy of 81.46% on an independent dataset. We have developed a method, Sigma70Pred, which is available as webserver and standalone packages at https://webs.iiitd.edu.in/raghava/sigma70pred/. The services are freely accessible.
format Online
Article
Text
id pubmed-9701712
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97017122022-11-29 Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains Patiyal, Sumeet Singh, Nitindeep Ali, Mohd Zartab Pundir, Dhawal Singh Raghava, Gajendra P. S. Front Microbiol Microbiology Sigma70 factor plays a crucial role in prokaryotes and regulates the transcription of most of the housekeeping genes. One of the major challenges is to predict the sigma70 promoter or sigma70 factor binding site with high precision. In this study, we trained and evaluate our models on a dataset consists of 741 sigma70 promoters and 1,400 non-promoters. We have generated a wide range of features around 8,000, which includes Dinucleotide Auto-Correlation, Dinucleotide Cross-Correlation, Dinucleotide Auto Cross-Correlation, Moran Auto-Correlation, Normalized Moreau-Broto Auto-Correlation, Parallel Correlation Pseudo Tri-Nucleotide Composition, etc. Our SVM based model achieved maximum accuracy 97.38% with AUROC 0.99 on training dataset, using 200 most relevant features. In order to check the robustness of the model, we have tested our model on the independent dataset made by using RegulonDB10.8, which included 1,134 sigma70 and 638 non-promoters, and able to achieve accuracy of 90.41% with AUROC of 0.95. Our model successfully predicted constitutive promoters with accuracy of 81.46% on an independent dataset. We have developed a method, Sigma70Pred, which is available as webserver and standalone packages at https://webs.iiitd.edu.in/raghava/sigma70pred/. The services are freely accessible. Frontiers Media S.A. 2022-11-14 /pmc/articles/PMC9701712/ /pubmed/36452927 http://dx.doi.org/10.3389/fmicb.2022.1042127 Text en Copyright © 2022 Patiyal, Singh, Ali, Pundir and Raghava. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Patiyal, Sumeet
Singh, Nitindeep
Ali, Mohd Zartab
Pundir, Dhawal Singh
Raghava, Gajendra P. S.
Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains
title Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains
title_full Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains
title_fullStr Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains
title_full_unstemmed Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains
title_short Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains
title_sort sigma70pred: a highly accurate method for predicting sigma70 promoter in escherichia coli k-12 strains
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9701712/
https://www.ncbi.nlm.nih.gov/pubmed/36452927
http://dx.doi.org/10.3389/fmicb.2022.1042127
work_keys_str_mv AT patiyalsumeet sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains
AT singhnitindeep sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains
AT alimohdzartab sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains
AT pundirdhawalsingh sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains
AT raghavagajendraps sigma70predahighlyaccuratemethodforpredictingsigma70promoterinescherichiacolik12strains