Cargando…

Predicting Corynebacterium glutamicum promoters based on novel feature descriptor and feature selection technique

The promoter is an important noncoding DNA regulatory element, which combines with RNA polymerase to activate the expression of downstream genes. In industry, artificial arginine is mainly synthesized by Corynebacterium glutamicum. Replication of specific promoter regions can increase arginine produ...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, HongFei, Zhang, Jingyu, Zhao, Yuming, Yang, Wen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10018189/
https://www.ncbi.nlm.nih.gov/pubmed/36937275
http://dx.doi.org/10.3389/fmicb.2023.1141227
_version_ 1784907760404529152
author Li, HongFei
Zhang, Jingyu
Zhao, Yuming
Yang, Wen
author_facet Li, HongFei
Zhang, Jingyu
Zhao, Yuming
Yang, Wen
author_sort Li, HongFei
collection PubMed
description The promoter is an important noncoding DNA regulatory element, which combines with RNA polymerase to activate the expression of downstream genes. In industry, artificial arginine is mainly synthesized by Corynebacterium glutamicum. Replication of specific promoter regions can increase arginine production. Therefore, it is necessary to accurately locate the promoter in C. glutamicum. In the wet experiment, promoter identification depends on sigma factors and DNA splicing technology, this is a laborious job. To quickly and conveniently identify the promoters in C. glutamicum, we have developed a method based on novel feature representation and feature selection to complete this task, describing the DNA sequences through statistical parameters of multiple physicochemical properties, filtering redundant features by combining analysis of variance and hierarchical clustering, the prediction accuracy of the which is as high as 91.6%, the sensitivity of 91.9% can effectively identify promoters, and the specificity of 91.2% can accurately identify non-promoters. In addition, our model can correctly identify 181 promoters and 174 non-promoters among 400 independent samples, which proves that the developed prediction model has excellent robustness.
format Online
Article
Text
id pubmed-10018189
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100181892023-03-17 Predicting Corynebacterium glutamicum promoters based on novel feature descriptor and feature selection technique Li, HongFei Zhang, Jingyu Zhao, Yuming Yang, Wen Front Microbiol Microbiology The promoter is an important noncoding DNA regulatory element, which combines with RNA polymerase to activate the expression of downstream genes. In industry, artificial arginine is mainly synthesized by Corynebacterium glutamicum. Replication of specific promoter regions can increase arginine production. Therefore, it is necessary to accurately locate the promoter in C. glutamicum. In the wet experiment, promoter identification depends on sigma factors and DNA splicing technology, this is a laborious job. To quickly and conveniently identify the promoters in C. glutamicum, we have developed a method based on novel feature representation and feature selection to complete this task, describing the DNA sequences through statistical parameters of multiple physicochemical properties, filtering redundant features by combining analysis of variance and hierarchical clustering, the prediction accuracy of the which is as high as 91.6%, the sensitivity of 91.9% can effectively identify promoters, and the specificity of 91.2% can accurately identify non-promoters. In addition, our model can correctly identify 181 promoters and 174 non-promoters among 400 independent samples, which proves that the developed prediction model has excellent robustness. Frontiers Media S.A. 2023-03-02 /pmc/articles/PMC10018189/ /pubmed/36937275 http://dx.doi.org/10.3389/fmicb.2023.1141227 Text en Copyright © 2023 Li, Zhang, Zhao and Yang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Li, HongFei
Zhang, Jingyu
Zhao, Yuming
Yang, Wen
Predicting Corynebacterium glutamicum promoters based on novel feature descriptor and feature selection technique
title Predicting Corynebacterium glutamicum promoters based on novel feature descriptor and feature selection technique
title_full Predicting Corynebacterium glutamicum promoters based on novel feature descriptor and feature selection technique
title_fullStr Predicting Corynebacterium glutamicum promoters based on novel feature descriptor and feature selection technique
title_full_unstemmed Predicting Corynebacterium glutamicum promoters based on novel feature descriptor and feature selection technique
title_short Predicting Corynebacterium glutamicum promoters based on novel feature descriptor and feature selection technique
title_sort predicting corynebacterium glutamicum promoters based on novel feature descriptor and feature selection technique
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10018189/
https://www.ncbi.nlm.nih.gov/pubmed/36937275
http://dx.doi.org/10.3389/fmicb.2023.1141227
work_keys_str_mv AT lihongfei predictingcorynebacteriumglutamicumpromotersbasedonnovelfeaturedescriptorandfeatureselectiontechnique
AT zhangjingyu predictingcorynebacteriumglutamicumpromotersbasedonnovelfeaturedescriptorandfeatureselectiontechnique
AT zhaoyuming predictingcorynebacteriumglutamicumpromotersbasedonnovelfeaturedescriptorandfeatureselectiontechnique
AT yangwen predictingcorynebacteriumglutamicumpromotersbasedonnovelfeaturedescriptorandfeatureselectiontechnique