Cargando…

PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model

Promoter identification is a fundamental step in understanding bacterial gene regulation mechanisms. However, accurate and fast classification of bacterial promoters continues to be challenging. New methods based on deep convolutional networks have been applied to identify and classify bacterial pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Hernández, Daryl, Jara, Nicolás, Araya, Mauricio, Durán, Roberto E., Buil-Aranda, Carlos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9325283/
https://www.ncbi.nlm.nih.gov/pubmed/35885909
http://dx.doi.org/10.3390/genes13071126
_version_ 1784757011937755136
author Hernández, Daryl
Jara, Nicolás
Araya, Mauricio
Durán, Roberto E.
Buil-Aranda, Carlos
author_facet Hernández, Daryl
Jara, Nicolás
Araya, Mauricio
Durán, Roberto E.
Buil-Aranda, Carlos
author_sort Hernández, Daryl
collection PubMed
description Promoter identification is a fundamental step in understanding bacterial gene regulation mechanisms. However, accurate and fast classification of bacterial promoters continues to be challenging. New methods based on deep convolutional networks have been applied to identify and classify bacterial promoters recognized by sigma ([Formula: see text]) factors and RNA polymerase subunits which increase affinity to specific DNA sequences to modulate transcription and respond to nutritional or environmental changes. This work presents a new multiclass promoter prediction model by using convolutional neural networks (CNNs), denoted as PromoterLCNN, which classifies Escherichia coli promoters into subclasses [Formula: see text] , [Formula: see text] , [Formula: see text] , [Formula: see text] , [Formula: see text] , and [Formula: see text]. We present a light, fast, and simple two-stage multiclass CNN architecture for promoter identification and classification. Training and testing were performed on a benchmark dataset, part of RegulonDB. Comparative performance of PromoterLCNN against other CNN-based classifiers using four parameters (Acc, Sn, Sp, MCC) resulted in similar or better performance than those that commonly use cascade architecture, reducing time by approximately 30–90% for training, prediction, and hyperparameter optimization without compromising classification quality.
format Online
Article
Text
id pubmed-9325283
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93252832022-07-27 PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model Hernández, Daryl Jara, Nicolás Araya, Mauricio Durán, Roberto E. Buil-Aranda, Carlos Genes (Basel) Article Promoter identification is a fundamental step in understanding bacterial gene regulation mechanisms. However, accurate and fast classification of bacterial promoters continues to be challenging. New methods based on deep convolutional networks have been applied to identify and classify bacterial promoters recognized by sigma ([Formula: see text]) factors and RNA polymerase subunits which increase affinity to specific DNA sequences to modulate transcription and respond to nutritional or environmental changes. This work presents a new multiclass promoter prediction model by using convolutional neural networks (CNNs), denoted as PromoterLCNN, which classifies Escherichia coli promoters into subclasses [Formula: see text] , [Formula: see text] , [Formula: see text] , [Formula: see text] , [Formula: see text] , and [Formula: see text]. We present a light, fast, and simple two-stage multiclass CNN architecture for promoter identification and classification. Training and testing were performed on a benchmark dataset, part of RegulonDB. Comparative performance of PromoterLCNN against other CNN-based classifiers using four parameters (Acc, Sn, Sp, MCC) resulted in similar or better performance than those that commonly use cascade architecture, reducing time by approximately 30–90% for training, prediction, and hyperparameter optimization without compromising classification quality. MDPI 2022-06-23 /pmc/articles/PMC9325283/ /pubmed/35885909 http://dx.doi.org/10.3390/genes13071126 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hernández, Daryl
Jara, Nicolás
Araya, Mauricio
Durán, Roberto E.
Buil-Aranda, Carlos
PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model
title PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model
title_full PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model
title_fullStr PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model
title_full_unstemmed PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model
title_short PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model
title_sort promoterlcnn: a light cnn-based promoter prediction and classification model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9325283/
https://www.ncbi.nlm.nih.gov/pubmed/35885909
http://dx.doi.org/10.3390/genes13071126
work_keys_str_mv AT hernandezdaryl promoterlcnnalightcnnbasedpromoterpredictionandclassificationmodel
AT jaranicolas promoterlcnnalightcnnbasedpromoterpredictionandclassificationmodel
AT arayamauricio promoterlcnnalightcnnbasedpromoterpredictionandclassificationmodel
AT duranrobertoe promoterlcnnalightcnnbasedpromoterpredictionandclassificationmodel
AT builarandacarlos promoterlcnnalightcnnbasedpromoterpredictionandclassificationmodel