Cargando…

iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network

A promoter is a short DNA sequence near to the start codon, responsible for initiating transcription of a specific gene in genome. The accurate recognition of promoters has great significance for a better understanding of the transcriptional regulation. Because of their importance in the process of...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Ang, Xiao, Xuan, Xu, Zhaochun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7803414/
https://www.ncbi.nlm.nih.gov/pubmed/33488763
http://dx.doi.org/10.1155/2021/6636350
_version_ 1783635932808740864
author Sun, Ang
Xiao, Xuan
Xu, Zhaochun
author_facet Sun, Ang
Xiao, Xuan
Xu, Zhaochun
author_sort Sun, Ang
collection PubMed
description A promoter is a short DNA sequence near to the start codon, responsible for initiating transcription of a specific gene in genome. The accurate recognition of promoters has great significance for a better understanding of the transcriptional regulation. Because of their importance in the process of biological transcriptional regulation, there is an urgent need to develop in silico tools to identify promoters and their types timely and accurately. A number of prediction methods had been developed in this regard; however, almost all of them were merely used for identifying promoters and their strength or sigma types. Owing to that TATA box region in TATA promoter that influences posttranscriptional processes, in the current study, we developed a two-layer predictor called iPTT(2L)-CNN by using the convolutional neural network (CNN) for identifying TATA and TATA-less promoters. The first layer can be used to identify a given DNA sequence as a promoter or nonpromoter. The second layer is used to identify whether the recognized promoter is TATA promoter or not. The 5-fold crossvalidation and independent testing results demonstrate that the constructed predictor is promising for identifying promoter and classifying TATA and TATA-less promoter. Furthermore, to make it easier for most experimental scientists get the results they need, a user-friendly web server has been established at http://www.jci-bioinfo.cn/iPPT(2L)-CNN.
format Online
Article
Text
id pubmed-7803414
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-78034142021-01-22 iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network Sun, Ang Xiao, Xuan Xu, Zhaochun Comput Math Methods Med Research Article A promoter is a short DNA sequence near to the start codon, responsible for initiating transcription of a specific gene in genome. The accurate recognition of promoters has great significance for a better understanding of the transcriptional regulation. Because of their importance in the process of biological transcriptional regulation, there is an urgent need to develop in silico tools to identify promoters and their types timely and accurately. A number of prediction methods had been developed in this regard; however, almost all of them were merely used for identifying promoters and their strength or sigma types. Owing to that TATA box region in TATA promoter that influences posttranscriptional processes, in the current study, we developed a two-layer predictor called iPTT(2L)-CNN by using the convolutional neural network (CNN) for identifying TATA and TATA-less promoters. The first layer can be used to identify a given DNA sequence as a promoter or nonpromoter. The second layer is used to identify whether the recognized promoter is TATA promoter or not. The 5-fold crossvalidation and independent testing results demonstrate that the constructed predictor is promising for identifying promoter and classifying TATA and TATA-less promoter. Furthermore, to make it easier for most experimental scientists get the results they need, a user-friendly web server has been established at http://www.jci-bioinfo.cn/iPPT(2L)-CNN. Hindawi 2021-01-05 /pmc/articles/PMC7803414/ /pubmed/33488763 http://dx.doi.org/10.1155/2021/6636350 Text en Copyright © 2021 Ang Sun et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sun, Ang
Xiao, Xuan
Xu, Zhaochun
iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network
title iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network
title_full iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network
title_fullStr iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network
title_full_unstemmed iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network
title_short iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network
title_sort iptt(2 l)-cnn: a two-layer predictor for identifying promoters and their types in plant genomes by convolutional neural network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7803414/
https://www.ncbi.nlm.nih.gov/pubmed/33488763
http://dx.doi.org/10.1155/2021/6636350
work_keys_str_mv AT sunang iptt2lcnnatwolayerpredictorforidentifyingpromotersandtheirtypesinplantgenomesbyconvolutionalneuralnetwork
AT xiaoxuan iptt2lcnnatwolayerpredictorforidentifyingpromotersandtheirtypesinplantgenomesbyconvolutionalneuralnetwork
AT xuzhaochun iptt2lcnnatwolayerpredictorforidentifyingpromotersandtheirtypesinplantgenomesbyconvolutionalneuralnetwork