Cargando…

DeePromoter: Robust Promoter Predictor Using Deep Learning

The promoter region is located near the transcription start sites and regulates transcription initiation of the gene by controlling the binding of RNA polymerase. Thus, promoter region recognition is an important area of interest in the field of bioinformatics. Numerous tools for promoter prediction...

Descripción completa

Detalles Bibliográficos
Autores principales: Oubounyt, Mhaned, Louadi, Zakaria, Tayara, Hilal, Chong, Kil To
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6460014/
https://www.ncbi.nlm.nih.gov/pubmed/31024615
http://dx.doi.org/10.3389/fgene.2019.00286
_version_ 1783410268831744000
author Oubounyt, Mhaned
Louadi, Zakaria
Tayara, Hilal
Chong, Kil To
author_facet Oubounyt, Mhaned
Louadi, Zakaria
Tayara, Hilal
Chong, Kil To
author_sort Oubounyt, Mhaned
collection PubMed
description The promoter region is located near the transcription start sites and regulates transcription initiation of the gene by controlling the binding of RNA polymerase. Thus, promoter region recognition is an important area of interest in the field of bioinformatics. Numerous tools for promoter prediction were proposed. However, the reliability of these tools still needs to be improved. In this work, we propose a robust deep learning model, called DeePromoter, to analyze the characteristics of the short eukaryotic promoter sequences, and accurately recognize the human and mouse promoter sequences. DeePromoter combines a convolutional neural network (CNN) and a long short-term memory (LSTM). Additionally, instead of using non-promoter regions of the genome as a negative set, we derive a more challenging negative set from the promoter sequences. The proposed negative set reconstruction method improves the discrimination ability and significantly reduces the number of false positive predictions. Consequently, DeePromoter outperforms the previously proposed promoter prediction tools. In addition, a web-server for promoter prediction is developed based on the proposed methods and made available at https://home.jbnu.ac.kr/NSCL/deepromoter.htm.
format Online
Article
Text
id pubmed-6460014
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-64600142019-04-25 DeePromoter: Robust Promoter Predictor Using Deep Learning Oubounyt, Mhaned Louadi, Zakaria Tayara, Hilal Chong, Kil To Front Genet Genetics The promoter region is located near the transcription start sites and regulates transcription initiation of the gene by controlling the binding of RNA polymerase. Thus, promoter region recognition is an important area of interest in the field of bioinformatics. Numerous tools for promoter prediction were proposed. However, the reliability of these tools still needs to be improved. In this work, we propose a robust deep learning model, called DeePromoter, to analyze the characteristics of the short eukaryotic promoter sequences, and accurately recognize the human and mouse promoter sequences. DeePromoter combines a convolutional neural network (CNN) and a long short-term memory (LSTM). Additionally, instead of using non-promoter regions of the genome as a negative set, we derive a more challenging negative set from the promoter sequences. The proposed negative set reconstruction method improves the discrimination ability and significantly reduces the number of false positive predictions. Consequently, DeePromoter outperforms the previously proposed promoter prediction tools. In addition, a web-server for promoter prediction is developed based on the proposed methods and made available at https://home.jbnu.ac.kr/NSCL/deepromoter.htm. Frontiers Media S.A. 2019-04-05 /pmc/articles/PMC6460014/ /pubmed/31024615 http://dx.doi.org/10.3389/fgene.2019.00286 Text en Copyright © 2019 Oubounyt, Louadi, Tayara and Chong. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Oubounyt, Mhaned
Louadi, Zakaria
Tayara, Hilal
Chong, Kil To
DeePromoter: Robust Promoter Predictor Using Deep Learning
title DeePromoter: Robust Promoter Predictor Using Deep Learning
title_full DeePromoter: Robust Promoter Predictor Using Deep Learning
title_fullStr DeePromoter: Robust Promoter Predictor Using Deep Learning
title_full_unstemmed DeePromoter: Robust Promoter Predictor Using Deep Learning
title_short DeePromoter: Robust Promoter Predictor Using Deep Learning
title_sort deepromoter: robust promoter predictor using deep learning
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6460014/
https://www.ncbi.nlm.nih.gov/pubmed/31024615
http://dx.doi.org/10.3389/fgene.2019.00286
work_keys_str_mv AT oubounytmhaned deepromoterrobustpromoterpredictorusingdeeplearning
AT louadizakaria deepromoterrobustpromoterpredictorusingdeeplearning
AT tayarahilal deepromoterrobustpromoterpredictorusingdeeplearning
AT chongkilto deepromoterrobustpromoterpredictorusingdeeplearning