Cargando…
DeePromoter: Robust Promoter Predictor Using Deep Learning
The promoter region is located near the transcription start sites and regulates transcription initiation of the gene by controlling the binding of RNA polymerase. Thus, promoter region recognition is an important area of interest in the field of bioinformatics. Numerous tools for promoter prediction...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6460014/ https://www.ncbi.nlm.nih.gov/pubmed/31024615 http://dx.doi.org/10.3389/fgene.2019.00286 |
_version_ | 1783410268831744000 |
---|---|
author | Oubounyt, Mhaned Louadi, Zakaria Tayara, Hilal Chong, Kil To |
author_facet | Oubounyt, Mhaned Louadi, Zakaria Tayara, Hilal Chong, Kil To |
author_sort | Oubounyt, Mhaned |
collection | PubMed |
description | The promoter region is located near the transcription start sites and regulates transcription initiation of the gene by controlling the binding of RNA polymerase. Thus, promoter region recognition is an important area of interest in the field of bioinformatics. Numerous tools for promoter prediction were proposed. However, the reliability of these tools still needs to be improved. In this work, we propose a robust deep learning model, called DeePromoter, to analyze the characteristics of the short eukaryotic promoter sequences, and accurately recognize the human and mouse promoter sequences. DeePromoter combines a convolutional neural network (CNN) and a long short-term memory (LSTM). Additionally, instead of using non-promoter regions of the genome as a negative set, we derive a more challenging negative set from the promoter sequences. The proposed negative set reconstruction method improves the discrimination ability and significantly reduces the number of false positive predictions. Consequently, DeePromoter outperforms the previously proposed promoter prediction tools. In addition, a web-server for promoter prediction is developed based on the proposed methods and made available at https://home.jbnu.ac.kr/NSCL/deepromoter.htm. |
format | Online Article Text |
id | pubmed-6460014 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-64600142019-04-25 DeePromoter: Robust Promoter Predictor Using Deep Learning Oubounyt, Mhaned Louadi, Zakaria Tayara, Hilal Chong, Kil To Front Genet Genetics The promoter region is located near the transcription start sites and regulates transcription initiation of the gene by controlling the binding of RNA polymerase. Thus, promoter region recognition is an important area of interest in the field of bioinformatics. Numerous tools for promoter prediction were proposed. However, the reliability of these tools still needs to be improved. In this work, we propose a robust deep learning model, called DeePromoter, to analyze the characteristics of the short eukaryotic promoter sequences, and accurately recognize the human and mouse promoter sequences. DeePromoter combines a convolutional neural network (CNN) and a long short-term memory (LSTM). Additionally, instead of using non-promoter regions of the genome as a negative set, we derive a more challenging negative set from the promoter sequences. The proposed negative set reconstruction method improves the discrimination ability and significantly reduces the number of false positive predictions. Consequently, DeePromoter outperforms the previously proposed promoter prediction tools. In addition, a web-server for promoter prediction is developed based on the proposed methods and made available at https://home.jbnu.ac.kr/NSCL/deepromoter.htm. Frontiers Media S.A. 2019-04-05 /pmc/articles/PMC6460014/ /pubmed/31024615 http://dx.doi.org/10.3389/fgene.2019.00286 Text en Copyright © 2019 Oubounyt, Louadi, Tayara and Chong. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Oubounyt, Mhaned Louadi, Zakaria Tayara, Hilal Chong, Kil To DeePromoter: Robust Promoter Predictor Using Deep Learning |
title | DeePromoter: Robust Promoter Predictor Using Deep Learning |
title_full | DeePromoter: Robust Promoter Predictor Using Deep Learning |
title_fullStr | DeePromoter: Robust Promoter Predictor Using Deep Learning |
title_full_unstemmed | DeePromoter: Robust Promoter Predictor Using Deep Learning |
title_short | DeePromoter: Robust Promoter Predictor Using Deep Learning |
title_sort | deepromoter: robust promoter predictor using deep learning |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6460014/ https://www.ncbi.nlm.nih.gov/pubmed/31024615 http://dx.doi.org/10.3389/fgene.2019.00286 |
work_keys_str_mv | AT oubounytmhaned deepromoterrobustpromoterpredictorusingdeeplearning AT louadizakaria deepromoterrobustpromoterpredictorusingdeeplearning AT tayarahilal deepromoterrobustpromoterpredictorusingdeeplearning AT chongkilto deepromoterrobustpromoterpredictorusingdeeplearning |