Cargando…

iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework

Enhancers are small regions of DNA that bind to proteins, which enhance the transcription of genes. The enhancer may be located upstream or downstream of the gene. It is not necessarily close to the gene to be acted on, because the entanglement structure of chromatin allows the positions far apart i...

Descripción completa

Detalles Bibliográficos
Autores principales: Liao, Meng, Zhao, Jian-ping, Tian, Jing, Zheng, Chun-Hou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9664816/
https://www.ncbi.nlm.nih.gov/pubmed/36376800
http://dx.doi.org/10.1186/s12859-022-05033-x
_version_ 1784831179922341888
author Liao, Meng
Zhao, Jian-ping
Tian, Jing
Zheng, Chun-Hou
author_facet Liao, Meng
Zhao, Jian-ping
Tian, Jing
Zheng, Chun-Hou
author_sort Liao, Meng
collection PubMed
description Enhancers are small regions of DNA that bind to proteins, which enhance the transcription of genes. The enhancer may be located upstream or downstream of the gene. It is not necessarily close to the gene to be acted on, because the entanglement structure of chromatin allows the positions far apart in the sequence to have the opportunity to contact each other. Therefore, identifying enhancers and their strength is a complex and challenging task. In this article, a new prediction method based on deep learning is proposed to identify enhancers and enhancer strength, called iEnhancer-DCLA. Firstly, we use word2vec to convert k-mers into number vectors to construct an input matrix. Secondly, we use convolutional neural network and bidirectional long short-term memory network to extract sequence features, and finally use the attention mechanism to extract relatively important features. In the task of predicting enhancers and their strengths, this method has improved to a certain extent in most evaluation indexes. In summary, we believe that this method provides new ideas in the analysis of enhancers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05033-x.
format Online
Article
Text
id pubmed-9664816
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-96648162022-11-15 iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework Liao, Meng Zhao, Jian-ping Tian, Jing Zheng, Chun-Hou BMC Bioinformatics Research Enhancers are small regions of DNA that bind to proteins, which enhance the transcription of genes. The enhancer may be located upstream or downstream of the gene. It is not necessarily close to the gene to be acted on, because the entanglement structure of chromatin allows the positions far apart in the sequence to have the opportunity to contact each other. Therefore, identifying enhancers and their strength is a complex and challenging task. In this article, a new prediction method based on deep learning is proposed to identify enhancers and enhancer strength, called iEnhancer-DCLA. Firstly, we use word2vec to convert k-mers into number vectors to construct an input matrix. Secondly, we use convolutional neural network and bidirectional long short-term memory network to extract sequence features, and finally use the attention mechanism to extract relatively important features. In the task of predicting enhancers and their strengths, this method has improved to a certain extent in most evaluation indexes. In summary, we believe that this method provides new ideas in the analysis of enhancers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05033-x. BioMed Central 2022-11-14 /pmc/articles/PMC9664816/ /pubmed/36376800 http://dx.doi.org/10.1186/s12859-022-05033-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liao, Meng
Zhao, Jian-ping
Tian, Jing
Zheng, Chun-Hou
iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework
title iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework
title_full iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework
title_fullStr iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework
title_full_unstemmed iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework
title_short iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework
title_sort ienhancer-dcla: using the original sequence to identify enhancers and their strength based on a deep learning framework
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9664816/
https://www.ncbi.nlm.nih.gov/pubmed/36376800
http://dx.doi.org/10.1186/s12859-022-05033-x
work_keys_str_mv AT liaomeng ienhancerdclausingtheoriginalsequencetoidentifyenhancersandtheirstrengthbasedonadeeplearningframework
AT zhaojianping ienhancerdclausingtheoriginalsequencetoidentifyenhancersandtheirstrengthbasedonadeeplearningframework
AT tianjing ienhancerdclausingtheoriginalsequencetoidentifyenhancersandtheirstrengthbasedonadeeplearningframework
AT zhengchunhou ienhancerdclausingtheoriginalsequencetoidentifyenhancersandtheirstrengthbasedonadeeplearningframework