Cargando…

iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework

Enhancers are small regions of DNA that bind to proteins, which enhance the transcription of genes. The enhancer may be located upstream or downstream of the gene. It is not necessarily close to the gene to be acted on, because the entanglement structure of chromatin allows the positions far apart i...

Descripción completa

Detalles Bibliográficos
Autores principales: Liao, Meng, Zhao, Jian-ping, Tian, Jing, Zheng, Chun-Hou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9664816/
https://www.ncbi.nlm.nih.gov/pubmed/36376800
http://dx.doi.org/10.1186/s12859-022-05033-x
Descripción
Sumario:Enhancers are small regions of DNA that bind to proteins, which enhance the transcription of genes. The enhancer may be located upstream or downstream of the gene. It is not necessarily close to the gene to be acted on, because the entanglement structure of chromatin allows the positions far apart in the sequence to have the opportunity to contact each other. Therefore, identifying enhancers and their strength is a complex and challenging task. In this article, a new prediction method based on deep learning is proposed to identify enhancers and enhancer strength, called iEnhancer-DCLA. Firstly, we use word2vec to convert k-mers into number vectors to construct an input matrix. Secondly, we use convolutional neural network and bidirectional long short-term memory network to extract sequence features, and finally use the attention mechanism to extract relatively important features. In the task of predicting enhancers and their strengths, this method has improved to a certain extent in most evaluation indexes. In summary, we believe that this method provides new ideas in the analysis of enhancers. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05033-x.