Cargando…

SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models

OBJECTIVE: To address the challenge of computational identification of cell type-specific regulatory elements on a genome-wide scale. RESULTS: We propose SeqEnhDL, a deep learning framework for classifying cell type-specific enhancers based on sequence features. DNA sequences of “strong enhancer” ch...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yupeng, Jaime-Lara, Rosario B., Roy, Abhrarup, Sun, Ying, Liu, Xinyue, Joseph, Paule V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7980595/
https://www.ncbi.nlm.nih.gov/pubmed/33741075
http://dx.doi.org/10.1186/s13104-021-05518-7
_version_ 1783667459999399936
author Wang, Yupeng
Jaime-Lara, Rosario B.
Roy, Abhrarup
Sun, Ying
Liu, Xinyue
Joseph, Paule V.
author_facet Wang, Yupeng
Jaime-Lara, Rosario B.
Roy, Abhrarup
Sun, Ying
Liu, Xinyue
Joseph, Paule V.
author_sort Wang, Yupeng
collection PubMed
description OBJECTIVE: To address the challenge of computational identification of cell type-specific regulatory elements on a genome-wide scale. RESULTS: We propose SeqEnhDL, a deep learning framework for classifying cell type-specific enhancers based on sequence features. DNA sequences of “strong enhancer” chromatin states in nine cell types from the ENCODE project were retrieved to build and test enhancer classifiers. For any DNA sequence, positional k-mer (k = 5, 7, 9 and 11) fold changes relative to randomly selected non-coding sequences across each nucleotide position were used as features for deep learning models. Three deep learning models were implemented, including multi-layer perceptron (MLP), Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). All models in SeqEnhDL outperform state-of-the-art enhancer classifiers (including gkm-SVM and DanQ) in distinguishing cell type-specific enhancers from randomly selected non-coding sequences. Moreover, SeqEnhDL can directly discriminate enhancers from different cell types, which has not been achieved by other enhancer classifiers. Our analysis suggests that both enhancers and their tissue-specificity can be accurately identified based on their sequence features. SeqEnhDL is publicly available at https://github.com/wyp1125/SeqEnhDL. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13104-021-05518-7.
format Online
Article
Text
id pubmed-7980595
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79805952021-03-22 SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models Wang, Yupeng Jaime-Lara, Rosario B. Roy, Abhrarup Sun, Ying Liu, Xinyue Joseph, Paule V. BMC Res Notes Research Note OBJECTIVE: To address the challenge of computational identification of cell type-specific regulatory elements on a genome-wide scale. RESULTS: We propose SeqEnhDL, a deep learning framework for classifying cell type-specific enhancers based on sequence features. DNA sequences of “strong enhancer” chromatin states in nine cell types from the ENCODE project were retrieved to build and test enhancer classifiers. For any DNA sequence, positional k-mer (k = 5, 7, 9 and 11) fold changes relative to randomly selected non-coding sequences across each nucleotide position were used as features for deep learning models. Three deep learning models were implemented, including multi-layer perceptron (MLP), Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). All models in SeqEnhDL outperform state-of-the-art enhancer classifiers (including gkm-SVM and DanQ) in distinguishing cell type-specific enhancers from randomly selected non-coding sequences. Moreover, SeqEnhDL can directly discriminate enhancers from different cell types, which has not been achieved by other enhancer classifiers. Our analysis suggests that both enhancers and their tissue-specificity can be accurately identified based on their sequence features. SeqEnhDL is publicly available at https://github.com/wyp1125/SeqEnhDL. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13104-021-05518-7. BioMed Central 2021-03-19 /pmc/articles/PMC7980595/ /pubmed/33741075 http://dx.doi.org/10.1186/s13104-021-05518-7 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Note
Wang, Yupeng
Jaime-Lara, Rosario B.
Roy, Abhrarup
Sun, Ying
Liu, Xinyue
Joseph, Paule V.
SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models
title SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models
title_full SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models
title_fullStr SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models
title_full_unstemmed SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models
title_short SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models
title_sort seqenhdl: sequence-based classification of cell type-specific enhancers using deep learning models
topic Research Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7980595/
https://www.ncbi.nlm.nih.gov/pubmed/33741075
http://dx.doi.org/10.1186/s13104-021-05518-7
work_keys_str_mv AT wangyupeng seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels
AT jaimelararosariob seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels
AT royabhrarup seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels
AT sunying seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels
AT liuxinyue seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels
AT josephpaulev seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels