Cargando…
SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models
OBJECTIVE: To address the challenge of computational identification of cell type-specific regulatory elements on a genome-wide scale. RESULTS: We propose SeqEnhDL, a deep learning framework for classifying cell type-specific enhancers based on sequence features. DNA sequences of “strong enhancer” ch...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7980595/ https://www.ncbi.nlm.nih.gov/pubmed/33741075 http://dx.doi.org/10.1186/s13104-021-05518-7 |
_version_ | 1783667459999399936 |
---|---|
author | Wang, Yupeng Jaime-Lara, Rosario B. Roy, Abhrarup Sun, Ying Liu, Xinyue Joseph, Paule V. |
author_facet | Wang, Yupeng Jaime-Lara, Rosario B. Roy, Abhrarup Sun, Ying Liu, Xinyue Joseph, Paule V. |
author_sort | Wang, Yupeng |
collection | PubMed |
description | OBJECTIVE: To address the challenge of computational identification of cell type-specific regulatory elements on a genome-wide scale. RESULTS: We propose SeqEnhDL, a deep learning framework for classifying cell type-specific enhancers based on sequence features. DNA sequences of “strong enhancer” chromatin states in nine cell types from the ENCODE project were retrieved to build and test enhancer classifiers. For any DNA sequence, positional k-mer (k = 5, 7, 9 and 11) fold changes relative to randomly selected non-coding sequences across each nucleotide position were used as features for deep learning models. Three deep learning models were implemented, including multi-layer perceptron (MLP), Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). All models in SeqEnhDL outperform state-of-the-art enhancer classifiers (including gkm-SVM and DanQ) in distinguishing cell type-specific enhancers from randomly selected non-coding sequences. Moreover, SeqEnhDL can directly discriminate enhancers from different cell types, which has not been achieved by other enhancer classifiers. Our analysis suggests that both enhancers and their tissue-specificity can be accurately identified based on their sequence features. SeqEnhDL is publicly available at https://github.com/wyp1125/SeqEnhDL. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13104-021-05518-7. |
format | Online Article Text |
id | pubmed-7980595 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-79805952021-03-22 SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models Wang, Yupeng Jaime-Lara, Rosario B. Roy, Abhrarup Sun, Ying Liu, Xinyue Joseph, Paule V. BMC Res Notes Research Note OBJECTIVE: To address the challenge of computational identification of cell type-specific regulatory elements on a genome-wide scale. RESULTS: We propose SeqEnhDL, a deep learning framework for classifying cell type-specific enhancers based on sequence features. DNA sequences of “strong enhancer” chromatin states in nine cell types from the ENCODE project were retrieved to build and test enhancer classifiers. For any DNA sequence, positional k-mer (k = 5, 7, 9 and 11) fold changes relative to randomly selected non-coding sequences across each nucleotide position were used as features for deep learning models. Three deep learning models were implemented, including multi-layer perceptron (MLP), Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). All models in SeqEnhDL outperform state-of-the-art enhancer classifiers (including gkm-SVM and DanQ) in distinguishing cell type-specific enhancers from randomly selected non-coding sequences. Moreover, SeqEnhDL can directly discriminate enhancers from different cell types, which has not been achieved by other enhancer classifiers. Our analysis suggests that both enhancers and their tissue-specificity can be accurately identified based on their sequence features. SeqEnhDL is publicly available at https://github.com/wyp1125/SeqEnhDL. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13104-021-05518-7. BioMed Central 2021-03-19 /pmc/articles/PMC7980595/ /pubmed/33741075 http://dx.doi.org/10.1186/s13104-021-05518-7 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Note Wang, Yupeng Jaime-Lara, Rosario B. Roy, Abhrarup Sun, Ying Liu, Xinyue Joseph, Paule V. SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models |
title | SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models |
title_full | SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models |
title_fullStr | SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models |
title_full_unstemmed | SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models |
title_short | SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models |
title_sort | seqenhdl: sequence-based classification of cell type-specific enhancers using deep learning models |
topic | Research Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7980595/ https://www.ncbi.nlm.nih.gov/pubmed/33741075 http://dx.doi.org/10.1186/s13104-021-05518-7 |
work_keys_str_mv | AT wangyupeng seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels AT jaimelararosariob seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels AT royabhrarup seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels AT sunying seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels AT liuxinyue seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels AT josephpaulev seqenhdlsequencebasedclassificationofcelltypespecificenhancersusingdeeplearningmodels |