Cargando…

DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers

The establishment of a landscape of enhancers across human cells is crucial to deciphering the mechanism of gene regulation, cell differentiation, and disease development. High-throughput experimental approaches, which contain successfully reported enhancers in typical cell lines, are still too cost...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Shengquan, Gan, Mingxin, Lv, Hairong, Jiang, Rui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040020/
https://www.ncbi.nlm.nih.gov/pubmed/33581335
http://dx.doi.org/10.1016/j.gpb.2019.04.006
_version_ 1784694253831585792
author Chen, Shengquan
Gan, Mingxin
Lv, Hairong
Jiang, Rui
author_facet Chen, Shengquan
Gan, Mingxin
Lv, Hairong
Jiang, Rui
author_sort Chen, Shengquan
collection PubMed
description The establishment of a landscape of enhancers across human cells is crucial to deciphering the mechanism of gene regulation, cell differentiation, and disease development. High-throughput experimental approaches, which contain successfully reported enhancers in typical cell lines, are still too costly and time-consuming to perform systematic identification of enhancers specific to different cell lines. Existing computational methods, capable of predicting regulatory elements purely relying on DNA sequences, lack the power of cell line-specific screening. Recent studies have suggested that chromatin accessibility of a DNA segment is closely related to its potential function in regulation, and thus may provide useful information in identifying regulatory elements. Motivated by the aforementioned understanding, we integrate DNA sequences and chromatin accessibility data to accurately predict enhancers in a cell line-specific manner. We proposed DeepCAPE, a deep convolutional neural network to predict enhancers via the integration of DNA sequences and DNase-seq data. Benefitting from the well-designed feature extraction mechanism and skip connection strategy, our model not only consistently outperforms existing methods in the imbalanced classification of cell line-specific enhancers against background sequences, but also has the ability to self-adapt to different sizes of datasets. Besides, with the adoption of auto-encoder, our model is capable of making cross-cell line predictions. We further visualize kernels of the first convolutional layer and show the match of identified sequence signatures and known motifs. We finally demonstrate the potential ability of our model to explain functional implications of putative disease-associated genetic variants and discriminate disease-related enhancers. The source code and detailed tutorial of DeepCAPE are freely available at https://github.com/ShengquanChen/DeepCAPE.
format Online
Article
Text
id pubmed-9040020
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-90400202022-04-27 DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers Chen, Shengquan Gan, Mingxin Lv, Hairong Jiang, Rui Genomics Proteomics Bioinformatics Method The establishment of a landscape of enhancers across human cells is crucial to deciphering the mechanism of gene regulation, cell differentiation, and disease development. High-throughput experimental approaches, which contain successfully reported enhancers in typical cell lines, are still too costly and time-consuming to perform systematic identification of enhancers specific to different cell lines. Existing computational methods, capable of predicting regulatory elements purely relying on DNA sequences, lack the power of cell line-specific screening. Recent studies have suggested that chromatin accessibility of a DNA segment is closely related to its potential function in regulation, and thus may provide useful information in identifying regulatory elements. Motivated by the aforementioned understanding, we integrate DNA sequences and chromatin accessibility data to accurately predict enhancers in a cell line-specific manner. We proposed DeepCAPE, a deep convolutional neural network to predict enhancers via the integration of DNA sequences and DNase-seq data. Benefitting from the well-designed feature extraction mechanism and skip connection strategy, our model not only consistently outperforms existing methods in the imbalanced classification of cell line-specific enhancers against background sequences, but also has the ability to self-adapt to different sizes of datasets. Besides, with the adoption of auto-encoder, our model is capable of making cross-cell line predictions. We further visualize kernels of the first convolutional layer and show the match of identified sequence signatures and known motifs. We finally demonstrate the potential ability of our model to explain functional implications of putative disease-associated genetic variants and discriminate disease-related enhancers. The source code and detailed tutorial of DeepCAPE are freely available at https://github.com/ShengquanChen/DeepCAPE. Elsevier 2021-08 2021-02-11 /pmc/articles/PMC9040020/ /pubmed/33581335 http://dx.doi.org/10.1016/j.gpb.2019.04.006 Text en © 2021 The Author https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Method
Chen, Shengquan
Gan, Mingxin
Lv, Hairong
Jiang, Rui
DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers
title DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers
title_full DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers
title_fullStr DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers
title_full_unstemmed DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers
title_short DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers
title_sort deepcape: a deep convolutional neural network for the accurate prediction of enhancers
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040020/
https://www.ncbi.nlm.nih.gov/pubmed/33581335
http://dx.doi.org/10.1016/j.gpb.2019.04.006
work_keys_str_mv AT chenshengquan deepcapeadeepconvolutionalneuralnetworkfortheaccuratepredictionofenhancers
AT ganmingxin deepcapeadeepconvolutionalneuralnetworkfortheaccuratepredictionofenhancers
AT lvhairong deepcapeadeepconvolutionalneuralnetworkfortheaccuratepredictionofenhancers
AT jiangrui deepcapeadeepconvolutionalneuralnetworkfortheaccuratepredictionofenhancers