Cargando…

Predicting enhancers with deep convolutional neural networks

BACKGROUND: With the rapid development of deep sequencing techniques in the recent years, enhancers have been systematically identified in such projects as FANTOM and ENCODE, forming genome-wide landscapes in a series of human cell lines. Nevertheless, experimental approaches are still costly and ti...

Descripción completa

Detalles Bibliográficos
Autores principales: Min, Xu, Zeng, Wanwen, Chen, Shengquan, Chen, Ning, Chen, Ting, Jiang, Rui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773911/
https://www.ncbi.nlm.nih.gov/pubmed/29219068
http://dx.doi.org/10.1186/s12859-017-1878-3
_version_ 1783293663471730688
author Min, Xu
Zeng, Wanwen
Chen, Shengquan
Chen, Ning
Chen, Ting
Jiang, Rui
author_facet Min, Xu
Zeng, Wanwen
Chen, Shengquan
Chen, Ning
Chen, Ting
Jiang, Rui
author_sort Min, Xu
collection PubMed
description BACKGROUND: With the rapid development of deep sequencing techniques in the recent years, enhancers have been systematically identified in such projects as FANTOM and ENCODE, forming genome-wide landscapes in a series of human cell lines. Nevertheless, experimental approaches are still costly and time consuming for large scale identification of enhancers across a variety of tissues under different disease status, making computational identification of enhancers indispensable. RESULTS: To facilitate the identification of enhancers, we propose a computational framework, named DeepEnhancer, to distinguish enhancers from background genomic sequences. Our method purely relies on DNA sequences to predict enhancers in an end-to-end manner by using a deep convolutional neural network (CNN). We train our deep learning model on permissive enhancers and then adopt a transfer learning strategy to fine-tune the model on enhancers specific to a cell line. Results demonstrate the effectiveness and efficiency of our method in the classification of enhancers against random sequences, exhibiting advantages of deep learning over traditional sequence-based classifiers. We then construct a variety of neural networks with different architectures and show the usefulness of such techniques as max-pooling and batch normalization in our method. To gain the interpretability of our approach, we further visualize convolutional kernels as sequence logos and successfully identify similar motifs in the JASPAR database. CONCLUSIONS: DeepEnhancer enables the identification of novel enhancers using only DNA sequences via a highly accurate deep learning model. The proposed computational framework can also be applied to similar problems, thereby prompting the use of machine learning methods in life sciences.
format Online
Article
Text
id pubmed-5773911
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57739112018-01-26 Predicting enhancers with deep convolutional neural networks Min, Xu Zeng, Wanwen Chen, Shengquan Chen, Ning Chen, Ting Jiang, Rui BMC Bioinformatics Research BACKGROUND: With the rapid development of deep sequencing techniques in the recent years, enhancers have been systematically identified in such projects as FANTOM and ENCODE, forming genome-wide landscapes in a series of human cell lines. Nevertheless, experimental approaches are still costly and time consuming for large scale identification of enhancers across a variety of tissues under different disease status, making computational identification of enhancers indispensable. RESULTS: To facilitate the identification of enhancers, we propose a computational framework, named DeepEnhancer, to distinguish enhancers from background genomic sequences. Our method purely relies on DNA sequences to predict enhancers in an end-to-end manner by using a deep convolutional neural network (CNN). We train our deep learning model on permissive enhancers and then adopt a transfer learning strategy to fine-tune the model on enhancers specific to a cell line. Results demonstrate the effectiveness and efficiency of our method in the classification of enhancers against random sequences, exhibiting advantages of deep learning over traditional sequence-based classifiers. We then construct a variety of neural networks with different architectures and show the usefulness of such techniques as max-pooling and batch normalization in our method. To gain the interpretability of our approach, we further visualize convolutional kernels as sequence logos and successfully identify similar motifs in the JASPAR database. CONCLUSIONS: DeepEnhancer enables the identification of novel enhancers using only DNA sequences via a highly accurate deep learning model. The proposed computational framework can also be applied to similar problems, thereby prompting the use of machine learning methods in life sciences. BioMed Central 2017-12-01 /pmc/articles/PMC5773911/ /pubmed/29219068 http://dx.doi.org/10.1186/s12859-017-1878-3 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Min, Xu
Zeng, Wanwen
Chen, Shengquan
Chen, Ning
Chen, Ting
Jiang, Rui
Predicting enhancers with deep convolutional neural networks
title Predicting enhancers with deep convolutional neural networks
title_full Predicting enhancers with deep convolutional neural networks
title_fullStr Predicting enhancers with deep convolutional neural networks
title_full_unstemmed Predicting enhancers with deep convolutional neural networks
title_short Predicting enhancers with deep convolutional neural networks
title_sort predicting enhancers with deep convolutional neural networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773911/
https://www.ncbi.nlm.nih.gov/pubmed/29219068
http://dx.doi.org/10.1186/s12859-017-1878-3
work_keys_str_mv AT minxu predictingenhancerswithdeepconvolutionalneuralnetworks
AT zengwanwen predictingenhancerswithdeepconvolutionalneuralnetworks
AT chenshengquan predictingenhancerswithdeepconvolutionalneuralnetworks
AT chenning predictingenhancerswithdeepconvolutionalneuralnetworks
AT chenting predictingenhancerswithdeepconvolutionalneuralnetworks
AT jiangrui predictingenhancerswithdeepconvolutionalneuralnetworks