Cargando…

EP-DNN: A Deep Neural Network-Based Global Enhancer Prediction Algorithm

We present EP-DNN, a protocol for predicting enhancers based on chromatin features, in different cell types. Specifically, we use a deep neural network (DNN)-based architecture to extract enhancer signatures in a representative human embryonic stem cell type (H1) and a differentiated lung cell type...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Seong Gon, Harwani, Mrudul, Grama, Ananth, Chaterji, Somali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5144062/
https://www.ncbi.nlm.nih.gov/pubmed/27929098
http://dx.doi.org/10.1038/srep38433
_version_ 1782473053479895040
author Kim, Seong Gon
Harwani, Mrudul
Grama, Ananth
Chaterji, Somali
author_facet Kim, Seong Gon
Harwani, Mrudul
Grama, Ananth
Chaterji, Somali
author_sort Kim, Seong Gon
collection PubMed
description We present EP-DNN, a protocol for predicting enhancers based on chromatin features, in different cell types. Specifically, we use a deep neural network (DNN)-based architecture to extract enhancer signatures in a representative human embryonic stem cell type (H1) and a differentiated lung cell type (IMR90). We train EP-DNN using p300 binding sites, as enhancers, and TSS and random non-DHS sites, as non-enhancers. We perform same-cell and cross-cell predictions to quantify the validation rate and compare against two state-of-the-art methods, DEEP-ENCODE and RFECS. We find that EP-DNN has superior accuracy with a validation rate of 91.6%, relative to 85.3% for DEEP-ENCODE and 85.5% for RFECS, for a given number of enhancer predictions and also scales better for a larger number of enhancer predictions. Moreover, our H1 → IMR90 predictions turn out to be more accurate than IMR90 → IMR90, potentially because H1 exhibits a richer signature set and our EP-DNN model is expressive enough to extract these subtleties. Our work shows how to leverage the full expressivity of deep learning models, using multiple hidden layers, while avoiding overfitting on the training data. We also lay the foundation for exploration of cross-cell enhancer predictions, potentially reducing the need for expensive experimentation.
format Online
Article
Text
id pubmed-5144062
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-51440622016-12-16 EP-DNN: A Deep Neural Network-Based Global Enhancer Prediction Algorithm Kim, Seong Gon Harwani, Mrudul Grama, Ananth Chaterji, Somali Sci Rep Article We present EP-DNN, a protocol for predicting enhancers based on chromatin features, in different cell types. Specifically, we use a deep neural network (DNN)-based architecture to extract enhancer signatures in a representative human embryonic stem cell type (H1) and a differentiated lung cell type (IMR90). We train EP-DNN using p300 binding sites, as enhancers, and TSS and random non-DHS sites, as non-enhancers. We perform same-cell and cross-cell predictions to quantify the validation rate and compare against two state-of-the-art methods, DEEP-ENCODE and RFECS. We find that EP-DNN has superior accuracy with a validation rate of 91.6%, relative to 85.3% for DEEP-ENCODE and 85.5% for RFECS, for a given number of enhancer predictions and also scales better for a larger number of enhancer predictions. Moreover, our H1 → IMR90 predictions turn out to be more accurate than IMR90 → IMR90, potentially because H1 exhibits a richer signature set and our EP-DNN model is expressive enough to extract these subtleties. Our work shows how to leverage the full expressivity of deep learning models, using multiple hidden layers, while avoiding overfitting on the training data. We also lay the foundation for exploration of cross-cell enhancer predictions, potentially reducing the need for expensive experimentation. Nature Publishing Group 2016-12-08 /pmc/articles/PMC5144062/ /pubmed/27929098 http://dx.doi.org/10.1038/srep38433 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Kim, Seong Gon
Harwani, Mrudul
Grama, Ananth
Chaterji, Somali
EP-DNN: A Deep Neural Network-Based Global Enhancer Prediction Algorithm
title EP-DNN: A Deep Neural Network-Based Global Enhancer Prediction Algorithm
title_full EP-DNN: A Deep Neural Network-Based Global Enhancer Prediction Algorithm
title_fullStr EP-DNN: A Deep Neural Network-Based Global Enhancer Prediction Algorithm
title_full_unstemmed EP-DNN: A Deep Neural Network-Based Global Enhancer Prediction Algorithm
title_short EP-DNN: A Deep Neural Network-Based Global Enhancer Prediction Algorithm
title_sort ep-dnn: a deep neural network-based global enhancer prediction algorithm
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5144062/
https://www.ncbi.nlm.nih.gov/pubmed/27929098
http://dx.doi.org/10.1038/srep38433
work_keys_str_mv AT kimseonggon epdnnadeepneuralnetworkbasedglobalenhancerpredictionalgorithm
AT harwanimrudul epdnnadeepneuralnetworkbasedglobalenhancerpredictionalgorithm
AT gramaananth epdnnadeepneuralnetworkbasedglobalenhancerpredictionalgorithm
AT chaterjisomali epdnnadeepneuralnetworkbasedglobalenhancerpredictionalgorithm