Cargando…

PEDLA: predicting enhancers with a deep learning-based algorithmic framework

Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Feng, Li, Hao, Ren, Chao, Bo, Xiaochen, Shu, Wenjie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916453/
https://www.ncbi.nlm.nih.gov/pubmed/27329130
http://dx.doi.org/10.1038/srep28517
_version_ 1782438834428968960
author Liu, Feng
Li, Hao
Ren, Chao
Bo, Xiaochen
Shu, Wenjie
author_facet Liu, Feng
Li, Hao
Ren, Chao
Bo, Xiaochen
Shu, Wenjie
author_sort Liu, Feng
collection PubMed
description Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer prediction, they still suffer from many issues. We developed a deep learning-based algorithmic framework named PEDLA (https://github.com/wenjiegroup/PEDLA), which can directly learn an enhancer predictor from massively heterogeneous data and generalize in ways that are mostly consistent across various cell types/tissues. We first trained PEDLA with 1,114-dimensional heterogeneous features in H1 cells, and demonstrated that PEDLA framework integrates diverse heterogeneous features and gives state-of-the-art performance relative to five existing methods for enhancer prediction. We further extended PEDLA to iteratively learn from 22 training cell types/tissues. Our results showed that PEDLA manifested superior performance consistency in both training and independent test sets. On average, PEDLA achieved 95.0% accuracy and a 96.8% geometric mean (GM) of sensitivity and specificity across 22 training cell types/tissues, as well as 95.7% accuracy and a 96.8% GM across 20 independent test cell types/tissues. Together, our work illustrates the power of harnessing state-of-the-art deep learning techniques to consistently identify regulatory elements at a genome-wide scale from massively heterogeneous data across diverse cell types/tissues.
format Online
Article
Text
id pubmed-4916453
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-49164532016-06-27 PEDLA: predicting enhancers with a deep learning-based algorithmic framework Liu, Feng Li, Hao Ren, Chao Bo, Xiaochen Shu, Wenjie Sci Rep Article Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer prediction, they still suffer from many issues. We developed a deep learning-based algorithmic framework named PEDLA (https://github.com/wenjiegroup/PEDLA), which can directly learn an enhancer predictor from massively heterogeneous data and generalize in ways that are mostly consistent across various cell types/tissues. We first trained PEDLA with 1,114-dimensional heterogeneous features in H1 cells, and demonstrated that PEDLA framework integrates diverse heterogeneous features and gives state-of-the-art performance relative to five existing methods for enhancer prediction. We further extended PEDLA to iteratively learn from 22 training cell types/tissues. Our results showed that PEDLA manifested superior performance consistency in both training and independent test sets. On average, PEDLA achieved 95.0% accuracy and a 96.8% geometric mean (GM) of sensitivity and specificity across 22 training cell types/tissues, as well as 95.7% accuracy and a 96.8% GM across 20 independent test cell types/tissues. Together, our work illustrates the power of harnessing state-of-the-art deep learning techniques to consistently identify regulatory elements at a genome-wide scale from massively heterogeneous data across diverse cell types/tissues. Nature Publishing Group 2016-06-22 /pmc/articles/PMC4916453/ /pubmed/27329130 http://dx.doi.org/10.1038/srep28517 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Liu, Feng
Li, Hao
Ren, Chao
Bo, Xiaochen
Shu, Wenjie
PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_full PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_fullStr PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_full_unstemmed PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_short PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_sort pedla: predicting enhancers with a deep learning-based algorithmic framework
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916453/
https://www.ncbi.nlm.nih.gov/pubmed/27329130
http://dx.doi.org/10.1038/srep28517
work_keys_str_mv AT liufeng pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework
AT lihao pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework
AT renchao pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework
AT boxiaochen pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework
AT shuwenjie pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework