Cargando…

PEDLA: predicting enhancers with a deep learning-based algorithmic framework

Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer predic...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Feng, Li, Hao, Ren, Chao, Bo, Xiaochen, Shu, Wenjie
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group 2016
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916453/ https://www.ncbi.nlm.nih.gov/pubmed/27329130 http://dx.doi.org/10.1038/srep28517

_version_	1782438834428968960
author	Liu, Feng Li, Hao Ren, Chao Bo, Xiaochen Shu, Wenjie
author_facet	Liu, Feng Li, Hao Ren, Chao Bo, Xiaochen Shu, Wenjie
author_sort	Liu, Feng
collection	PubMed
description	Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer prediction, they still suffer from many issues. We developed a deep learning-based algorithmic framework named PEDLA (https://github.com/wenjiegroup/PEDLA), which can directly learn an enhancer predictor from massively heterogeneous data and generalize in ways that are mostly consistent across various cell types/tissues. We first trained PEDLA with 1,114-dimensional heterogeneous features in H1 cells, and demonstrated that PEDLA framework integrates diverse heterogeneous features and gives state-of-the-art performance relative to five existing methods for enhancer prediction. We further extended PEDLA to iteratively learn from 22 training cell types/tissues. Our results showed that PEDLA manifested superior performance consistency in both training and independent test sets. On average, PEDLA achieved 95.0% accuracy and a 96.8% geometric mean (GM) of sensitivity and specificity across 22 training cell types/tissues, as well as 95.7% accuracy and a 96.8% GM across 20 independent test cell types/tissues. Together, our work illustrates the power of harnessing state-of-the-art deep learning techniques to consistently identify regulatory elements at a genome-wide scale from massively heterogeneous data across diverse cell types/tissues.
format	Online Article Text
id	pubmed-4916453
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Nature Publishing Group
record_format	MEDLINE/PubMed
spelling	pubmed-49164532016-06-27 PEDLA: predicting enhancers with a deep learning-based algorithmic framework Liu, Feng Li, Hao Ren, Chao Bo, Xiaochen Shu, Wenjie Sci Rep Article Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer prediction, they still suffer from many issues. We developed a deep learning-based algorithmic framework named PEDLA (https://github.com/wenjiegroup/PEDLA), which can directly learn an enhancer predictor from massively heterogeneous data and generalize in ways that are mostly consistent across various cell types/tissues. We first trained PEDLA with 1,114-dimensional heterogeneous features in H1 cells, and demonstrated that PEDLA framework integrates diverse heterogeneous features and gives state-of-the-art performance relative to five existing methods for enhancer prediction. We further extended PEDLA to iteratively learn from 22 training cell types/tissues. Our results showed that PEDLA manifested superior performance consistency in both training and independent test sets. On average, PEDLA achieved 95.0% accuracy and a 96.8% geometric mean (GM) of sensitivity and specificity across 22 training cell types/tissues, as well as 95.7% accuracy and a 96.8% GM across 20 independent test cell types/tissues. Together, our work illustrates the power of harnessing state-of-the-art deep learning techniques to consistently identify regulatory elements at a genome-wide scale from massively heterogeneous data across diverse cell types/tissues. Nature Publishing Group 2016-06-22 /pmc/articles/PMC4916453/ /pubmed/27329130 http://dx.doi.org/10.1038/srep28517 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle	Article Liu, Feng Li, Hao Ren, Chao Bo, Xiaochen Shu, Wenjie PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title	PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_full	PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_fullStr	PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_full_unstemmed	PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_short	PEDLA: predicting enhancers with a deep learning-based algorithmic framework
title_sort	pedla: predicting enhancers with a deep learning-based algorithmic framework
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916453/ https://www.ncbi.nlm.nih.gov/pubmed/27329130 http://dx.doi.org/10.1038/srep28517
work_keys_str_mv	AT liufeng pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework AT lihao pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework AT renchao pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework AT boxiaochen pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework AT shuwenjie pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework

PEDLA: predicting enhancers with a deep learning-based algorithmic framework

Ejemplares similares