Cargando…
PEDLA: predicting enhancers with a deep learning-based algorithmic framework
Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer predic...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916453/ https://www.ncbi.nlm.nih.gov/pubmed/27329130 http://dx.doi.org/10.1038/srep28517 |
_version_ | 1782438834428968960 |
---|---|
author | Liu, Feng Li, Hao Ren, Chao Bo, Xiaochen Shu, Wenjie |
author_facet | Liu, Feng Li, Hao Ren, Chao Bo, Xiaochen Shu, Wenjie |
author_sort | Liu, Feng |
collection | PubMed |
description | Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer prediction, they still suffer from many issues. We developed a deep learning-based algorithmic framework named PEDLA (https://github.com/wenjiegroup/PEDLA), which can directly learn an enhancer predictor from massively heterogeneous data and generalize in ways that are mostly consistent across various cell types/tissues. We first trained PEDLA with 1,114-dimensional heterogeneous features in H1 cells, and demonstrated that PEDLA framework integrates diverse heterogeneous features and gives state-of-the-art performance relative to five existing methods for enhancer prediction. We further extended PEDLA to iteratively learn from 22 training cell types/tissues. Our results showed that PEDLA manifested superior performance consistency in both training and independent test sets. On average, PEDLA achieved 95.0% accuracy and a 96.8% geometric mean (GM) of sensitivity and specificity across 22 training cell types/tissues, as well as 95.7% accuracy and a 96.8% GM across 20 independent test cell types/tissues. Together, our work illustrates the power of harnessing state-of-the-art deep learning techniques to consistently identify regulatory elements at a genome-wide scale from massively heterogeneous data across diverse cell types/tissues. |
format | Online Article Text |
id | pubmed-4916453 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-49164532016-06-27 PEDLA: predicting enhancers with a deep learning-based algorithmic framework Liu, Feng Li, Hao Ren, Chao Bo, Xiaochen Shu, Wenjie Sci Rep Article Transcriptional enhancers are non-coding segments of DNA that play a central role in the spatiotemporal regulation of gene expression programs. However, systematically and precisely predicting enhancers remain a major challenge. Although existing methods have achieved some success in enhancer prediction, they still suffer from many issues. We developed a deep learning-based algorithmic framework named PEDLA (https://github.com/wenjiegroup/PEDLA), which can directly learn an enhancer predictor from massively heterogeneous data and generalize in ways that are mostly consistent across various cell types/tissues. We first trained PEDLA with 1,114-dimensional heterogeneous features in H1 cells, and demonstrated that PEDLA framework integrates diverse heterogeneous features and gives state-of-the-art performance relative to five existing methods for enhancer prediction. We further extended PEDLA to iteratively learn from 22 training cell types/tissues. Our results showed that PEDLA manifested superior performance consistency in both training and independent test sets. On average, PEDLA achieved 95.0% accuracy and a 96.8% geometric mean (GM) of sensitivity and specificity across 22 training cell types/tissues, as well as 95.7% accuracy and a 96.8% GM across 20 independent test cell types/tissues. Together, our work illustrates the power of harnessing state-of-the-art deep learning techniques to consistently identify regulatory elements at a genome-wide scale from massively heterogeneous data across diverse cell types/tissues. Nature Publishing Group 2016-06-22 /pmc/articles/PMC4916453/ /pubmed/27329130 http://dx.doi.org/10.1038/srep28517 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Liu, Feng Li, Hao Ren, Chao Bo, Xiaochen Shu, Wenjie PEDLA: predicting enhancers with a deep learning-based algorithmic framework |
title | PEDLA: predicting enhancers with a deep learning-based algorithmic framework |
title_full | PEDLA: predicting enhancers with a deep learning-based algorithmic framework |
title_fullStr | PEDLA: predicting enhancers with a deep learning-based algorithmic framework |
title_full_unstemmed | PEDLA: predicting enhancers with a deep learning-based algorithmic framework |
title_short | PEDLA: predicting enhancers with a deep learning-based algorithmic framework |
title_sort | pedla: predicting enhancers with a deep learning-based algorithmic framework |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4916453/ https://www.ncbi.nlm.nih.gov/pubmed/27329130 http://dx.doi.org/10.1038/srep28517 |
work_keys_str_mv | AT liufeng pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework AT lihao pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework AT renchao pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework AT boxiaochen pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework AT shuwenjie pedlapredictingenhancerswithadeeplearningbasedalgorithmicframework |