Cargando…

DEEP: a general computational framework for predicting enhancers

Transcription regulation in multicellular eukaryotes is orchestrated by a number of DNA functional elements located at gene regulatory regions. Some regulatory regions (e.g. enhancers) are located far away from the gene they affect. Identification of distal regulatory elements is a challenge for the...

Descripción completa

Detalles Bibliográficos
Autores principales: Kleftogiannis, Dimitrios, Kalnis, Panos, Bajic, Vladimir B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288148/
https://www.ncbi.nlm.nih.gov/pubmed/25378307
http://dx.doi.org/10.1093/nar/gku1058
_version_ 1782351916693454848
author Kleftogiannis, Dimitrios
Kalnis, Panos
Bajic, Vladimir B.
author_facet Kleftogiannis, Dimitrios
Kalnis, Panos
Bajic, Vladimir B.
author_sort Kleftogiannis, Dimitrios
collection PubMed
description Transcription regulation in multicellular eukaryotes is orchestrated by a number of DNA functional elements located at gene regulatory regions. Some regulatory regions (e.g. enhancers) are located far away from the gene they affect. Identification of distal regulatory elements is a challenge for the bioinformatics research. Although existing methodologies increased the number of computationally predicted enhancers, performance inconsistency of computational models across different cell-lines, class imbalance within the learning sets and ad hoc rules for selecting enhancer candidates for supervised learning, are some key questions that require further examination. In this study we developed DEEP, a novel ensemble prediction framework. DEEP integrates three components with diverse characteristics that streamline the analysis of enhancer's properties in a great variety of cellular conditions. In our method we train many individual classification models that we combine to classify DNA regions as enhancers or non-enhancers. DEEP uses features derived from histone modification marks or attributes coming from sequence characteristics. Experimental results indicate that DEEP performs better than four state-of-the-art methods on the ENCODE data. We report the first computational enhancer prediction results on FANTOM5 data where DEEP achieves 90.2% accuracy and 90% geometric mean (GM) of specificity and sensitivity across 36 different tissues. We further present results derived using in vivo-derived enhancer data from VISTA database. DEEP-VISTA, when tested on an independent test set, achieved GM of 80.1% and accuracy of 89.64%. DEEP framework is publicly available at http://cbrc.kaust.edu.sa/deep/.
format Online
Article
Text
id pubmed-4288148
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-42881482015-02-19 DEEP: a general computational framework for predicting enhancers Kleftogiannis, Dimitrios Kalnis, Panos Bajic, Vladimir B. Nucleic Acids Res Methods Online Transcription regulation in multicellular eukaryotes is orchestrated by a number of DNA functional elements located at gene regulatory regions. Some regulatory regions (e.g. enhancers) are located far away from the gene they affect. Identification of distal regulatory elements is a challenge for the bioinformatics research. Although existing methodologies increased the number of computationally predicted enhancers, performance inconsistency of computational models across different cell-lines, class imbalance within the learning sets and ad hoc rules for selecting enhancer candidates for supervised learning, are some key questions that require further examination. In this study we developed DEEP, a novel ensemble prediction framework. DEEP integrates three components with diverse characteristics that streamline the analysis of enhancer's properties in a great variety of cellular conditions. In our method we train many individual classification models that we combine to classify DNA regions as enhancers or non-enhancers. DEEP uses features derived from histone modification marks or attributes coming from sequence characteristics. Experimental results indicate that DEEP performs better than four state-of-the-art methods on the ENCODE data. We report the first computational enhancer prediction results on FANTOM5 data where DEEP achieves 90.2% accuracy and 90% geometric mean (GM) of specificity and sensitivity across 36 different tissues. We further present results derived using in vivo-derived enhancer data from VISTA database. DEEP-VISTA, when tested on an independent test set, achieved GM of 80.1% and accuracy of 89.64%. DEEP framework is publicly available at http://cbrc.kaust.edu.sa/deep/. Oxford University Press 2015-01-09 2014-11-05 /pmc/articles/PMC4288148/ /pubmed/25378307 http://dx.doi.org/10.1093/nar/gku1058 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Kleftogiannis, Dimitrios
Kalnis, Panos
Bajic, Vladimir B.
DEEP: a general computational framework for predicting enhancers
title DEEP: a general computational framework for predicting enhancers
title_full DEEP: a general computational framework for predicting enhancers
title_fullStr DEEP: a general computational framework for predicting enhancers
title_full_unstemmed DEEP: a general computational framework for predicting enhancers
title_short DEEP: a general computational framework for predicting enhancers
title_sort deep: a general computational framework for predicting enhancers
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288148/
https://www.ncbi.nlm.nih.gov/pubmed/25378307
http://dx.doi.org/10.1093/nar/gku1058
work_keys_str_mv AT kleftogiannisdimitrios deepageneralcomputationalframeworkforpredictingenhancers
AT kalnispanos deepageneralcomputationalframeworkforpredictingenhancers
AT bajicvladimirb deepageneralcomputationalframeworkforpredictingenhancers