Cargando…
Predicting enhancers in mammalian genomes using supervised hidden Markov models
BACKGROUND: Eukaryotic gene regulation is a complex process comprising the dynamic interaction of enhancers and promoters in order to activate gene expression. In recent years, research in regulatory genomics has contributed to a better understanding of the characteristics of promoter elements and f...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6437899/ https://www.ncbi.nlm.nih.gov/pubmed/30917778 http://dx.doi.org/10.1186/s12859-019-2708-6 |
_version_ | 1783407014229049344 |
---|---|
author | Zehnder, Tobias Benner, Philipp Vingron, Martin |
author_facet | Zehnder, Tobias Benner, Philipp Vingron, Martin |
author_sort | Zehnder, Tobias |
collection | PubMed |
description | BACKGROUND: Eukaryotic gene regulation is a complex process comprising the dynamic interaction of enhancers and promoters in order to activate gene expression. In recent years, research in regulatory genomics has contributed to a better understanding of the characteristics of promoter elements and for most sequenced model organism genomes there exist comprehensive and reliable promoter annotations. For enhancers, however, a reliable description of their characteristics and location has so far proven to be elusive. With the development of high-throughput methods such as ChIP-seq, large amounts of data about epigenetic conditions have become available, and many existing methods use the information on chromatin accessibility or histone modifications to train classifiers in order to segment the genome into functional groups such as enhancers and promoters. However, these methods often do not consider prior biological knowledge about enhancers such as their diverse lengths or molecular structure. RESULTS: We developed enhancer HMM (eHMM), a supervised hidden Markov model designed to learn the molecular structure of promoters and enhancers. Both consist of a central stretch of accessible DNA flanked by nucleosomes with distinct histone modification patterns. We evaluated the performance of eHMM within and across cell types and developmental stages and found that eHMM successfully predicts enhancers with high precision and recall comparable to state-of-the-art methods, and consistently outperforms those in terms of accuracy and resolution. CONCLUSIONS: eHMM predicts active enhancers based on data from chromatin accessibility assays and a minimal set of histone modification ChIP-seq experiments. In comparison to other ’black box’ methods its parameters are easy to interpret. eHMM can be used as a stand-alone tool for enhancer prediction without the need for additional training or a tuning of parameters. The high spatial precision of enhancer predictions gives valuable targets for potential knockout experiments or downstream analyses such as motif search. |
format | Online Article Text |
id | pubmed-6437899 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-64378992019-04-08 Predicting enhancers in mammalian genomes using supervised hidden Markov models Zehnder, Tobias Benner, Philipp Vingron, Martin BMC Bioinformatics Methodology Article BACKGROUND: Eukaryotic gene regulation is a complex process comprising the dynamic interaction of enhancers and promoters in order to activate gene expression. In recent years, research in regulatory genomics has contributed to a better understanding of the characteristics of promoter elements and for most sequenced model organism genomes there exist comprehensive and reliable promoter annotations. For enhancers, however, a reliable description of their characteristics and location has so far proven to be elusive. With the development of high-throughput methods such as ChIP-seq, large amounts of data about epigenetic conditions have become available, and many existing methods use the information on chromatin accessibility or histone modifications to train classifiers in order to segment the genome into functional groups such as enhancers and promoters. However, these methods often do not consider prior biological knowledge about enhancers such as their diverse lengths or molecular structure. RESULTS: We developed enhancer HMM (eHMM), a supervised hidden Markov model designed to learn the molecular structure of promoters and enhancers. Both consist of a central stretch of accessible DNA flanked by nucleosomes with distinct histone modification patterns. We evaluated the performance of eHMM within and across cell types and developmental stages and found that eHMM successfully predicts enhancers with high precision and recall comparable to state-of-the-art methods, and consistently outperforms those in terms of accuracy and resolution. CONCLUSIONS: eHMM predicts active enhancers based on data from chromatin accessibility assays and a minimal set of histone modification ChIP-seq experiments. In comparison to other ’black box’ methods its parameters are easy to interpret. eHMM can be used as a stand-alone tool for enhancer prediction without the need for additional training or a tuning of parameters. The high spatial precision of enhancer predictions gives valuable targets for potential knockout experiments or downstream analyses such as motif search. BioMed Central 2019-03-27 /pmc/articles/PMC6437899/ /pubmed/30917778 http://dx.doi.org/10.1186/s12859-019-2708-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Zehnder, Tobias Benner, Philipp Vingron, Martin Predicting enhancers in mammalian genomes using supervised hidden Markov models |
title | Predicting enhancers in mammalian genomes using supervised hidden Markov models |
title_full | Predicting enhancers in mammalian genomes using supervised hidden Markov models |
title_fullStr | Predicting enhancers in mammalian genomes using supervised hidden Markov models |
title_full_unstemmed | Predicting enhancers in mammalian genomes using supervised hidden Markov models |
title_short | Predicting enhancers in mammalian genomes using supervised hidden Markov models |
title_sort | predicting enhancers in mammalian genomes using supervised hidden markov models |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6437899/ https://www.ncbi.nlm.nih.gov/pubmed/30917778 http://dx.doi.org/10.1186/s12859-019-2708-6 |
work_keys_str_mv | AT zehndertobias predictingenhancersinmammaliangenomesusingsupervisedhiddenmarkovmodels AT bennerphilipp predictingenhancersinmammaliangenomesusingsupervisedhiddenmarkovmodels AT vingronmartin predictingenhancersinmammaliangenomesusingsupervisedhiddenmarkovmodels |