Cargando…

Predicting enhancers in mammalian genomes using supervised hidden Markov models

BACKGROUND: Eukaryotic gene regulation is a complex process comprising the dynamic interaction of enhancers and promoters in order to activate gene expression. In recent years, research in regulatory genomics has contributed to a better understanding of the characteristics of promoter elements and f...

Descripción completa

Detalles Bibliográficos
Autores principales: Zehnder, Tobias, Benner, Philipp, Vingron, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6437899/
https://www.ncbi.nlm.nih.gov/pubmed/30917778
http://dx.doi.org/10.1186/s12859-019-2708-6
_version_ 1783407014229049344
author Zehnder, Tobias
Benner, Philipp
Vingron, Martin
author_facet Zehnder, Tobias
Benner, Philipp
Vingron, Martin
author_sort Zehnder, Tobias
collection PubMed
description BACKGROUND: Eukaryotic gene regulation is a complex process comprising the dynamic interaction of enhancers and promoters in order to activate gene expression. In recent years, research in regulatory genomics has contributed to a better understanding of the characteristics of promoter elements and for most sequenced model organism genomes there exist comprehensive and reliable promoter annotations. For enhancers, however, a reliable description of their characteristics and location has so far proven to be elusive. With the development of high-throughput methods such as ChIP-seq, large amounts of data about epigenetic conditions have become available, and many existing methods use the information on chromatin accessibility or histone modifications to train classifiers in order to segment the genome into functional groups such as enhancers and promoters. However, these methods often do not consider prior biological knowledge about enhancers such as their diverse lengths or molecular structure. RESULTS: We developed enhancer HMM (eHMM), a supervised hidden Markov model designed to learn the molecular structure of promoters and enhancers. Both consist of a central stretch of accessible DNA flanked by nucleosomes with distinct histone modification patterns. We evaluated the performance of eHMM within and across cell types and developmental stages and found that eHMM successfully predicts enhancers with high precision and recall comparable to state-of-the-art methods, and consistently outperforms those in terms of accuracy and resolution. CONCLUSIONS: eHMM predicts active enhancers based on data from chromatin accessibility assays and a minimal set of histone modification ChIP-seq experiments. In comparison to other ’black box’ methods its parameters are easy to interpret. eHMM can be used as a stand-alone tool for enhancer prediction without the need for additional training or a tuning of parameters. The high spatial precision of enhancer predictions gives valuable targets for potential knockout experiments or downstream analyses such as motif search.
format Online
Article
Text
id pubmed-6437899
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64378992019-04-08 Predicting enhancers in mammalian genomes using supervised hidden Markov models Zehnder, Tobias Benner, Philipp Vingron, Martin BMC Bioinformatics Methodology Article BACKGROUND: Eukaryotic gene regulation is a complex process comprising the dynamic interaction of enhancers and promoters in order to activate gene expression. In recent years, research in regulatory genomics has contributed to a better understanding of the characteristics of promoter elements and for most sequenced model organism genomes there exist comprehensive and reliable promoter annotations. For enhancers, however, a reliable description of their characteristics and location has so far proven to be elusive. With the development of high-throughput methods such as ChIP-seq, large amounts of data about epigenetic conditions have become available, and many existing methods use the information on chromatin accessibility or histone modifications to train classifiers in order to segment the genome into functional groups such as enhancers and promoters. However, these methods often do not consider prior biological knowledge about enhancers such as their diverse lengths or molecular structure. RESULTS: We developed enhancer HMM (eHMM), a supervised hidden Markov model designed to learn the molecular structure of promoters and enhancers. Both consist of a central stretch of accessible DNA flanked by nucleosomes with distinct histone modification patterns. We evaluated the performance of eHMM within and across cell types and developmental stages and found that eHMM successfully predicts enhancers with high precision and recall comparable to state-of-the-art methods, and consistently outperforms those in terms of accuracy and resolution. CONCLUSIONS: eHMM predicts active enhancers based on data from chromatin accessibility assays and a minimal set of histone modification ChIP-seq experiments. In comparison to other ’black box’ methods its parameters are easy to interpret. eHMM can be used as a stand-alone tool for enhancer prediction without the need for additional training or a tuning of parameters. The high spatial precision of enhancer predictions gives valuable targets for potential knockout experiments or downstream analyses such as motif search. BioMed Central 2019-03-27 /pmc/articles/PMC6437899/ /pubmed/30917778 http://dx.doi.org/10.1186/s12859-019-2708-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Zehnder, Tobias
Benner, Philipp
Vingron, Martin
Predicting enhancers in mammalian genomes using supervised hidden Markov models
title Predicting enhancers in mammalian genomes using supervised hidden Markov models
title_full Predicting enhancers in mammalian genomes using supervised hidden Markov models
title_fullStr Predicting enhancers in mammalian genomes using supervised hidden Markov models
title_full_unstemmed Predicting enhancers in mammalian genomes using supervised hidden Markov models
title_short Predicting enhancers in mammalian genomes using supervised hidden Markov models
title_sort predicting enhancers in mammalian genomes using supervised hidden markov models
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6437899/
https://www.ncbi.nlm.nih.gov/pubmed/30917778
http://dx.doi.org/10.1186/s12859-019-2708-6
work_keys_str_mv AT zehndertobias predictingenhancersinmammaliangenomesusingsupervisedhiddenmarkovmodels
AT bennerphilipp predictingenhancersinmammaliangenomesusingsupervisedhiddenmarkovmodels
AT vingronmartin predictingenhancersinmammaliangenomesusingsupervisedhiddenmarkovmodels