Cargando…

Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN

Accurate maps of promoters and enhancers are required for understanding transcriptional regulation. Promoters and enhancers are usually mapped by integration of chromatin assays charting histone modifications, DNA accessibility, and transcription factor binding. However, current algorithms are limit...

Descripción completa

Detalles Bibliográficos
Autores principales: Zacher, Benedikt, Michel, Margaux, Schwalb, Björn, Cramer, Patrick, Tresch, Achim, Gagneur, Julien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5215863/
https://www.ncbi.nlm.nih.gov/pubmed/28056037
http://dx.doi.org/10.1371/journal.pone.0169249
_version_ 1782491827080790016
author Zacher, Benedikt
Michel, Margaux
Schwalb, Björn
Cramer, Patrick
Tresch, Achim
Gagneur, Julien
author_facet Zacher, Benedikt
Michel, Margaux
Schwalb, Björn
Cramer, Patrick
Tresch, Achim
Gagneur, Julien
author_sort Zacher, Benedikt
collection PubMed
description Accurate maps of promoters and enhancers are required for understanding transcriptional regulation. Promoters and enhancers are usually mapped by integration of chromatin assays charting histone modifications, DNA accessibility, and transcription factor binding. However, current algorithms are limited by unrealistic data distribution assumptions. Here we propose GenoSTAN (Genomic STate ANnotation), a hidden Markov model overcoming these limitations. We map promoters and enhancers for 127 cell types and tissues from the ENCODE and Roadmap Epigenomics projects, today’s largest compendium of chromatin assays. Extensive benchmarks demonstrate that GenoSTAN generally identifies promoters and enhancers with significantly higher accuracy than previous methods. Moreover, GenoSTAN-derived promoters and enhancers showed significantly higher enrichment of complex trait-associated genetic variants than current annotations. Altogether, GenoSTAN provides an easy-to-use tool to define promoters and enhancers in any system, and our annotation of human transcriptional cis-regulatory elements constitutes a rich resource for future research in biology and medicine.
format Online
Article
Text
id pubmed-5215863
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-52158632017-01-19 Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN Zacher, Benedikt Michel, Margaux Schwalb, Björn Cramer, Patrick Tresch, Achim Gagneur, Julien PLoS One Research Article Accurate maps of promoters and enhancers are required for understanding transcriptional regulation. Promoters and enhancers are usually mapped by integration of chromatin assays charting histone modifications, DNA accessibility, and transcription factor binding. However, current algorithms are limited by unrealistic data distribution assumptions. Here we propose GenoSTAN (Genomic STate ANnotation), a hidden Markov model overcoming these limitations. We map promoters and enhancers for 127 cell types and tissues from the ENCODE and Roadmap Epigenomics projects, today’s largest compendium of chromatin assays. Extensive benchmarks demonstrate that GenoSTAN generally identifies promoters and enhancers with significantly higher accuracy than previous methods. Moreover, GenoSTAN-derived promoters and enhancers showed significantly higher enrichment of complex trait-associated genetic variants than current annotations. Altogether, GenoSTAN provides an easy-to-use tool to define promoters and enhancers in any system, and our annotation of human transcriptional cis-regulatory elements constitutes a rich resource for future research in biology and medicine. Public Library of Science 2017-01-05 /pmc/articles/PMC5215863/ /pubmed/28056037 http://dx.doi.org/10.1371/journal.pone.0169249 Text en © 2017 Zacher et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zacher, Benedikt
Michel, Margaux
Schwalb, Björn
Cramer, Patrick
Tresch, Achim
Gagneur, Julien
Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN
title Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN
title_full Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN
title_fullStr Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN
title_full_unstemmed Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN
title_short Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN
title_sort accurate promoter and enhancer identification in 127 encode and roadmap epigenomics cell types and tissues by genostan
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5215863/
https://www.ncbi.nlm.nih.gov/pubmed/28056037
http://dx.doi.org/10.1371/journal.pone.0169249
work_keys_str_mv AT zacherbenedikt accuratepromoterandenhanceridentificationin127encodeandroadmapepigenomicscelltypesandtissuesbygenostan
AT michelmargaux accuratepromoterandenhanceridentificationin127encodeandroadmapepigenomicscelltypesandtissuesbygenostan
AT schwalbbjorn accuratepromoterandenhanceridentificationin127encodeandroadmapepigenomicscelltypesandtissuesbygenostan
AT cramerpatrick accuratepromoterandenhanceridentificationin127encodeandroadmapepigenomicscelltypesandtissuesbygenostan
AT treschachim accuratepromoterandenhanceridentificationin127encodeandroadmapepigenomicscelltypesandtissuesbygenostan
AT gagneurjulien accuratepromoterandenhanceridentificationin127encodeandroadmapepigenomicscelltypesandtissuesbygenostan