Cargando…

Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle

DNA replication, transcription and repair involve the recruitment of protein complexes that change their composition as they progress along the genome in a directed or strand-specific manner. Chromatin immunoprecipitation in conjunction with hidden Markov models (HMMs) has been instrumental in under...

Descripción completa

Detalles Bibliográficos
Autores principales: Zacher, Benedikt, Lidschreiber, Michael, Cramer, Patrick, Gagneur, Julien, Tresch, Achim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BlackWell Publishing Ltd 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4300491/
https://www.ncbi.nlm.nih.gov/pubmed/25527639
http://dx.doi.org/10.15252/msb.20145654
_version_ 1782353531817164800
author Zacher, Benedikt
Lidschreiber, Michael
Cramer, Patrick
Gagneur, Julien
Tresch, Achim
author_facet Zacher, Benedikt
Lidschreiber, Michael
Cramer, Patrick
Gagneur, Julien
Tresch, Achim
author_sort Zacher, Benedikt
collection PubMed
description DNA replication, transcription and repair involve the recruitment of protein complexes that change their composition as they progress along the genome in a directed or strand-specific manner. Chromatin immunoprecipitation in conjunction with hidden Markov models (HMMs) has been instrumental in understanding these processes, as they segment the genome into discrete states that can be related to DNA-associated protein complexes. However, current HMM-based approaches are not able to assign forward or reverse direction to states or properly integrate strand-specific (e.g., RNA expression) with non-strand-specific (e.g., ChIP) data, which is indispensable to accurately characterize directed processes. To overcome these limitations, we introduce bidirectional HMMs which infer directed genomic states from occupancy profiles de novo. Application to RNA polymerase II-associated factors in yeast and chromatin modifications in human T cells recovers the majority of transcribed loci, reveals gene-specific variations in the yeast transcription cycle and indicates the existence of directed chromatin state patterns at transcribed, but not at repressed, regions in the human genome. In yeast, we identify 32 new transcribed loci, a regulated initiation–elongation transition, the absence of elongation factors Ctk1 and Paf1 from a class of genes, a distinct transcription mechanism for highly expressed genes and novel DNA sequence motifs associated with transcription termination. We anticipate bidirectional HMMs to significantly improve the analyses of genome-associated directed processes.
format Online
Article
Text
id pubmed-4300491
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BlackWell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-43004912015-01-23 Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle Zacher, Benedikt Lidschreiber, Michael Cramer, Patrick Gagneur, Julien Tresch, Achim Mol Syst Biol Articles DNA replication, transcription and repair involve the recruitment of protein complexes that change their composition as they progress along the genome in a directed or strand-specific manner. Chromatin immunoprecipitation in conjunction with hidden Markov models (HMMs) has been instrumental in understanding these processes, as they segment the genome into discrete states that can be related to DNA-associated protein complexes. However, current HMM-based approaches are not able to assign forward or reverse direction to states or properly integrate strand-specific (e.g., RNA expression) with non-strand-specific (e.g., ChIP) data, which is indispensable to accurately characterize directed processes. To overcome these limitations, we introduce bidirectional HMMs which infer directed genomic states from occupancy profiles de novo. Application to RNA polymerase II-associated factors in yeast and chromatin modifications in human T cells recovers the majority of transcribed loci, reveals gene-specific variations in the yeast transcription cycle and indicates the existence of directed chromatin state patterns at transcribed, but not at repressed, regions in the human genome. In yeast, we identify 32 new transcribed loci, a regulated initiation–elongation transition, the absence of elongation factors Ctk1 and Paf1 from a class of genes, a distinct transcription mechanism for highly expressed genes and novel DNA sequence motifs associated with transcription termination. We anticipate bidirectional HMMs to significantly improve the analyses of genome-associated directed processes. BlackWell Publishing Ltd 2014-12-20 /pmc/articles/PMC4300491/ /pubmed/25527639 http://dx.doi.org/10.15252/msb.20145654 Text en © 2014 The Authors. Published under the terms of the CC BY 4.0 license http://creativecommons.org/licenses/by/4.0/ This is an open access article under the terms of the Creative Commons Attribution 4.0 License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Zacher, Benedikt
Lidschreiber, Michael
Cramer, Patrick
Gagneur, Julien
Tresch, Achim
Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle
title Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle
title_full Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle
title_fullStr Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle
title_full_unstemmed Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle
title_short Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle
title_sort annotation of genomics data using bidirectional hidden markov models unveils variations in pol ii transcription cycle
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4300491/
https://www.ncbi.nlm.nih.gov/pubmed/25527639
http://dx.doi.org/10.15252/msb.20145654
work_keys_str_mv AT zacherbenedikt annotationofgenomicsdatausingbidirectionalhiddenmarkovmodelsunveilsvariationsinpoliitranscriptioncycle
AT lidschreibermichael annotationofgenomicsdatausingbidirectionalhiddenmarkovmodelsunveilsvariationsinpoliitranscriptioncycle
AT cramerpatrick annotationofgenomicsdatausingbidirectionalhiddenmarkovmodelsunveilsvariationsinpoliitranscriptioncycle
AT gagneurjulien annotationofgenomicsdatausingbidirectionalhiddenmarkovmodelsunveilsvariationsinpoliitranscriptioncycle
AT treschachim annotationofgenomicsdatausingbidirectionalhiddenmarkovmodelsunveilsvariationsinpoliitranscriptioncycle