Cargando…

hiHMM: Bayesian non-parametric joint inference of chromatin state maps

Motivation: Genome-wide mapping of chromatin states is essential for defining regulatory elements and inferring their activities in eukaryotic genomes. A number of hidden Markov model (HMM)-based methods have been developed to infer chromatin state maps from genome-wide histone modification data for...

Descripción completa

Detalles Bibliográficos
Autores principales: Sohn, Kyung-Ah, Ho, Joshua W. K., Djordjevic, Djordje, Jeong, Hyun-hwan, Park, Peter J., Kim, Ju Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4481846/
https://www.ncbi.nlm.nih.gov/pubmed/25725496
http://dx.doi.org/10.1093/bioinformatics/btv117
_version_ 1782378335016321024
author Sohn, Kyung-Ah
Ho, Joshua W. K.
Djordjevic, Djordje
Jeong, Hyun-hwan
Park, Peter J.
Kim, Ju Han
author_facet Sohn, Kyung-Ah
Ho, Joshua W. K.
Djordjevic, Djordje
Jeong, Hyun-hwan
Park, Peter J.
Kim, Ju Han
author_sort Sohn, Kyung-Ah
collection PubMed
description Motivation: Genome-wide mapping of chromatin states is essential for defining regulatory elements and inferring their activities in eukaryotic genomes. A number of hidden Markov model (HMM)-based methods have been developed to infer chromatin state maps from genome-wide histone modification data for an individual genome. To perform a principled comparison of evolutionarily distant epigenomes, we must consider species-specific biases such as differences in genome size, strength of signal enrichment and co-occurrence patterns of histone modifications. Results: Here, we present a new Bayesian non-parametric method called hierarchically linked infinite HMM (hiHMM) to jointly infer chromatin state maps in multiple genomes (different species, cell types and developmental stages) using genome-wide histone modification data. This flexible framework provides a new way to learn a consistent definition of chromatin states across multiple genomes, thus facilitating a direct comparison among them. We demonstrate the utility of this method using synthetic data as well as multiple modENCODE ChIP-seq datasets. Conclusion: The hierarchical and Bayesian non-parametric formulation in our approach is an important extension to the current set of methodologies for comparative chromatin landscape analysis. Availability and implementation: Source codes are available at https://github.com/kasohn/hiHMM. Chromatin data are available at http://encode-x.med.harvard.edu/data_sets/chromatin/. Contact: peter_park@harvard.edu or juhan@snu.ac.kr Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4481846
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44818462015-06-30 hiHMM: Bayesian non-parametric joint inference of chromatin state maps Sohn, Kyung-Ah Ho, Joshua W. K. Djordjevic, Djordje Jeong, Hyun-hwan Park, Peter J. Kim, Ju Han Bioinformatics Original Papers Motivation: Genome-wide mapping of chromatin states is essential for defining regulatory elements and inferring their activities in eukaryotic genomes. A number of hidden Markov model (HMM)-based methods have been developed to infer chromatin state maps from genome-wide histone modification data for an individual genome. To perform a principled comparison of evolutionarily distant epigenomes, we must consider species-specific biases such as differences in genome size, strength of signal enrichment and co-occurrence patterns of histone modifications. Results: Here, we present a new Bayesian non-parametric method called hierarchically linked infinite HMM (hiHMM) to jointly infer chromatin state maps in multiple genomes (different species, cell types and developmental stages) using genome-wide histone modification data. This flexible framework provides a new way to learn a consistent definition of chromatin states across multiple genomes, thus facilitating a direct comparison among them. We demonstrate the utility of this method using synthetic data as well as multiple modENCODE ChIP-seq datasets. Conclusion: The hierarchical and Bayesian non-parametric formulation in our approach is an important extension to the current set of methodologies for comparative chromatin landscape analysis. Availability and implementation: Source codes are available at https://github.com/kasohn/hiHMM. Chromatin data are available at http://encode-x.med.harvard.edu/data_sets/chromatin/. Contact: peter_park@harvard.edu or juhan@snu.ac.kr Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-07-01 2015-02-27 /pmc/articles/PMC4481846/ /pubmed/25725496 http://dx.doi.org/10.1093/bioinformatics/btv117 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Sohn, Kyung-Ah
Ho, Joshua W. K.
Djordjevic, Djordje
Jeong, Hyun-hwan
Park, Peter J.
Kim, Ju Han
hiHMM: Bayesian non-parametric joint inference of chromatin state maps
title hiHMM: Bayesian non-parametric joint inference of chromatin state maps
title_full hiHMM: Bayesian non-parametric joint inference of chromatin state maps
title_fullStr hiHMM: Bayesian non-parametric joint inference of chromatin state maps
title_full_unstemmed hiHMM: Bayesian non-parametric joint inference of chromatin state maps
title_short hiHMM: Bayesian non-parametric joint inference of chromatin state maps
title_sort hihmm: bayesian non-parametric joint inference of chromatin state maps
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4481846/
https://www.ncbi.nlm.nih.gov/pubmed/25725496
http://dx.doi.org/10.1093/bioinformatics/btv117
work_keys_str_mv AT sohnkyungah hihmmbayesiannonparametricjointinferenceofchromatinstatemaps
AT hojoshuawk hihmmbayesiannonparametricjointinferenceofchromatinstatemaps
AT djordjevicdjordje hihmmbayesiannonparametricjointinferenceofchromatinstatemaps
AT jeonghyunhwan hihmmbayesiannonparametricjointinferenceofchromatinstatemaps
AT parkpeterj hihmmbayesiannonparametricjointinferenceofchromatinstatemaps
AT kimjuhan hihmmbayesiannonparametricjointinferenceofchromatinstatemaps