Cargando…

An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding

Regulatory proteins can bind to different sets of genomic targets in various cell types or conditions. To reliably characterize such condition-specific regulatory binding we introduce MultiGPS, an integrated machine learning approach for the analysis of multiple related ChIP-seq experiments. MultiGP...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahony, Shaun, Edwards, Matthew D., Mazzoni, Esteban O., Sherwood, Richard I., Kakumanu, Akshay, Morrison, Carolyn A., Wichterle, Hynek, Gifford, David K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3967921/
https://www.ncbi.nlm.nih.gov/pubmed/24675637
http://dx.doi.org/10.1371/journal.pcbi.1003501
_version_ 1782309076701544448
author Mahony, Shaun
Edwards, Matthew D.
Mazzoni, Esteban O.
Sherwood, Richard I.
Kakumanu, Akshay
Morrison, Carolyn A.
Wichterle, Hynek
Gifford, David K.
author_facet Mahony, Shaun
Edwards, Matthew D.
Mazzoni, Esteban O.
Sherwood, Richard I.
Kakumanu, Akshay
Morrison, Carolyn A.
Wichterle, Hynek
Gifford, David K.
author_sort Mahony, Shaun
collection PubMed
description Regulatory proteins can bind to different sets of genomic targets in various cell types or conditions. To reliably characterize such condition-specific regulatory binding we introduce MultiGPS, an integrated machine learning approach for the analysis of multiple related ChIP-seq experiments. MultiGPS is based on a generalized Expectation Maximization framework that shares information across multiple experiments for binding event discovery. We demonstrate that our framework enables the simultaneous modeling of sparse condition-specific binding changes, sequence dependence, and replicate-specific noise sources. MultiGPS encourages consistency in reported binding event locations across multiple-condition ChIP-seq datasets and provides accurate estimation of ChIP enrichment levels at each event. MultiGPS's multi-experiment modeling approach thus provides a reliable platform for detecting differential binding enrichment across experimental conditions. We demonstrate the advantages of MultiGPS with an analysis of Cdx2 binding in three distinct developmental contexts. By accurately characterizing condition-specific Cdx2 binding, MultiGPS enables novel insight into the mechanistic basis of Cdx2 site selectivity. Specifically, the condition-specific Cdx2 sites characterized by MultiGPS are highly associated with pre-existing genomic context, suggesting that such sites are pre-determined by cell-specific regulatory architecture. However, MultiGPS-defined condition-independent sites are not predicted by pre-existing regulatory signals, suggesting that Cdx2 can bind to a subset of locations regardless of genomic environment. A summary of this paper appears in the proceedings of the RECOMB 2014 conference, April 2–5.
format Online
Article
Text
id pubmed-3967921
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39679212014-04-01 An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding Mahony, Shaun Edwards, Matthew D. Mazzoni, Esteban O. Sherwood, Richard I. Kakumanu, Akshay Morrison, Carolyn A. Wichterle, Hynek Gifford, David K. PLoS Comput Biol Research Article Regulatory proteins can bind to different sets of genomic targets in various cell types or conditions. To reliably characterize such condition-specific regulatory binding we introduce MultiGPS, an integrated machine learning approach for the analysis of multiple related ChIP-seq experiments. MultiGPS is based on a generalized Expectation Maximization framework that shares information across multiple experiments for binding event discovery. We demonstrate that our framework enables the simultaneous modeling of sparse condition-specific binding changes, sequence dependence, and replicate-specific noise sources. MultiGPS encourages consistency in reported binding event locations across multiple-condition ChIP-seq datasets and provides accurate estimation of ChIP enrichment levels at each event. MultiGPS's multi-experiment modeling approach thus provides a reliable platform for detecting differential binding enrichment across experimental conditions. We demonstrate the advantages of MultiGPS with an analysis of Cdx2 binding in three distinct developmental contexts. By accurately characterizing condition-specific Cdx2 binding, MultiGPS enables novel insight into the mechanistic basis of Cdx2 site selectivity. Specifically, the condition-specific Cdx2 sites characterized by MultiGPS are highly associated with pre-existing genomic context, suggesting that such sites are pre-determined by cell-specific regulatory architecture. However, MultiGPS-defined condition-independent sites are not predicted by pre-existing regulatory signals, suggesting that Cdx2 can bind to a subset of locations regardless of genomic environment. A summary of this paper appears in the proceedings of the RECOMB 2014 conference, April 2–5. Public Library of Science 2014-03-27 /pmc/articles/PMC3967921/ /pubmed/24675637 http://dx.doi.org/10.1371/journal.pcbi.1003501 Text en © 2014 Mahony et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mahony, Shaun
Edwards, Matthew D.
Mazzoni, Esteban O.
Sherwood, Richard I.
Kakumanu, Akshay
Morrison, Carolyn A.
Wichterle, Hynek
Gifford, David K.
An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding
title An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding
title_full An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding
title_fullStr An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding
title_full_unstemmed An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding
title_short An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding
title_sort integrated model of multiple-condition chip-seq data reveals predeterminants of cdx2 binding
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3967921/
https://www.ncbi.nlm.nih.gov/pubmed/24675637
http://dx.doi.org/10.1371/journal.pcbi.1003501
work_keys_str_mv AT mahonyshaun anintegratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT edwardsmatthewd anintegratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT mazzoniestebano anintegratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT sherwoodrichardi anintegratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT kakumanuakshay anintegratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT morrisoncarolyna anintegratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT wichterlehynek anintegratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT gifforddavidk anintegratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT mahonyshaun integratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT edwardsmatthewd integratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT mazzoniestebano integratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT sherwoodrichardi integratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT kakumanuakshay integratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT morrisoncarolyna integratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT wichterlehynek integratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding
AT gifforddavidk integratedmodelofmultipleconditionchipseqdatarevealspredeterminantsofcdx2binding