Cargando…

In Silico Pooling of ChIP-seq Control Experiments

As next generation sequencing technologies are becoming more economical, large-scale ChIP-seq studies are enabling the investigation of the roles of transcription factor binding and epigenome on phenotypic variation. Studying such variation requires individual level ChIP-seq experiments. Standard de...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Guannan, Srinivasan, Rajini, Lopez-Anido, Camila, Hung, Holly A., Svaren, John, Keleş, Sündüz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4224375/
https://www.ncbi.nlm.nih.gov/pubmed/25380244
http://dx.doi.org/10.1371/journal.pone.0109691
_version_ 1782343335252328448
author Sun, Guannan
Srinivasan, Rajini
Lopez-Anido, Camila
Hung, Holly A.
Svaren, John
Keleş, Sündüz
author_facet Sun, Guannan
Srinivasan, Rajini
Lopez-Anido, Camila
Hung, Holly A.
Svaren, John
Keleş, Sündüz
author_sort Sun, Guannan
collection PubMed
description As next generation sequencing technologies are becoming more economical, large-scale ChIP-seq studies are enabling the investigation of the roles of transcription factor binding and epigenome on phenotypic variation. Studying such variation requires individual level ChIP-seq experiments. Standard designs for ChIP-seq experiments employ a paired control per ChIP-seq sample. Genomic coverage for control experiments is often sacrificed to increase the resources for ChIP samples. However, the quality of ChIP-enriched regions identifiable from a ChIP-seq experiment depends on the quality and the coverage of the control experiments. Insufficient coverage leads to loss of power in detecting enrichment. We investigate the effect of in silico pooling of control samples within multiple biological replicates, multiple treatment conditions, and multiple cell lines and tissues across multiple datasets with varying levels of genomic coverage. Our computational studies suggest guidelines for performing in silico pooling of control experiments. Using vast amounts of ENCODE data, we show that pairwise correlations between control samples originating from multiple biological replicates, treatments, and cell lines/tissues can be grouped into two classes representing whether or not in silico pooling leads to power gain in detecting enrichment between the ChIP and the control samples. Our findings have important implications for multiplexing samples.
format Online
Article
Text
id pubmed-4224375
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42243752014-11-18 In Silico Pooling of ChIP-seq Control Experiments Sun, Guannan Srinivasan, Rajini Lopez-Anido, Camila Hung, Holly A. Svaren, John Keleş, Sündüz PLoS One Research Article As next generation sequencing technologies are becoming more economical, large-scale ChIP-seq studies are enabling the investigation of the roles of transcription factor binding and epigenome on phenotypic variation. Studying such variation requires individual level ChIP-seq experiments. Standard designs for ChIP-seq experiments employ a paired control per ChIP-seq sample. Genomic coverage for control experiments is often sacrificed to increase the resources for ChIP samples. However, the quality of ChIP-enriched regions identifiable from a ChIP-seq experiment depends on the quality and the coverage of the control experiments. Insufficient coverage leads to loss of power in detecting enrichment. We investigate the effect of in silico pooling of control samples within multiple biological replicates, multiple treatment conditions, and multiple cell lines and tissues across multiple datasets with varying levels of genomic coverage. Our computational studies suggest guidelines for performing in silico pooling of control experiments. Using vast amounts of ENCODE data, we show that pairwise correlations between control samples originating from multiple biological replicates, treatments, and cell lines/tissues can be grouped into two classes representing whether or not in silico pooling leads to power gain in detecting enrichment between the ChIP and the control samples. Our findings have important implications for multiplexing samples. Public Library of Science 2014-11-07 /pmc/articles/PMC4224375/ /pubmed/25380244 http://dx.doi.org/10.1371/journal.pone.0109691 Text en © 2014 Sun et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Sun, Guannan
Srinivasan, Rajini
Lopez-Anido, Camila
Hung, Holly A.
Svaren, John
Keleş, Sündüz
In Silico Pooling of ChIP-seq Control Experiments
title In Silico Pooling of ChIP-seq Control Experiments
title_full In Silico Pooling of ChIP-seq Control Experiments
title_fullStr In Silico Pooling of ChIP-seq Control Experiments
title_full_unstemmed In Silico Pooling of ChIP-seq Control Experiments
title_short In Silico Pooling of ChIP-seq Control Experiments
title_sort in silico pooling of chip-seq control experiments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4224375/
https://www.ncbi.nlm.nih.gov/pubmed/25380244
http://dx.doi.org/10.1371/journal.pone.0109691
work_keys_str_mv AT sunguannan insilicopoolingofchipseqcontrolexperiments
AT srinivasanrajini insilicopoolingofchipseqcontrolexperiments
AT lopezanidocamila insilicopoolingofchipseqcontrolexperiments
AT hunghollya insilicopoolingofchipseqcontrolexperiments
AT svarenjohn insilicopoolingofchipseqcontrolexperiments
AT kelessunduz insilicopoolingofchipseqcontrolexperiments