Cargando…

Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data

BACKGROUND: ImmunoPrecipitation (IP) efficiencies may vary largely between different antibodies and between repeated experiments with the same antibody. These differences have a large impact on the quality of ChIP-seq data: a more efficient experiment will necessarily lead to a higher signal to back...

Descripción completa

Detalles Bibliográficos
Autores principales: Bao, Yanchun, Vinciotti, Veronica, Wit, Ernst, ’t Hoen, Peter AC
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3717085/
https://www.ncbi.nlm.nih.gov/pubmed/23721376
http://dx.doi.org/10.1186/1471-2105-14-169
_version_ 1782277655659282432
author Bao, Yanchun
Vinciotti, Veronica
Wit, Ernst
’t Hoen, Peter AC
author_facet Bao, Yanchun
Vinciotti, Veronica
Wit, Ernst
’t Hoen, Peter AC
author_sort Bao, Yanchun
collection PubMed
description BACKGROUND: ImmunoPrecipitation (IP) efficiencies may vary largely between different antibodies and between repeated experiments with the same antibody. These differences have a large impact on the quality of ChIP-seq data: a more efficient experiment will necessarily lead to a higher signal to background ratio, and therefore to an apparent larger number of enriched regions, compared to a less efficient experiment. In this paper, we show how IP efficiencies can be explicitly accounted for in the joint statistical modelling of ChIP-seq data. RESULTS: We fit a latent mixture model to eight experiments on two proteins, from two laboratories where different antibodies are used for the two proteins. We use the model parameters to estimate the efficiencies of individual experiments, and find that these are clearly different for the different laboratories, and amongst technical replicates from the same lab. When we account for ChIP efficiency, we find more regions bound in the more efficient experiments than in the less efficient ones, at the same false discovery rate. A priori knowledge of the same number of binding sites across experiments can also be included in the model for a more robust detection of differentially bound regions among two different proteins. CONCLUSIONS: We propose a statistical model for the detection of enriched and differentially bound regions from multiple ChIP-seq data sets. The framework that we present accounts explicitly for IP efficiencies in ChIP-seq data, and allows to model jointly, rather than individually, replicates and experiments from different proteins, leading to more robust biological conclusions.
format Online
Article
Text
id pubmed-3717085
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37170852013-07-23 Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data Bao, Yanchun Vinciotti, Veronica Wit, Ernst ’t Hoen, Peter AC BMC Bioinformatics Research Article BACKGROUND: ImmunoPrecipitation (IP) efficiencies may vary largely between different antibodies and between repeated experiments with the same antibody. These differences have a large impact on the quality of ChIP-seq data: a more efficient experiment will necessarily lead to a higher signal to background ratio, and therefore to an apparent larger number of enriched regions, compared to a less efficient experiment. In this paper, we show how IP efficiencies can be explicitly accounted for in the joint statistical modelling of ChIP-seq data. RESULTS: We fit a latent mixture model to eight experiments on two proteins, from two laboratories where different antibodies are used for the two proteins. We use the model parameters to estimate the efficiencies of individual experiments, and find that these are clearly different for the different laboratories, and amongst technical replicates from the same lab. When we account for ChIP efficiency, we find more regions bound in the more efficient experiments than in the less efficient ones, at the same false discovery rate. A priori knowledge of the same number of binding sites across experiments can also be included in the model for a more robust detection of differentially bound regions among two different proteins. CONCLUSIONS: We propose a statistical model for the detection of enriched and differentially bound regions from multiple ChIP-seq data sets. The framework that we present accounts explicitly for IP efficiencies in ChIP-seq data, and allows to model jointly, rather than individually, replicates and experiments from different proteins, leading to more robust biological conclusions. BioMed Central 2013-05-30 /pmc/articles/PMC3717085/ /pubmed/23721376 http://dx.doi.org/10.1186/1471-2105-14-169 Text en Copyright © 2013 Bao et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Bao, Yanchun
Vinciotti, Veronica
Wit, Ernst
’t Hoen, Peter AC
Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data
title Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data
title_full Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data
title_fullStr Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data
title_full_unstemmed Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data
title_short Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data
title_sort accounting for immunoprecipitation efficiencies in the statistical analysis of chip-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3717085/
https://www.ncbi.nlm.nih.gov/pubmed/23721376
http://dx.doi.org/10.1186/1471-2105-14-169
work_keys_str_mv AT baoyanchun accountingforimmunoprecipitationefficienciesinthestatisticalanalysisofchipseqdata
AT vinciottiveronica accountingforimmunoprecipitationefficienciesinthestatisticalanalysisofchipseqdata
AT witernst accountingforimmunoprecipitationefficienciesinthestatisticalanalysisofchipseqdata
AT thoenpeterac accountingforimmunoprecipitationefficienciesinthestatisticalanalysisofchipseqdata