Cargando…

De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly

A common aim in ChIP-seq experiments is to identify changes in protein binding patterns between conditions, i.e. differential binding. A number of peak- and window-based strategies have been developed to detect differential binding when the regions of interest are not known in advance. However, care...

Descripción completa

Detalles Bibliográficos
Autores principales: Lun, Aaron T.L., Smyth, Gordon K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4066778/
https://www.ncbi.nlm.nih.gov/pubmed/24852250
http://dx.doi.org/10.1093/nar/gku351
_version_ 1782322213979947008
author Lun, Aaron T.L.
Smyth, Gordon K.
author_facet Lun, Aaron T.L.
Smyth, Gordon K.
author_sort Lun, Aaron T.L.
collection PubMed
description A common aim in ChIP-seq experiments is to identify changes in protein binding patterns between conditions, i.e. differential binding. A number of peak- and window-based strategies have been developed to detect differential binding when the regions of interest are not known in advance. However, careful consideration of error control is needed when applying these methods. Peak-based approaches use the same data set to define peaks and to detect differential binding. Done improperly, this can result in loss of type I error control. For window-based methods, controlling the false discovery rate over all detected windows does not guarantee control across all detected regions. Misinterpreting the former as the latter can result in unexpected liberalness. Here, several solutions are presented to maintain error control for these de novo counting strategies. For peak-based methods, peak calling should be performed on pooled libraries prior to the statistical analysis. For window-based methods, a hybrid approach using Simes’ method is proposed to maintain control of the false discovery rate across regions. More generally, the relative advantages of peak- and window-based strategies are explored using a range of simulated and real data sets. Implementations of both strategies also compare favourably to existing programs for differential binding analyses.
format Online
Article
Text
id pubmed-4066778
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40667782014-06-24 De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly Lun, Aaron T.L. Smyth, Gordon K. Nucleic Acids Res Methods Online A common aim in ChIP-seq experiments is to identify changes in protein binding patterns between conditions, i.e. differential binding. A number of peak- and window-based strategies have been developed to detect differential binding when the regions of interest are not known in advance. However, careful consideration of error control is needed when applying these methods. Peak-based approaches use the same data set to define peaks and to detect differential binding. Done improperly, this can result in loss of type I error control. For window-based methods, controlling the false discovery rate over all detected windows does not guarantee control across all detected regions. Misinterpreting the former as the latter can result in unexpected liberalness. Here, several solutions are presented to maintain error control for these de novo counting strategies. For peak-based methods, peak calling should be performed on pooled libraries prior to the statistical analysis. For window-based methods, a hybrid approach using Simes’ method is proposed to maintain control of the false discovery rate across regions. More generally, the relative advantages of peak- and window-based strategies are explored using a range of simulated and real data sets. Implementations of both strategies also compare favourably to existing programs for differential binding analyses. Oxford University Press 2014-07-01 2014-05-22 /pmc/articles/PMC4066778/ /pubmed/24852250 http://dx.doi.org/10.1093/nar/gku351 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Lun, Aaron T.L.
Smyth, Gordon K.
De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly
title De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly
title_full De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly
title_fullStr De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly
title_full_unstemmed De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly
title_short De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly
title_sort de novo detection of differentially bound regions for chip-seq data using peaks and windows: controlling error rates correctly
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4066778/
https://www.ncbi.nlm.nih.gov/pubmed/24852250
http://dx.doi.org/10.1093/nar/gku351
work_keys_str_mv AT lunaarontl denovodetectionofdifferentiallyboundregionsforchipseqdatausingpeaksandwindowscontrollingerrorratescorrectly
AT smythgordonk denovodetectionofdifferentiallyboundregionsforchipseqdatausingpeaksandwindowscontrollingerrorratescorrectly