Cargando…

peaksat: an R package for ChIP-seq peak saturation analysis

BACKGROUND: Epigenomic profiling assays such as ChIP-seq have been widely used to map the genome-wide enrichment profiles of chromatin-associated proteins and posttranslational histone modifications. Sequencing depth is a key parameter in experimental design and quality control. However, due to vari...

Descripción completa

Detalles Bibliográficos
Autores principales: Boyd, Joseph R, Gao, Cong, Quinn, Kathleen, Fritz, Andrew, Stein, Janet, Stein, Gary, Glass, Karen, Frietze, Seth
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9878872/
https://www.ncbi.nlm.nih.gov/pubmed/36698077
http://dx.doi.org/10.1186/s12864-023-09109-7
Descripción
Sumario:BACKGROUND: Epigenomic profiling assays such as ChIP-seq have been widely used to map the genome-wide enrichment profiles of chromatin-associated proteins and posttranslational histone modifications. Sequencing depth is a key parameter in experimental design and quality control. However, due to variable sequencing depth requirements across experimental conditions, it can be challenging to determine optimal sequencing depth, particularly for projects involving multiple targets or cell types. RESULTS: We developed the peaksat R package to provide target read depth estimates for epigenomic experiments based on the analysis of peak saturation curves. We applied peaksat to establish the distinctive read depth requirements for ChIP-seq studies of histone modifications in different cell lines. Using peaksat, we were able to estimate the target read depth required per library to obtain high-quality peak calls for downstream analysis. In addition, peaksat was applied to other sequence-enrichment methods including CUT&RUN and ATAC-seq. CONCLUSION: peaksat addresses a need for researchers to make informed decisions about whether their sequencing data has been generated to an adequate depth and subsequently sufficient meaningful peaks, and failing that, how many more reads would be required per library. peaksat is applicable to other sequence-based methods that include calling peaks in their analysis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09109-7.