Cargando…

A flexible ChIP-sequencing simulation toolkit

BACKGROUND: A major challenge in evaluating quantitative ChIP-seq analyses, such as peak calling and differential binding, is a lack of reliable ground truth data. Accurate simulation of ChIP-seq data can mitigate this challenge, but existing frameworks are either too cumbersome to apply genome-wide...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, An, Lamkin, Michael, Qiu, Yutong, Ren, Kevin, Goren, Alon, Gymrek, Melissa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8056602/
https://www.ncbi.nlm.nih.gov/pubmed/33879052
http://dx.doi.org/10.1186/s12859-021-04097-5
_version_ 1783680680665808896
author Zheng, An
Lamkin, Michael
Qiu, Yutong
Ren, Kevin
Goren, Alon
Gymrek, Melissa
author_facet Zheng, An
Lamkin, Michael
Qiu, Yutong
Ren, Kevin
Goren, Alon
Gymrek, Melissa
author_sort Zheng, An
collection PubMed
description BACKGROUND: A major challenge in evaluating quantitative ChIP-seq analyses, such as peak calling and differential binding, is a lack of reliable ground truth data. Accurate simulation of ChIP-seq data can mitigate this challenge, but existing frameworks are either too cumbersome to apply genome-wide or unable to model a number of important experimental conditions in ChIP-seq. RESULTS: We present ChIPs, a toolkit for rapidly simulating ChIP-seq data using statistical models of key experimental steps. We demonstrate how ChIPs can be used for a range of applications, including benchmarking analysis tools and evaluating the impact of various experimental parameters. ChIPs is implemented as a standalone command-line program written in C++ and is available from https://github.com/gymreklab/chips. CONCLUSIONS: ChIPs is an efficient ChIP-seq simulation framework that generates realistic datasets over a flexible range of experimental conditions. It can serve as an important component in various ChIP-seq analyses where ground truth data are needed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04097-5.
format Online
Article
Text
id pubmed-8056602
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-80566022021-04-20 A flexible ChIP-sequencing simulation toolkit Zheng, An Lamkin, Michael Qiu, Yutong Ren, Kevin Goren, Alon Gymrek, Melissa BMC Bioinformatics Software BACKGROUND: A major challenge in evaluating quantitative ChIP-seq analyses, such as peak calling and differential binding, is a lack of reliable ground truth data. Accurate simulation of ChIP-seq data can mitigate this challenge, but existing frameworks are either too cumbersome to apply genome-wide or unable to model a number of important experimental conditions in ChIP-seq. RESULTS: We present ChIPs, a toolkit for rapidly simulating ChIP-seq data using statistical models of key experimental steps. We demonstrate how ChIPs can be used for a range of applications, including benchmarking analysis tools and evaluating the impact of various experimental parameters. ChIPs is implemented as a standalone command-line program written in C++ and is available from https://github.com/gymreklab/chips. CONCLUSIONS: ChIPs is an efficient ChIP-seq simulation framework that generates realistic datasets over a flexible range of experimental conditions. It can serve as an important component in various ChIP-seq analyses where ground truth data are needed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04097-5. BioMed Central 2021-04-20 /pmc/articles/PMC8056602/ /pubmed/33879052 http://dx.doi.org/10.1186/s12859-021-04097-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Zheng, An
Lamkin, Michael
Qiu, Yutong
Ren, Kevin
Goren, Alon
Gymrek, Melissa
A flexible ChIP-sequencing simulation toolkit
title A flexible ChIP-sequencing simulation toolkit
title_full A flexible ChIP-sequencing simulation toolkit
title_fullStr A flexible ChIP-sequencing simulation toolkit
title_full_unstemmed A flexible ChIP-sequencing simulation toolkit
title_short A flexible ChIP-sequencing simulation toolkit
title_sort flexible chip-sequencing simulation toolkit
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8056602/
https://www.ncbi.nlm.nih.gov/pubmed/33879052
http://dx.doi.org/10.1186/s12859-021-04097-5
work_keys_str_mv AT zhengan aflexiblechipsequencingsimulationtoolkit
AT lamkinmichael aflexiblechipsequencingsimulationtoolkit
AT qiuyutong aflexiblechipsequencingsimulationtoolkit
AT renkevin aflexiblechipsequencingsimulationtoolkit
AT gorenalon aflexiblechipsequencingsimulationtoolkit
AT gymrekmelissa aflexiblechipsequencingsimulationtoolkit
AT zhengan flexiblechipsequencingsimulationtoolkit
AT lamkinmichael flexiblechipsequencingsimulationtoolkit
AT qiuyutong flexiblechipsequencingsimulationtoolkit
AT renkevin flexiblechipsequencingsimulationtoolkit
AT gorenalon flexiblechipsequencingsimulationtoolkit
AT gymrekmelissa flexiblechipsequencingsimulationtoolkit