Cargando…
A permutation-based non-parametric analysis of CRISPR screen data
BACKGROUND: Clustered regularly-interspaced short palindromic repeats (CRISPR) screens are usually implemented in cultured cells to identify genes with critical functions. Although several methods have been developed or adapted to analyze CRISPR screening data, no single specific algorithm has gaine...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5518132/ https://www.ncbi.nlm.nih.gov/pubmed/28724352 http://dx.doi.org/10.1186/s12864-017-3938-5 |
_version_ | 1783251432194965504 |
---|---|
author | Jia, Gaoxiang Wang, Xinlei Xiao, Guanghua |
author_facet | Jia, Gaoxiang Wang, Xinlei Xiao, Guanghua |
author_sort | Jia, Gaoxiang |
collection | PubMed |
description | BACKGROUND: Clustered regularly-interspaced short palindromic repeats (CRISPR) screens are usually implemented in cultured cells to identify genes with critical functions. Although several methods have been developed or adapted to analyze CRISPR screening data, no single specific algorithm has gained popularity. Thus, rigorous procedures are needed to overcome the shortcomings of existing algorithms. METHODS: We developed a Permutation-Based Non-Parametric Analysis (PBNPA) algorithm, which computes p-values at the gene level by permuting sgRNA labels, and thus it avoids restrictive distributional assumptions. Although PBNPA is designed to analyze CRISPR data, it can also be applied to analyze genetic screens implemented with siRNAs or shRNAs and drug screens. RESULTS: We compared the performance of PBNPA with competing methods on simulated data as well as on real data. PBNPA outperformed recent methods designed for CRISPR screen analysis, as well as methods used for analyzing other functional genomics screens, in terms of Receiver Operating Characteristics (ROC) curves and False Discovery Rate (FDR) control for simulated data under various settings. Remarkably, the PBNPA algorithm showed better consistency and FDR control on published real data as well. CONCLUSIONS: PBNPA yields more consistent and reliable results than its competitors, especially when the data quality is low. R package of PBNPA is available at: https://cran.r-project.org/web/packages/PBNPA/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3938-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5518132 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-55181322017-08-16 A permutation-based non-parametric analysis of CRISPR screen data Jia, Gaoxiang Wang, Xinlei Xiao, Guanghua BMC Genomics Methodology Article BACKGROUND: Clustered regularly-interspaced short palindromic repeats (CRISPR) screens are usually implemented in cultured cells to identify genes with critical functions. Although several methods have been developed or adapted to analyze CRISPR screening data, no single specific algorithm has gained popularity. Thus, rigorous procedures are needed to overcome the shortcomings of existing algorithms. METHODS: We developed a Permutation-Based Non-Parametric Analysis (PBNPA) algorithm, which computes p-values at the gene level by permuting sgRNA labels, and thus it avoids restrictive distributional assumptions. Although PBNPA is designed to analyze CRISPR data, it can also be applied to analyze genetic screens implemented with siRNAs or shRNAs and drug screens. RESULTS: We compared the performance of PBNPA with competing methods on simulated data as well as on real data. PBNPA outperformed recent methods designed for CRISPR screen analysis, as well as methods used for analyzing other functional genomics screens, in terms of Receiver Operating Characteristics (ROC) curves and False Discovery Rate (FDR) control for simulated data under various settings. Remarkably, the PBNPA algorithm showed better consistency and FDR control on published real data as well. CONCLUSIONS: PBNPA yields more consistent and reliable results than its competitors, especially when the data quality is low. R package of PBNPA is available at: https://cran.r-project.org/web/packages/PBNPA/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3938-5) contains supplementary material, which is available to authorized users. BioMed Central 2017-07-19 /pmc/articles/PMC5518132/ /pubmed/28724352 http://dx.doi.org/10.1186/s12864-017-3938-5 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Jia, Gaoxiang Wang, Xinlei Xiao, Guanghua A permutation-based non-parametric analysis of CRISPR screen data |
title | A permutation-based non-parametric analysis of CRISPR screen data |
title_full | A permutation-based non-parametric analysis of CRISPR screen data |
title_fullStr | A permutation-based non-parametric analysis of CRISPR screen data |
title_full_unstemmed | A permutation-based non-parametric analysis of CRISPR screen data |
title_short | A permutation-based non-parametric analysis of CRISPR screen data |
title_sort | permutation-based non-parametric analysis of crispr screen data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5518132/ https://www.ncbi.nlm.nih.gov/pubmed/28724352 http://dx.doi.org/10.1186/s12864-017-3938-5 |
work_keys_str_mv | AT jiagaoxiang apermutationbasednonparametricanalysisofcrisprscreendata AT wangxinlei apermutationbasednonparametricanalysisofcrisprscreendata AT xiaoguanghua apermutationbasednonparametricanalysisofcrisprscreendata AT jiagaoxiang permutationbasednonparametricanalysisofcrisprscreendata AT wangxinlei permutationbasednonparametricanalysisofcrisprscreendata AT xiaoguanghua permutationbasednonparametricanalysisofcrisprscreendata |