Cargando…

A permutation-based non-parametric analysis of CRISPR screen data

BACKGROUND: Clustered regularly-interspaced short palindromic repeats (CRISPR) screens are usually implemented in cultured cells to identify genes with critical functions. Although several methods have been developed or adapted to analyze CRISPR screening data, no single specific algorithm has gaine...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Gaoxiang, Wang, Xinlei, Xiao, Guanghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5518132/
https://www.ncbi.nlm.nih.gov/pubmed/28724352
http://dx.doi.org/10.1186/s12864-017-3938-5
_version_ 1783251432194965504
author Jia, Gaoxiang
Wang, Xinlei
Xiao, Guanghua
author_facet Jia, Gaoxiang
Wang, Xinlei
Xiao, Guanghua
author_sort Jia, Gaoxiang
collection PubMed
description BACKGROUND: Clustered regularly-interspaced short palindromic repeats (CRISPR) screens are usually implemented in cultured cells to identify genes with critical functions. Although several methods have been developed or adapted to analyze CRISPR screening data, no single specific algorithm has gained popularity. Thus, rigorous procedures are needed to overcome the shortcomings of existing algorithms. METHODS: We developed a Permutation-Based Non-Parametric Analysis (PBNPA) algorithm, which computes p-values at the gene level by permuting sgRNA labels, and thus it avoids restrictive distributional assumptions. Although PBNPA is designed to analyze CRISPR data, it can also be applied to analyze genetic screens implemented with siRNAs or shRNAs and drug screens. RESULTS: We compared the performance of PBNPA with competing methods on simulated data as well as on real data. PBNPA outperformed recent methods designed for CRISPR screen analysis, as well as methods used for analyzing other functional genomics screens, in terms of Receiver Operating Characteristics (ROC) curves and False Discovery Rate (FDR) control for simulated data under various settings. Remarkably, the PBNPA algorithm showed better consistency and FDR control on published real data as well. CONCLUSIONS: PBNPA yields more consistent and reliable results than its competitors, especially when the data quality is low. R package of PBNPA is available at: https://cran.r-project.org/web/packages/PBNPA/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3938-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5518132
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55181322017-08-16 A permutation-based non-parametric analysis of CRISPR screen data Jia, Gaoxiang Wang, Xinlei Xiao, Guanghua BMC Genomics Methodology Article BACKGROUND: Clustered regularly-interspaced short palindromic repeats (CRISPR) screens are usually implemented in cultured cells to identify genes with critical functions. Although several methods have been developed or adapted to analyze CRISPR screening data, no single specific algorithm has gained popularity. Thus, rigorous procedures are needed to overcome the shortcomings of existing algorithms. METHODS: We developed a Permutation-Based Non-Parametric Analysis (PBNPA) algorithm, which computes p-values at the gene level by permuting sgRNA labels, and thus it avoids restrictive distributional assumptions. Although PBNPA is designed to analyze CRISPR data, it can also be applied to analyze genetic screens implemented with siRNAs or shRNAs and drug screens. RESULTS: We compared the performance of PBNPA with competing methods on simulated data as well as on real data. PBNPA outperformed recent methods designed for CRISPR screen analysis, as well as methods used for analyzing other functional genomics screens, in terms of Receiver Operating Characteristics (ROC) curves and False Discovery Rate (FDR) control for simulated data under various settings. Remarkably, the PBNPA algorithm showed better consistency and FDR control on published real data as well. CONCLUSIONS: PBNPA yields more consistent and reliable results than its competitors, especially when the data quality is low. R package of PBNPA is available at: https://cran.r-project.org/web/packages/PBNPA/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3938-5) contains supplementary material, which is available to authorized users. BioMed Central 2017-07-19 /pmc/articles/PMC5518132/ /pubmed/28724352 http://dx.doi.org/10.1186/s12864-017-3938-5 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Jia, Gaoxiang
Wang, Xinlei
Xiao, Guanghua
A permutation-based non-parametric analysis of CRISPR screen data
title A permutation-based non-parametric analysis of CRISPR screen data
title_full A permutation-based non-parametric analysis of CRISPR screen data
title_fullStr A permutation-based non-parametric analysis of CRISPR screen data
title_full_unstemmed A permutation-based non-parametric analysis of CRISPR screen data
title_short A permutation-based non-parametric analysis of CRISPR screen data
title_sort permutation-based non-parametric analysis of crispr screen data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5518132/
https://www.ncbi.nlm.nih.gov/pubmed/28724352
http://dx.doi.org/10.1186/s12864-017-3938-5
work_keys_str_mv AT jiagaoxiang apermutationbasednonparametricanalysisofcrisprscreendata
AT wangxinlei apermutationbasednonparametricanalysisofcrisprscreendata
AT xiaoguanghua apermutationbasednonparametricanalysisofcrisprscreendata
AT jiagaoxiang permutationbasednonparametricanalysisofcrisprscreendata
AT wangxinlei permutationbasednonparametricanalysisofcrisprscreendata
AT xiaoguanghua permutationbasednonparametricanalysisofcrisprscreendata