Cargando…

Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring

Statistical analysis of single cell RNA-sequencing (scRNA-seq) data is hindered by high levels of technical noise and inflated zero counts. One promising approach for addressing these challenges is gene set testing, or pathway analysis, which can mitigate sparsity and noise, and improve interpretati...

Descripción completa

Detalles Bibliográficos
Autor principal: Frost, Hildreth Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7498348/
https://www.ncbi.nlm.nih.gov/pubmed/32633778
http://dx.doi.org/10.1093/nar/gkaa582
_version_ 1783583490947678208
author Frost, Hildreth Robert
author_facet Frost, Hildreth Robert
author_sort Frost, Hildreth Robert
collection PubMed
description Statistical analysis of single cell RNA-sequencing (scRNA-seq) data is hindered by high levels of technical noise and inflated zero counts. One promising approach for addressing these challenges is gene set testing, or pathway analysis, which can mitigate sparsity and noise, and improve interpretation and power, by aggregating expression data to the pathway level. Unfortunately, methods optimized for bulk transcriptomics perform poorly on scRNA-seq data and progress on single cell-specific techniques has been limited. Importantly, no existing methods support cell-level gene set inference. To address this challenge, we developed a new gene set testing method, Variance-adjusted Mahalanobis (VAM), that integrates with the Seurat framework and can accommodate the technical noise, sparsity and large sample sizes characteristic of scRNA-seq data. The VAM method computes cell-specific pathway scores to transform a cell-by-gene matrix into a cell-by-pathway matrix that can be used for both data visualization and statistical enrichment analysis. Because the distribution of these scores under the null of uncorrelated technical noise has an accurate gamma approximation, both population and cell-level inference is supported. As demonstrated using simulated and real scRNA-seq data, the VAM method provides superior classification accuracy at a lower computation cost relative to existing single sample gene set testing approaches.
format Online
Article
Text
id pubmed-7498348
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-74983482020-09-23 Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring Frost, Hildreth Robert Nucleic Acids Res Methods Online Statistical analysis of single cell RNA-sequencing (scRNA-seq) data is hindered by high levels of technical noise and inflated zero counts. One promising approach for addressing these challenges is gene set testing, or pathway analysis, which can mitigate sparsity and noise, and improve interpretation and power, by aggregating expression data to the pathway level. Unfortunately, methods optimized for bulk transcriptomics perform poorly on scRNA-seq data and progress on single cell-specific techniques has been limited. Importantly, no existing methods support cell-level gene set inference. To address this challenge, we developed a new gene set testing method, Variance-adjusted Mahalanobis (VAM), that integrates with the Seurat framework and can accommodate the technical noise, sparsity and large sample sizes characteristic of scRNA-seq data. The VAM method computes cell-specific pathway scores to transform a cell-by-gene matrix into a cell-by-pathway matrix that can be used for both data visualization and statistical enrichment analysis. Because the distribution of these scores under the null of uncorrelated technical noise has an accurate gamma approximation, both population and cell-level inference is supported. As demonstrated using simulated and real scRNA-seq data, the VAM method provides superior classification accuracy at a lower computation cost relative to existing single sample gene set testing approaches. Oxford University Press 2020-07-07 /pmc/articles/PMC7498348/ /pubmed/32633778 http://dx.doi.org/10.1093/nar/gkaa582 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Frost, Hildreth Robert
Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring
title Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring
title_full Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring
title_fullStr Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring
title_full_unstemmed Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring
title_short Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring
title_sort variance-adjusted mahalanobis (vam): a fast and accurate method for cell-specific gene set scoring
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7498348/
https://www.ncbi.nlm.nih.gov/pubmed/32633778
http://dx.doi.org/10.1093/nar/gkaa582
work_keys_str_mv AT frosthildrethrobert varianceadjustedmahalanobisvamafastandaccuratemethodforcellspecificgenesetscoring