Cargando…
Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring
Statistical analysis of single cell RNA-sequencing (scRNA-seq) data is hindered by high levels of technical noise and inflated zero counts. One promising approach for addressing these challenges is gene set testing, or pathway analysis, which can mitigate sparsity and noise, and improve interpretati...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7498348/ https://www.ncbi.nlm.nih.gov/pubmed/32633778 http://dx.doi.org/10.1093/nar/gkaa582 |
_version_ | 1783583490947678208 |
---|---|
author | Frost, Hildreth Robert |
author_facet | Frost, Hildreth Robert |
author_sort | Frost, Hildreth Robert |
collection | PubMed |
description | Statistical analysis of single cell RNA-sequencing (scRNA-seq) data is hindered by high levels of technical noise and inflated zero counts. One promising approach for addressing these challenges is gene set testing, or pathway analysis, which can mitigate sparsity and noise, and improve interpretation and power, by aggregating expression data to the pathway level. Unfortunately, methods optimized for bulk transcriptomics perform poorly on scRNA-seq data and progress on single cell-specific techniques has been limited. Importantly, no existing methods support cell-level gene set inference. To address this challenge, we developed a new gene set testing method, Variance-adjusted Mahalanobis (VAM), that integrates with the Seurat framework and can accommodate the technical noise, sparsity and large sample sizes characteristic of scRNA-seq data. The VAM method computes cell-specific pathway scores to transform a cell-by-gene matrix into a cell-by-pathway matrix that can be used for both data visualization and statistical enrichment analysis. Because the distribution of these scores under the null of uncorrelated technical noise has an accurate gamma approximation, both population and cell-level inference is supported. As demonstrated using simulated and real scRNA-seq data, the VAM method provides superior classification accuracy at a lower computation cost relative to existing single sample gene set testing approaches. |
format | Online Article Text |
id | pubmed-7498348 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-74983482020-09-23 Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring Frost, Hildreth Robert Nucleic Acids Res Methods Online Statistical analysis of single cell RNA-sequencing (scRNA-seq) data is hindered by high levels of technical noise and inflated zero counts. One promising approach for addressing these challenges is gene set testing, or pathway analysis, which can mitigate sparsity and noise, and improve interpretation and power, by aggregating expression data to the pathway level. Unfortunately, methods optimized for bulk transcriptomics perform poorly on scRNA-seq data and progress on single cell-specific techniques has been limited. Importantly, no existing methods support cell-level gene set inference. To address this challenge, we developed a new gene set testing method, Variance-adjusted Mahalanobis (VAM), that integrates with the Seurat framework and can accommodate the technical noise, sparsity and large sample sizes characteristic of scRNA-seq data. The VAM method computes cell-specific pathway scores to transform a cell-by-gene matrix into a cell-by-pathway matrix that can be used for both data visualization and statistical enrichment analysis. Because the distribution of these scores under the null of uncorrelated technical noise has an accurate gamma approximation, both population and cell-level inference is supported. As demonstrated using simulated and real scRNA-seq data, the VAM method provides superior classification accuracy at a lower computation cost relative to existing single sample gene set testing approaches. Oxford University Press 2020-07-07 /pmc/articles/PMC7498348/ /pubmed/32633778 http://dx.doi.org/10.1093/nar/gkaa582 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Frost, Hildreth Robert Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring |
title | Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring |
title_full | Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring |
title_fullStr | Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring |
title_full_unstemmed | Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring |
title_short | Variance-adjusted Mahalanobis (VAM): a fast and accurate method for cell-specific gene set scoring |
title_sort | variance-adjusted mahalanobis (vam): a fast and accurate method for cell-specific gene set scoring |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7498348/ https://www.ncbi.nlm.nih.gov/pubmed/32633778 http://dx.doi.org/10.1093/nar/gkaa582 |
work_keys_str_mv | AT frosthildrethrobert varianceadjustedmahalanobisvamafastandaccuratemethodforcellspecificgenesetscoring |