Cargando…
Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model
Single-cell RNA-Seq (scRNA-Seq) profiles gene expression of individual cells. Recent scRNA-Seq datasets have incorporated unique molecular identifiers (UMIs). Using negative controls, we show UMI counts follow multinomial sampling with no zero inflation. Current normalization procedures such as log...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927135/ https://www.ncbi.nlm.nih.gov/pubmed/31870412 http://dx.doi.org/10.1186/s13059-019-1861-6 |
_version_ | 1783482247534346240 |
---|---|
author | Townes, F. William Hicks, Stephanie C. Aryee, Martin J. Irizarry, Rafael A. |
author_facet | Townes, F. William Hicks, Stephanie C. Aryee, Martin J. Irizarry, Rafael A. |
author_sort | Townes, F. William |
collection | PubMed |
description | Single-cell RNA-Seq (scRNA-Seq) profiles gene expression of individual cells. Recent scRNA-Seq datasets have incorporated unique molecular identifiers (UMIs). Using negative controls, we show UMI counts follow multinomial sampling with no zero inflation. Current normalization procedures such as log of counts per million and feature selection by highly variable genes produce false variability in dimension reduction. We propose simple multinomial methods, including generalized principal component analysis (GLM-PCA) for non-normal distributions, and feature selection using deviance. These methods outperform the current practice in a downstream clustering assessment using ground truth datasets. |
format | Online Article Text |
id | pubmed-6927135 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69271352019-12-30 Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model Townes, F. William Hicks, Stephanie C. Aryee, Martin J. Irizarry, Rafael A. Genome Biol Method Single-cell RNA-Seq (scRNA-Seq) profiles gene expression of individual cells. Recent scRNA-Seq datasets have incorporated unique molecular identifiers (UMIs). Using negative controls, we show UMI counts follow multinomial sampling with no zero inflation. Current normalization procedures such as log of counts per million and feature selection by highly variable genes produce false variability in dimension reduction. We propose simple multinomial methods, including generalized principal component analysis (GLM-PCA) for non-normal distributions, and feature selection using deviance. These methods outperform the current practice in a downstream clustering assessment using ground truth datasets. BioMed Central 2019-12-23 /pmc/articles/PMC6927135/ /pubmed/31870412 http://dx.doi.org/10.1186/s13059-019-1861-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Method Townes, F. William Hicks, Stephanie C. Aryee, Martin J. Irizarry, Rafael A. Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model |
title | Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model |
title_full | Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model |
title_fullStr | Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model |
title_full_unstemmed | Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model |
title_short | Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model |
title_sort | feature selection and dimension reduction for single-cell rna-seq based on a multinomial model |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927135/ https://www.ncbi.nlm.nih.gov/pubmed/31870412 http://dx.doi.org/10.1186/s13059-019-1861-6 |
work_keys_str_mv | AT townesfwilliam featureselectionanddimensionreductionforsinglecellrnaseqbasedonamultinomialmodel AT hicksstephaniec featureselectionanddimensionreductionforsinglecellrnaseqbasedonamultinomialmodel AT aryeemartinj featureselectionanddimensionreductionforsinglecellrnaseqbasedonamultinomialmodel AT irizarryrafaela featureselectionanddimensionreductionforsinglecellrnaseqbasedonamultinomialmodel |