Cargando…

f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq

Single-cell RNA-sequencing (scRNA-seq) allows studying heterogeneity in gene expression in large cell populations. Such heterogeneity can arise due to technical or biological factors, making decomposing sources of variation difficult. We here describe f-scLVM (factorial single-cell latent variable m...

Descripción completa

Detalles Bibliográficos
Autores principales: Buettner, Florian, Pratanwanich, Naruemon, McCarthy, Davis J., Marioni, John C., Stegle, Oliver
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5674756/
https://www.ncbi.nlm.nih.gov/pubmed/29115968
http://dx.doi.org/10.1186/s13059-017-1334-8
_version_ 1783276843229511680
author Buettner, Florian
Pratanwanich, Naruemon
McCarthy, Davis J.
Marioni, John C.
Stegle, Oliver
author_facet Buettner, Florian
Pratanwanich, Naruemon
McCarthy, Davis J.
Marioni, John C.
Stegle, Oliver
author_sort Buettner, Florian
collection PubMed
description Single-cell RNA-sequencing (scRNA-seq) allows studying heterogeneity in gene expression in large cell populations. Such heterogeneity can arise due to technical or biological factors, making decomposing sources of variation difficult. We here describe f-scLVM (factorial single-cell latent variable model), a method based on factor analysis that uses pathway annotations to guide the inference of interpretable factors underpinning the heterogeneity. Our model jointly estimates the relevance of individual factors, refines gene set annotations, and infers factors without annotation. In applications to multiple scRNA-seq datasets, we find that f-scLVM robustly decomposes scRNA-seq datasets into interpretable components, thereby facilitating the identification of novel subpopulations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1334-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5674756
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56747562017-11-15 f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq Buettner, Florian Pratanwanich, Naruemon McCarthy, Davis J. Marioni, John C. Stegle, Oliver Genome Biol Method Single-cell RNA-sequencing (scRNA-seq) allows studying heterogeneity in gene expression in large cell populations. Such heterogeneity can arise due to technical or biological factors, making decomposing sources of variation difficult. We here describe f-scLVM (factorial single-cell latent variable model), a method based on factor analysis that uses pathway annotations to guide the inference of interpretable factors underpinning the heterogeneity. Our model jointly estimates the relevance of individual factors, refines gene set annotations, and infers factors without annotation. In applications to multiple scRNA-seq datasets, we find that f-scLVM robustly decomposes scRNA-seq datasets into interpretable components, thereby facilitating the identification of novel subpopulations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1334-8) contains supplementary material, which is available to authorized users. BioMed Central 2017-11-07 /pmc/articles/PMC5674756/ /pubmed/29115968 http://dx.doi.org/10.1186/s13059-017-1334-8 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Method
Buettner, Florian
Pratanwanich, Naruemon
McCarthy, Davis J.
Marioni, John C.
Stegle, Oliver
f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq
title f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq
title_full f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq
title_fullStr f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq
title_full_unstemmed f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq
title_short f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq
title_sort f-sclvm: scalable and versatile factor analysis for single-cell rna-seq
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5674756/
https://www.ncbi.nlm.nih.gov/pubmed/29115968
http://dx.doi.org/10.1186/s13059-017-1334-8
work_keys_str_mv AT buettnerflorian fsclvmscalableandversatilefactoranalysisforsinglecellrnaseq
AT pratanwanichnaruemon fsclvmscalableandversatilefactoranalysisforsinglecellrnaseq
AT mccarthydavisj fsclvmscalableandversatilefactoranalysisforsinglecellrnaseq
AT marionijohnc fsclvmscalableandversatilefactoranalysisforsinglecellrnaseq
AT stegleoliver fsclvmscalableandversatilefactoranalysisforsinglecellrnaseq