Cargando…

Nonnegative spatial factorization applied to spatial genomics

Nonnegative matrix factorization (NMF) is widely used to analyze high-dimensional count data because, in contrast to real-valued alternatives such as factor analysis, it produces an interpretable parts-based representation. However, in applications such as spatial transcriptomics, NMF fails to incor...

Descripción completa

Detalles Bibliográficos
Autores principales: Townes, F. William, Engelhardt, Barbara E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9911348/
https://www.ncbi.nlm.nih.gov/pubmed/36587187
http://dx.doi.org/10.1038/s41592-022-01687-w
_version_ 1784884974363607040
author Townes, F. William
Engelhardt, Barbara E.
author_facet Townes, F. William
Engelhardt, Barbara E.
author_sort Townes, F. William
collection PubMed
description Nonnegative matrix factorization (NMF) is widely used to analyze high-dimensional count data because, in contrast to real-valued alternatives such as factor analysis, it produces an interpretable parts-based representation. However, in applications such as spatial transcriptomics, NMF fails to incorporate known structure between observations. Here, we present nonnegative spatial factorization (NSF), a spatially-aware probabilistic dimension reduction model based on transformed Gaussian processes that naturally encourages sparsity and scales to tens of thousands of observations. NSF recovers ground truth factors more accurately than real-valued alternatives such as MEFISTO in simulations, and has lower out-of-sample prediction error than probabilistic NMF on three spatial transcriptomics datasets from mouse brain and liver. Since not all patterns of gene expression have spatial correlations, we also propose a hybrid extension of NSF that combines spatial and nonspatial components, enabling quantification of spatial importance for both observations and features. A TensorFlow implementation of NSF is available from https://github.com/willtownes/nsf-paper.
format Online
Article
Text
id pubmed-9911348
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-99113482023-02-11 Nonnegative spatial factorization applied to spatial genomics Townes, F. William Engelhardt, Barbara E. Nat Methods Article Nonnegative matrix factorization (NMF) is widely used to analyze high-dimensional count data because, in contrast to real-valued alternatives such as factor analysis, it produces an interpretable parts-based representation. However, in applications such as spatial transcriptomics, NMF fails to incorporate known structure between observations. Here, we present nonnegative spatial factorization (NSF), a spatially-aware probabilistic dimension reduction model based on transformed Gaussian processes that naturally encourages sparsity and scales to tens of thousands of observations. NSF recovers ground truth factors more accurately than real-valued alternatives such as MEFISTO in simulations, and has lower out-of-sample prediction error than probabilistic NMF on three spatial transcriptomics datasets from mouse brain and liver. Since not all patterns of gene expression have spatial correlations, we also propose a hybrid extension of NSF that combines spatial and nonspatial components, enabling quantification of spatial importance for both observations and features. A TensorFlow implementation of NSF is available from https://github.com/willtownes/nsf-paper. Nature Publishing Group US 2022-12-31 2023 /pmc/articles/PMC9911348/ /pubmed/36587187 http://dx.doi.org/10.1038/s41592-022-01687-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Townes, F. William
Engelhardt, Barbara E.
Nonnegative spatial factorization applied to spatial genomics
title Nonnegative spatial factorization applied to spatial genomics
title_full Nonnegative spatial factorization applied to spatial genomics
title_fullStr Nonnegative spatial factorization applied to spatial genomics
title_full_unstemmed Nonnegative spatial factorization applied to spatial genomics
title_short Nonnegative spatial factorization applied to spatial genomics
title_sort nonnegative spatial factorization applied to spatial genomics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9911348/
https://www.ncbi.nlm.nih.gov/pubmed/36587187
http://dx.doi.org/10.1038/s41592-022-01687-w
work_keys_str_mv AT townesfwilliam nonnegativespatialfactorizationappliedtospatialgenomics
AT engelhardtbarbarae nonnegativespatialfactorizationappliedtospatialgenomics