Cargando…
Nonnegative spatial factorization applied to spatial genomics
Nonnegative matrix factorization (NMF) is widely used to analyze high-dimensional count data because, in contrast to real-valued alternatives such as factor analysis, it produces an interpretable parts-based representation. However, in applications such as spatial transcriptomics, NMF fails to incor...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9911348/ https://www.ncbi.nlm.nih.gov/pubmed/36587187 http://dx.doi.org/10.1038/s41592-022-01687-w |
_version_ | 1784884974363607040 |
---|---|
author | Townes, F. William Engelhardt, Barbara E. |
author_facet | Townes, F. William Engelhardt, Barbara E. |
author_sort | Townes, F. William |
collection | PubMed |
description | Nonnegative matrix factorization (NMF) is widely used to analyze high-dimensional count data because, in contrast to real-valued alternatives such as factor analysis, it produces an interpretable parts-based representation. However, in applications such as spatial transcriptomics, NMF fails to incorporate known structure between observations. Here, we present nonnegative spatial factorization (NSF), a spatially-aware probabilistic dimension reduction model based on transformed Gaussian processes that naturally encourages sparsity and scales to tens of thousands of observations. NSF recovers ground truth factors more accurately than real-valued alternatives such as MEFISTO in simulations, and has lower out-of-sample prediction error than probabilistic NMF on three spatial transcriptomics datasets from mouse brain and liver. Since not all patterns of gene expression have spatial correlations, we also propose a hybrid extension of NSF that combines spatial and nonspatial components, enabling quantification of spatial importance for both observations and features. A TensorFlow implementation of NSF is available from https://github.com/willtownes/nsf-paper. |
format | Online Article Text |
id | pubmed-9911348 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-99113482023-02-11 Nonnegative spatial factorization applied to spatial genomics Townes, F. William Engelhardt, Barbara E. Nat Methods Article Nonnegative matrix factorization (NMF) is widely used to analyze high-dimensional count data because, in contrast to real-valued alternatives such as factor analysis, it produces an interpretable parts-based representation. However, in applications such as spatial transcriptomics, NMF fails to incorporate known structure between observations. Here, we present nonnegative spatial factorization (NSF), a spatially-aware probabilistic dimension reduction model based on transformed Gaussian processes that naturally encourages sparsity and scales to tens of thousands of observations. NSF recovers ground truth factors more accurately than real-valued alternatives such as MEFISTO in simulations, and has lower out-of-sample prediction error than probabilistic NMF on three spatial transcriptomics datasets from mouse brain and liver. Since not all patterns of gene expression have spatial correlations, we also propose a hybrid extension of NSF that combines spatial and nonspatial components, enabling quantification of spatial importance for both observations and features. A TensorFlow implementation of NSF is available from https://github.com/willtownes/nsf-paper. Nature Publishing Group US 2022-12-31 2023 /pmc/articles/PMC9911348/ /pubmed/36587187 http://dx.doi.org/10.1038/s41592-022-01687-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Townes, F. William Engelhardt, Barbara E. Nonnegative spatial factorization applied to spatial genomics |
title | Nonnegative spatial factorization applied to spatial genomics |
title_full | Nonnegative spatial factorization applied to spatial genomics |
title_fullStr | Nonnegative spatial factorization applied to spatial genomics |
title_full_unstemmed | Nonnegative spatial factorization applied to spatial genomics |
title_short | Nonnegative spatial factorization applied to spatial genomics |
title_sort | nonnegative spatial factorization applied to spatial genomics |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9911348/ https://www.ncbi.nlm.nih.gov/pubmed/36587187 http://dx.doi.org/10.1038/s41592-022-01687-w |
work_keys_str_mv | AT townesfwilliam nonnegativespatialfactorizationappliedtospatialgenomics AT engelhardtbarbarae nonnegativespatialfactorizationappliedtospatialgenomics |