Cargando…

The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data

MOTIVATION: Learning low-dimensional representations of single-cell transcriptomics has become instrumental to its downstream analysis. The state of the art is currently represented by neural network models, such as variational autoencoders, which use a variational approximation of the likelihood fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Schuster, Viktoria, Krogh, Anders
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483129/
https://www.ncbi.nlm.nih.gov/pubmed/37572301
http://dx.doi.org/10.1093/bioinformatics/btad497
_version_ 1785102312772993024
author Schuster, Viktoria
Krogh, Anders
author_facet Schuster, Viktoria
Krogh, Anders
author_sort Schuster, Viktoria
collection PubMed
description MOTIVATION: Learning low-dimensional representations of single-cell transcriptomics has become instrumental to its downstream analysis. The state of the art is currently represented by neural network models, such as variational autoencoders, which use a variational approximation of the likelihood for inference. RESULTS: We here present the Deep Generative Decoder (DGD), a simple generative model that computes model parameters and representations directly via maximum a posteriori estimation. The DGD handles complex parameterized latent distributions naturally unlike variational autoencoders, which typically use a fixed Gaussian distribution, because of the complexity of adding other types. We first show its general functionality on a commonly used benchmark set, Fashion-MNIST. Secondly, we apply the model to multiple single-cell datasets. Here, the DGD learns low-dimensional, meaningful, and well-structured latent representations with sub-clustering beyond the provided labels. The advantages of this approach are its simplicity and its capability to provide representations of much smaller dimensionality than a comparable variational autoencoder. AVAILABILITY AND IMPLEMENTATION: scDGD is available as a python package at https://github.com/Center-for-Health-Data-Science/scDGD. The remaining code is made available here: https://github.com/Center-for-Health-Data-Science/dgd.
format Online
Article
Text
id pubmed-10483129
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104831292023-09-08 The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data Schuster, Viktoria Krogh, Anders Bioinformatics Original Paper MOTIVATION: Learning low-dimensional representations of single-cell transcriptomics has become instrumental to its downstream analysis. The state of the art is currently represented by neural network models, such as variational autoencoders, which use a variational approximation of the likelihood for inference. RESULTS: We here present the Deep Generative Decoder (DGD), a simple generative model that computes model parameters and representations directly via maximum a posteriori estimation. The DGD handles complex parameterized latent distributions naturally unlike variational autoencoders, which typically use a fixed Gaussian distribution, because of the complexity of adding other types. We first show its general functionality on a commonly used benchmark set, Fashion-MNIST. Secondly, we apply the model to multiple single-cell datasets. Here, the DGD learns low-dimensional, meaningful, and well-structured latent representations with sub-clustering beyond the provided labels. The advantages of this approach are its simplicity and its capability to provide representations of much smaller dimensionality than a comparable variational autoencoder. AVAILABILITY AND IMPLEMENTATION: scDGD is available as a python package at https://github.com/Center-for-Health-Data-Science/scDGD. The remaining code is made available here: https://github.com/Center-for-Health-Data-Science/dgd. Oxford University Press 2023-08-12 /pmc/articles/PMC10483129/ /pubmed/37572301 http://dx.doi.org/10.1093/bioinformatics/btad497 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Schuster, Viktoria
Krogh, Anders
The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data
title The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data
title_full The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data
title_fullStr The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data
title_full_unstemmed The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data
title_short The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data
title_sort deep generative decoder: map estimation of representations improves modelling of single-cell rna data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483129/
https://www.ncbi.nlm.nih.gov/pubmed/37572301
http://dx.doi.org/10.1093/bioinformatics/btad497
work_keys_str_mv AT schusterviktoria thedeepgenerativedecodermapestimationofrepresentationsimprovesmodellingofsinglecellrnadata
AT kroghanders thedeepgenerativedecodermapestimationofrepresentationsimprovesmodellingofsinglecellrnadata
AT schusterviktoria deepgenerativedecodermapestimationofrepresentationsimprovesmodellingofsinglecellrnadata
AT kroghanders deepgenerativedecodermapestimationofrepresentationsimprovesmodellingofsinglecellrnadata