Cargando…

Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations

MOTIVATION: Variational autoencoders (VAEs) have rapidly increased in popularity in biological applications and have already successfully been used on many omic datasets. Their latent space provides a low-dimensional representation of input data, and VAEs have been applied, e.g. for clustering of si...

Descripción completa

Detalles Bibliográficos
Autores principales: Doncevic, Daria, Herrmann, Carl
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10301695/
https://www.ncbi.nlm.nih.gov/pubmed/37326971
http://dx.doi.org/10.1093/bioinformatics/btad387
_version_ 1785064873382641664
author Doncevic, Daria
Herrmann, Carl
author_facet Doncevic, Daria
Herrmann, Carl
author_sort Doncevic, Daria
collection PubMed
description MOTIVATION: Variational autoencoders (VAEs) have rapidly increased in popularity in biological applications and have already successfully been used on many omic datasets. Their latent space provides a low-dimensional representation of input data, and VAEs have been applied, e.g. for clustering of single-cell transcriptomic data. However, due to their non-linear nature, the patterns that VAEs learn in the latent space remain obscure. Hence, the lower-dimensional data embedding cannot directly be related to input features. RESULTS: To shed light on the inner workings of VAE and enable direct interpretability of the model through its structure, we designed a novel VAE, OntoVAE (Ontology guided VAE) that can incorporate any ontology in its latent space and decoder part and, thus, provide pathway or phenotype activities for the ontology terms. In this work, we demonstrate that OntoVAE can be applied in the context of predictive modeling and show its ability to predict the effects of genetic or drug-induced perturbations using different ontologies and both, bulk and single-cell transcriptomic datasets. Finally, we provide a flexible framework, which can be easily adapted to any ontology and dataset. AVAILABILITY AND IMPLEMENTATION: OntoVAE is available as a python package under https://github.com/hdsu-bioquant/onto-vae.
format Online
Article
Text
id pubmed-10301695
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103016952023-06-29 Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations Doncevic, Daria Herrmann, Carl Bioinformatics Original Paper MOTIVATION: Variational autoencoders (VAEs) have rapidly increased in popularity in biological applications and have already successfully been used on many omic datasets. Their latent space provides a low-dimensional representation of input data, and VAEs have been applied, e.g. for clustering of single-cell transcriptomic data. However, due to their non-linear nature, the patterns that VAEs learn in the latent space remain obscure. Hence, the lower-dimensional data embedding cannot directly be related to input features. RESULTS: To shed light on the inner workings of VAE and enable direct interpretability of the model through its structure, we designed a novel VAE, OntoVAE (Ontology guided VAE) that can incorporate any ontology in its latent space and decoder part and, thus, provide pathway or phenotype activities for the ontology terms. In this work, we demonstrate that OntoVAE can be applied in the context of predictive modeling and show its ability to predict the effects of genetic or drug-induced perturbations using different ontologies and both, bulk and single-cell transcriptomic datasets. Finally, we provide a flexible framework, which can be easily adapted to any ontology and dataset. AVAILABILITY AND IMPLEMENTATION: OntoVAE is available as a python package under https://github.com/hdsu-bioquant/onto-vae. Oxford University Press 2023-06-16 /pmc/articles/PMC10301695/ /pubmed/37326971 http://dx.doi.org/10.1093/bioinformatics/btad387 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Doncevic, Daria
Herrmann, Carl
Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations
title Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations
title_full Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations
title_fullStr Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations
title_full_unstemmed Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations
title_short Biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations
title_sort biologically informed variational autoencoders allow predictive modeling of genetic and drug-induced perturbations
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10301695/
https://www.ncbi.nlm.nih.gov/pubmed/37326971
http://dx.doi.org/10.1093/bioinformatics/btad387
work_keys_str_mv AT doncevicdaria biologicallyinformedvariationalautoencodersallowpredictivemodelingofgeneticanddruginducedperturbations
AT herrmanncarl biologicallyinformedvariationalautoencodersallowpredictivemodelingofgeneticanddruginducedperturbations