Cargando…

N-of-one differential gene expression without control samples using a deep generative model

Differential analysis of bulk RNA-seq data often suffers from lack of good controls. Here, we present a generative model that replaces controls, trained solely on healthy tissues. The unsupervised model learns a low-dimensional representation and can identify the closest normal representation for a...

Descripción completa

Detalles Bibliográficos
Autores principales: Prada-Luengo, Iñigo, Schuster, Viktoria, Liang, Yuhu, Terkelsen, Thilde, Sora, Valentina, Krogh, Anders
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10655485/
https://www.ncbi.nlm.nih.gov/pubmed/37974217
http://dx.doi.org/10.1186/s13059-023-03104-7
Descripción
Sumario:Differential analysis of bulk RNA-seq data often suffers from lack of good controls. Here, we present a generative model that replaces controls, trained solely on healthy tissues. The unsupervised model learns a low-dimensional representation and can identify the closest normal representation for a given disease sample. This enables control-free, single-sample differential expression analysis. In breast cancer, we demonstrate how our approach selects marker genes and outperforms a state-of-the-art method. Furthermore, significant genes identified by the model are enriched in driver genes across cancers. Our results show that the in silico closest normal provides a more favorable comparison than control samples. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03104-7.