Cargando…

Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues

Autoencoders have been used to model single-cell mRNA-sequencing data with the purpose of denoising, visualization, data simulation, and dimensionality reduction. We, and others, have shown that autoencoders can be explainable models and interpreted in terms of biology. Here, we show that such autoe...

Descripción completa

Detalles Bibliográficos
Autores principales: Walbech, Julie Sparholt, Kinalis, Savvas, Winther, Ole, Nielsen, Finn Cilius, Bagger, Frederik Otzen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8750521/
https://www.ncbi.nlm.nih.gov/pubmed/35011647
http://dx.doi.org/10.3390/cells11010085
_version_ 1784631479671717888
author Walbech, Julie Sparholt
Kinalis, Savvas
Winther, Ole
Nielsen, Finn Cilius
Bagger, Frederik Otzen
author_facet Walbech, Julie Sparholt
Kinalis, Savvas
Winther, Ole
Nielsen, Finn Cilius
Bagger, Frederik Otzen
author_sort Walbech, Julie Sparholt
collection PubMed
description Autoencoders have been used to model single-cell mRNA-sequencing data with the purpose of denoising, visualization, data simulation, and dimensionality reduction. We, and others, have shown that autoencoders can be explainable models and interpreted in terms of biology. Here, we show that such autoencoders can generalize to the extent that they can transfer directly without additional training. In practice, we can extract biological modules, denoise, and classify data correctly from an autoencoder that was trained on a different dataset and with different cells (a foreign model). We deconvoluted the biological signal encoded in the bottleneck layer of scRNA-models using saliency maps and mapped salient features to biological pathways. Biological concepts could be associated with specific nodes and interpreted in relation to biological pathways. Even in this unsupervised framework, with no prior information about cell types or labels, the specific biological pathways deduced from the model were in line with findings in previous research. It was hypothesized that autoencoders could learn and represent meaningful biology; here, we show with a systematic experiment that this is true and even transcends the training data. This means that carefully trained autoencoders can be used to assist the interpretation of new unseen data.
format Online
Article
Text
id pubmed-8750521
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87505212022-01-12 Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues Walbech, Julie Sparholt Kinalis, Savvas Winther, Ole Nielsen, Finn Cilius Bagger, Frederik Otzen Cells Article Autoencoders have been used to model single-cell mRNA-sequencing data with the purpose of denoising, visualization, data simulation, and dimensionality reduction. We, and others, have shown that autoencoders can be explainable models and interpreted in terms of biology. Here, we show that such autoencoders can generalize to the extent that they can transfer directly without additional training. In practice, we can extract biological modules, denoise, and classify data correctly from an autoencoder that was trained on a different dataset and with different cells (a foreign model). We deconvoluted the biological signal encoded in the bottleneck layer of scRNA-models using saliency maps and mapped salient features to biological pathways. Biological concepts could be associated with specific nodes and interpreted in relation to biological pathways. Even in this unsupervised framework, with no prior information about cell types or labels, the specific biological pathways deduced from the model were in line with findings in previous research. It was hypothesized that autoencoders could learn and represent meaningful biology; here, we show with a systematic experiment that this is true and even transcends the training data. This means that carefully trained autoencoders can be used to assist the interpretation of new unseen data. MDPI 2021-12-28 /pmc/articles/PMC8750521/ /pubmed/35011647 http://dx.doi.org/10.3390/cells11010085 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Walbech, Julie Sparholt
Kinalis, Savvas
Winther, Ole
Nielsen, Finn Cilius
Bagger, Frederik Otzen
Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues
title Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues
title_full Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues
title_fullStr Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues
title_full_unstemmed Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues
title_short Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues
title_sort interpretable autoencoders trained on single cell sequencing data can transfer directly to data from unseen tissues
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8750521/
https://www.ncbi.nlm.nih.gov/pubmed/35011647
http://dx.doi.org/10.3390/cells11010085
work_keys_str_mv AT walbechjuliesparholt interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues
AT kinalissavvas interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues
AT wintherole interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues
AT nielsenfinncilius interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues
AT baggerfrederikotzen interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues