Cargando…
Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues
Autoencoders have been used to model single-cell mRNA-sequencing data with the purpose of denoising, visualization, data simulation, and dimensionality reduction. We, and others, have shown that autoencoders can be explainable models and interpreted in terms of biology. Here, we show that such autoe...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8750521/ https://www.ncbi.nlm.nih.gov/pubmed/35011647 http://dx.doi.org/10.3390/cells11010085 |
_version_ | 1784631479671717888 |
---|---|
author | Walbech, Julie Sparholt Kinalis, Savvas Winther, Ole Nielsen, Finn Cilius Bagger, Frederik Otzen |
author_facet | Walbech, Julie Sparholt Kinalis, Savvas Winther, Ole Nielsen, Finn Cilius Bagger, Frederik Otzen |
author_sort | Walbech, Julie Sparholt |
collection | PubMed |
description | Autoencoders have been used to model single-cell mRNA-sequencing data with the purpose of denoising, visualization, data simulation, and dimensionality reduction. We, and others, have shown that autoencoders can be explainable models and interpreted in terms of biology. Here, we show that such autoencoders can generalize to the extent that they can transfer directly without additional training. In practice, we can extract biological modules, denoise, and classify data correctly from an autoencoder that was trained on a different dataset and with different cells (a foreign model). We deconvoluted the biological signal encoded in the bottleneck layer of scRNA-models using saliency maps and mapped salient features to biological pathways. Biological concepts could be associated with specific nodes and interpreted in relation to biological pathways. Even in this unsupervised framework, with no prior information about cell types or labels, the specific biological pathways deduced from the model were in line with findings in previous research. It was hypothesized that autoencoders could learn and represent meaningful biology; here, we show with a systematic experiment that this is true and even transcends the training data. This means that carefully trained autoencoders can be used to assist the interpretation of new unseen data. |
format | Online Article Text |
id | pubmed-8750521 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-87505212022-01-12 Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues Walbech, Julie Sparholt Kinalis, Savvas Winther, Ole Nielsen, Finn Cilius Bagger, Frederik Otzen Cells Article Autoencoders have been used to model single-cell mRNA-sequencing data with the purpose of denoising, visualization, data simulation, and dimensionality reduction. We, and others, have shown that autoencoders can be explainable models and interpreted in terms of biology. Here, we show that such autoencoders can generalize to the extent that they can transfer directly without additional training. In practice, we can extract biological modules, denoise, and classify data correctly from an autoencoder that was trained on a different dataset and with different cells (a foreign model). We deconvoluted the biological signal encoded in the bottleneck layer of scRNA-models using saliency maps and mapped salient features to biological pathways. Biological concepts could be associated with specific nodes and interpreted in relation to biological pathways. Even in this unsupervised framework, with no prior information about cell types or labels, the specific biological pathways deduced from the model were in line with findings in previous research. It was hypothesized that autoencoders could learn and represent meaningful biology; here, we show with a systematic experiment that this is true and even transcends the training data. This means that carefully trained autoencoders can be used to assist the interpretation of new unseen data. MDPI 2021-12-28 /pmc/articles/PMC8750521/ /pubmed/35011647 http://dx.doi.org/10.3390/cells11010085 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Walbech, Julie Sparholt Kinalis, Savvas Winther, Ole Nielsen, Finn Cilius Bagger, Frederik Otzen Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues |
title | Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues |
title_full | Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues |
title_fullStr | Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues |
title_full_unstemmed | Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues |
title_short | Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues |
title_sort | interpretable autoencoders trained on single cell sequencing data can transfer directly to data from unseen tissues |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8750521/ https://www.ncbi.nlm.nih.gov/pubmed/35011647 http://dx.doi.org/10.3390/cells11010085 |
work_keys_str_mv | AT walbechjuliesparholt interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues AT kinalissavvas interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues AT wintherole interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues AT nielsenfinncilius interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues AT baggerfrederikotzen interpretableautoencoderstrainedonsinglecellsequencingdatacantransferdirectlytodatafromunseentissues |