Cargando…
scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences
Recent progress in Single-Cell Genomics has produced different library protocols and techniques for molecular profiling. We formulate a unifying, data-driven, integrative, and predictive methodology for different libraries, samples, and paired-unpaired data modalities. Our design of scAEGAN includes...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9897517/ https://www.ncbi.nlm.nih.gov/pubmed/36735690 http://dx.doi.org/10.1371/journal.pone.0281315 |
_version_ | 1784882265118998528 |
---|---|
author | Khan, Sumeer Ahmad Lehmann, Robert Martinez-de-Morentin, Xabier Maillo, Alberto Lagani, Vincenzo Kiani, Narsis A. Gomez-Cabrero, David Tegner, Jesper |
author_facet | Khan, Sumeer Ahmad Lehmann, Robert Martinez-de-Morentin, Xabier Maillo, Alberto Lagani, Vincenzo Kiani, Narsis A. Gomez-Cabrero, David Tegner, Jesper |
author_sort | Khan, Sumeer Ahmad |
collection | PubMed |
description | Recent progress in Single-Cell Genomics has produced different library protocols and techniques for molecular profiling. We formulate a unifying, data-driven, integrative, and predictive methodology for different libraries, samples, and paired-unpaired data modalities. Our design of scAEGAN includes an autoencoder (AE) network integrated with adversarial learning by a cycleGAN (cGAN) network. The AE learns a low-dimensional embedding of each condition, whereas the cGAN learns a non-linear mapping between the AE representations. We evaluate scAEGAN using simulated data and real scRNA-seq datasets, different library preparations (Fluidigm C1, CelSeq, CelSeq2, SmartSeq), and several data modalities as paired scRNA-seq and scATAC-seq. The scAEGAN outperforms Seurat3 in library integration, is more robust against data sparsity, and beats Seurat 4 in integrating paired data from the same cell. Furthermore, in predicting one data modality from another, scAEGAN outperforms Babel. We conclude that scAEGAN surpasses current state-of-the-art methods and unifies integration and prediction challenges. |
format | Online Article Text |
id | pubmed-9897517 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-98975172023-02-04 scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences Khan, Sumeer Ahmad Lehmann, Robert Martinez-de-Morentin, Xabier Maillo, Alberto Lagani, Vincenzo Kiani, Narsis A. Gomez-Cabrero, David Tegner, Jesper PLoS One Research Article Recent progress in Single-Cell Genomics has produced different library protocols and techniques for molecular profiling. We formulate a unifying, data-driven, integrative, and predictive methodology for different libraries, samples, and paired-unpaired data modalities. Our design of scAEGAN includes an autoencoder (AE) network integrated with adversarial learning by a cycleGAN (cGAN) network. The AE learns a low-dimensional embedding of each condition, whereas the cGAN learns a non-linear mapping between the AE representations. We evaluate scAEGAN using simulated data and real scRNA-seq datasets, different library preparations (Fluidigm C1, CelSeq, CelSeq2, SmartSeq), and several data modalities as paired scRNA-seq and scATAC-seq. The scAEGAN outperforms Seurat3 in library integration, is more robust against data sparsity, and beats Seurat 4 in integrating paired data from the same cell. Furthermore, in predicting one data modality from another, scAEGAN outperforms Babel. We conclude that scAEGAN surpasses current state-of-the-art methods and unifies integration and prediction challenges. Public Library of Science 2023-02-03 /pmc/articles/PMC9897517/ /pubmed/36735690 http://dx.doi.org/10.1371/journal.pone.0281315 Text en © 2023 Khan et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Khan, Sumeer Ahmad Lehmann, Robert Martinez-de-Morentin, Xabier Maillo, Alberto Lagani, Vincenzo Kiani, Narsis A. Gomez-Cabrero, David Tegner, Jesper scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences |
title | scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences |
title_full | scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences |
title_fullStr | scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences |
title_full_unstemmed | scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences |
title_short | scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences |
title_sort | scaegan: unification of single-cell genomics data by adversarial learning of latent space correspondences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9897517/ https://www.ncbi.nlm.nih.gov/pubmed/36735690 http://dx.doi.org/10.1371/journal.pone.0281315 |
work_keys_str_mv | AT khansumeerahmad scaeganunificationofsinglecellgenomicsdatabyadversariallearningoflatentspacecorrespondences AT lehmannrobert scaeganunificationofsinglecellgenomicsdatabyadversariallearningoflatentspacecorrespondences AT martinezdemorentinxabier scaeganunificationofsinglecellgenomicsdatabyadversariallearningoflatentspacecorrespondences AT mailloalberto scaeganunificationofsinglecellgenomicsdatabyadversariallearningoflatentspacecorrespondences AT laganivincenzo scaeganunificationofsinglecellgenomicsdatabyadversariallearningoflatentspacecorrespondences AT kianinarsisa scaeganunificationofsinglecellgenomicsdatabyadversariallearningoflatentspacecorrespondences AT gomezcabrerodavid scaeganunificationofsinglecellgenomicsdatabyadversariallearningoflatentspacecorrespondences AT tegnerjesper scaeganunificationofsinglecellgenomicsdatabyadversariallearningoflatentspacecorrespondences |