Cargando…

IMGG: Integrating Multiple Single-Cell Datasets through Connected Graphs and Generative Adversarial Networks

There is a strong need to eliminate batch-specific differences when integrating single-cell RNA-sequencing (scRNA-seq) datasets generated under different experimental conditions for downstream task analysis. Existing batch correction methods usually transform different batches of cells into one pres...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xun, Zhang, Chaogang, Zhang, Ying, Meng, Xiangyu, Zhang, Zhiyuan, Shi, Xin, Song, Tao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8876681/
https://www.ncbi.nlm.nih.gov/pubmed/35216199
http://dx.doi.org/10.3390/ijms23042082
_version_ 1784658231129276416
author Wang, Xun
Zhang, Chaogang
Zhang, Ying
Meng, Xiangyu
Zhang, Zhiyuan
Shi, Xin
Song, Tao
author_facet Wang, Xun
Zhang, Chaogang
Zhang, Ying
Meng, Xiangyu
Zhang, Zhiyuan
Shi, Xin
Song, Tao
author_sort Wang, Xun
collection PubMed
description There is a strong need to eliminate batch-specific differences when integrating single-cell RNA-sequencing (scRNA-seq) datasets generated under different experimental conditions for downstream task analysis. Existing batch correction methods usually transform different batches of cells into one preselected “anchor” batch or a low-dimensional embedding space, and cannot take full advantage of useful information from multiple sources. We present a novel framework, called IMGG, i.e., integrating multiple single-cell datasets through connected graphs and generative adversarial networks (GAN) to eliminate nonbiological differences between different batches. Compared with current methods, IMGG shows excellent performance on a variety of evaluation metrics, and the IMGG-corrected gene expression data incorporate features from multiple batches, allowing for downstream tasks such as differential gene expression analysis.
format Online
Article
Text
id pubmed-8876681
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88766812022-02-26 IMGG: Integrating Multiple Single-Cell Datasets through Connected Graphs and Generative Adversarial Networks Wang, Xun Zhang, Chaogang Zhang, Ying Meng, Xiangyu Zhang, Zhiyuan Shi, Xin Song, Tao Int J Mol Sci Article There is a strong need to eliminate batch-specific differences when integrating single-cell RNA-sequencing (scRNA-seq) datasets generated under different experimental conditions for downstream task analysis. Existing batch correction methods usually transform different batches of cells into one preselected “anchor” batch or a low-dimensional embedding space, and cannot take full advantage of useful information from multiple sources. We present a novel framework, called IMGG, i.e., integrating multiple single-cell datasets through connected graphs and generative adversarial networks (GAN) to eliminate nonbiological differences between different batches. Compared with current methods, IMGG shows excellent performance on a variety of evaluation metrics, and the IMGG-corrected gene expression data incorporate features from multiple batches, allowing for downstream tasks such as differential gene expression analysis. MDPI 2022-02-14 /pmc/articles/PMC8876681/ /pubmed/35216199 http://dx.doi.org/10.3390/ijms23042082 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Xun
Zhang, Chaogang
Zhang, Ying
Meng, Xiangyu
Zhang, Zhiyuan
Shi, Xin
Song, Tao
IMGG: Integrating Multiple Single-Cell Datasets through Connected Graphs and Generative Adversarial Networks
title IMGG: Integrating Multiple Single-Cell Datasets through Connected Graphs and Generative Adversarial Networks
title_full IMGG: Integrating Multiple Single-Cell Datasets through Connected Graphs and Generative Adversarial Networks
title_fullStr IMGG: Integrating Multiple Single-Cell Datasets through Connected Graphs and Generative Adversarial Networks
title_full_unstemmed IMGG: Integrating Multiple Single-Cell Datasets through Connected Graphs and Generative Adversarial Networks
title_short IMGG: Integrating Multiple Single-Cell Datasets through Connected Graphs and Generative Adversarial Networks
title_sort imgg: integrating multiple single-cell datasets through connected graphs and generative adversarial networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8876681/
https://www.ncbi.nlm.nih.gov/pubmed/35216199
http://dx.doi.org/10.3390/ijms23042082
work_keys_str_mv AT wangxun imggintegratingmultiplesinglecelldatasetsthroughconnectedgraphsandgenerativeadversarialnetworks
AT zhangchaogang imggintegratingmultiplesinglecelldatasetsthroughconnectedgraphsandgenerativeadversarialnetworks
AT zhangying imggintegratingmultiplesinglecelldatasetsthroughconnectedgraphsandgenerativeadversarialnetworks
AT mengxiangyu imggintegratingmultiplesinglecelldatasetsthroughconnectedgraphsandgenerativeadversarialnetworks
AT zhangzhiyuan imggintegratingmultiplesinglecelldatasetsthroughconnectedgraphsandgenerativeadversarialnetworks
AT shixin imggintegratingmultiplesinglecelldatasetsthroughconnectedgraphsandgenerativeadversarialnetworks
AT songtao imggintegratingmultiplesinglecelldatasetsthroughconnectedgraphsandgenerativeadversarialnetworks