Cargando…

Accurate integration of multiple heterogeneous single-cell RNA-seq data sets by learning contrastive biological variation

For most biological and medical applications of single-cell transcriptomics, an integrative study of multiple heterogeneous single-cell RNA sequencing (scRNA-seq) data sets is crucial. However, present approaches are unable to integrate diverse data sets from various biological conditions effectivel...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yang, Sheng, Qiongyu, Qi, Jing, Hua, Jiao, Yang, Bo, Wan, Lei, Jin, Shuilin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10317120/
https://www.ncbi.nlm.nih.gov/pubmed/37308294
http://dx.doi.org/10.1101/gr.277522.122
_version_ 1785067838146347008
author Zhou, Yang
Sheng, Qiongyu
Qi, Jing
Hua, Jiao
Yang, Bo
Wan, Lei
Jin, Shuilin
author_facet Zhou, Yang
Sheng, Qiongyu
Qi, Jing
Hua, Jiao
Yang, Bo
Wan, Lei
Jin, Shuilin
author_sort Zhou, Yang
collection PubMed
description For most biological and medical applications of single-cell transcriptomics, an integrative study of multiple heterogeneous single-cell RNA sequencing (scRNA-seq) data sets is crucial. However, present approaches are unable to integrate diverse data sets from various biological conditions effectively because of the confounding effects of biological and technical differences. We introduce single-cell integration (scInt), an integration method based on accurate, robust cell–cell similarity construction and unified contrastive biological variation learning from multiple scRNA-seq data sets. scInt provides a flexible and effective approach to transfer knowledge from the already integrated reference to the query. We show that scInt outperforms 10 other cutting-edge approaches using both simulated and real data sets, particularly in the case of complex experimental designs. Application of scInt to mouse developing tracheal epithelial data shows its ability to integrate development trajectories from different developmental stages. Furthermore, scInt successfully identifies functionally distinct condition-specific cell subpopulations in single-cell heterogeneous samples from a variety of biological conditions.
format Online
Article
Text
id pubmed-10317120
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-103171202023-11-01 Accurate integration of multiple heterogeneous single-cell RNA-seq data sets by learning contrastive biological variation Zhou, Yang Sheng, Qiongyu Qi, Jing Hua, Jiao Yang, Bo Wan, Lei Jin, Shuilin Genome Res Methods For most biological and medical applications of single-cell transcriptomics, an integrative study of multiple heterogeneous single-cell RNA sequencing (scRNA-seq) data sets is crucial. However, present approaches are unable to integrate diverse data sets from various biological conditions effectively because of the confounding effects of biological and technical differences. We introduce single-cell integration (scInt), an integration method based on accurate, robust cell–cell similarity construction and unified contrastive biological variation learning from multiple scRNA-seq data sets. scInt provides a flexible and effective approach to transfer knowledge from the already integrated reference to the query. We show that scInt outperforms 10 other cutting-edge approaches using both simulated and real data sets, particularly in the case of complex experimental designs. Application of scInt to mouse developing tracheal epithelial data shows its ability to integrate development trajectories from different developmental stages. Furthermore, scInt successfully identifies functionally distinct condition-specific cell subpopulations in single-cell heterogeneous samples from a variety of biological conditions. Cold Spring Harbor Laboratory Press 2023-05 /pmc/articles/PMC10317120/ /pubmed/37308294 http://dx.doi.org/10.1101/gr.277522.122 Text en © 2023 Zhou et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Methods
Zhou, Yang
Sheng, Qiongyu
Qi, Jing
Hua, Jiao
Yang, Bo
Wan, Lei
Jin, Shuilin
Accurate integration of multiple heterogeneous single-cell RNA-seq data sets by learning contrastive biological variation
title Accurate integration of multiple heterogeneous single-cell RNA-seq data sets by learning contrastive biological variation
title_full Accurate integration of multiple heterogeneous single-cell RNA-seq data sets by learning contrastive biological variation
title_fullStr Accurate integration of multiple heterogeneous single-cell RNA-seq data sets by learning contrastive biological variation
title_full_unstemmed Accurate integration of multiple heterogeneous single-cell RNA-seq data sets by learning contrastive biological variation
title_short Accurate integration of multiple heterogeneous single-cell RNA-seq data sets by learning contrastive biological variation
title_sort accurate integration of multiple heterogeneous single-cell rna-seq data sets by learning contrastive biological variation
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10317120/
https://www.ncbi.nlm.nih.gov/pubmed/37308294
http://dx.doi.org/10.1101/gr.277522.122
work_keys_str_mv AT zhouyang accurateintegrationofmultipleheterogeneoussinglecellrnaseqdatasetsbylearningcontrastivebiologicalvariation
AT shengqiongyu accurateintegrationofmultipleheterogeneoussinglecellrnaseqdatasetsbylearningcontrastivebiologicalvariation
AT qijing accurateintegrationofmultipleheterogeneoussinglecellrnaseqdatasetsbylearningcontrastivebiologicalvariation
AT huajiao accurateintegrationofmultipleheterogeneoussinglecellrnaseqdatasetsbylearningcontrastivebiologicalvariation
AT yangbo accurateintegrationofmultipleheterogeneoussinglecellrnaseqdatasetsbylearningcontrastivebiologicalvariation
AT wanlei accurateintegrationofmultipleheterogeneoussinglecellrnaseqdatasetsbylearningcontrastivebiologicalvariation
AT jinshuilin accurateintegrationofmultipleheterogeneoussinglecellrnaseqdatasetsbylearningcontrastivebiologicalvariation