Cargando…

Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2

The recent emergence of multi-sample multi-condition single-cell multi-cohort studies allows researchers to investigate different cell states. The effective integration of multiple large-cohort studies promises biological insights into cells under different conditions that individual studies cannot...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Yingxin, Cao, Yue, Willie, Elijah, Patrick, Ellis, Yang, Jean Y. H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10352351/
https://www.ncbi.nlm.nih.gov/pubmed/37460600
http://dx.doi.org/10.1038/s41467-023-39923-2
_version_ 1785074495613042688
author Lin, Yingxin
Cao, Yue
Willie, Elijah
Patrick, Ellis
Yang, Jean Y. H.
author_facet Lin, Yingxin
Cao, Yue
Willie, Elijah
Patrick, Ellis
Yang, Jean Y. H.
author_sort Lin, Yingxin
collection PubMed
description The recent emergence of multi-sample multi-condition single-cell multi-cohort studies allows researchers to investigate different cell states. The effective integration of multiple large-cohort studies promises biological insights into cells under different conditions that individual studies cannot provide. Here, we present scMerge2, a scalable algorithm that allows data integration of atlas-scale multi-sample multi-condition single-cell studies. We have generalized scMerge2 to enable the merging of millions of cells from single-cell studies generated by various single-cell technologies. Using a large COVID-19 data collection with over five million cells from 1000+ individuals, we demonstrate that scMerge2 enables multi-sample multi-condition scRNA-seq data integration from multiple cohorts and reveals signatures derived from cell-type expression that are more accurate in discriminating disease progression. Further, we demonstrate that scMerge2 can remove dataset variability in CyTOF, imaging mass cytometry and CITE-seq experiments, demonstrating its applicability to a broad spectrum of single-cell profiling technologies.
format Online
Article
Text
id pubmed-10352351
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103523512023-07-19 Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2 Lin, Yingxin Cao, Yue Willie, Elijah Patrick, Ellis Yang, Jean Y. H. Nat Commun Article The recent emergence of multi-sample multi-condition single-cell multi-cohort studies allows researchers to investigate different cell states. The effective integration of multiple large-cohort studies promises biological insights into cells under different conditions that individual studies cannot provide. Here, we present scMerge2, a scalable algorithm that allows data integration of atlas-scale multi-sample multi-condition single-cell studies. We have generalized scMerge2 to enable the merging of millions of cells from single-cell studies generated by various single-cell technologies. Using a large COVID-19 data collection with over five million cells from 1000+ individuals, we demonstrate that scMerge2 enables multi-sample multi-condition scRNA-seq data integration from multiple cohorts and reveals signatures derived from cell-type expression that are more accurate in discriminating disease progression. Further, we demonstrate that scMerge2 can remove dataset variability in CyTOF, imaging mass cytometry and CITE-seq experiments, demonstrating its applicability to a broad spectrum of single-cell profiling technologies. Nature Publishing Group UK 2023-07-17 /pmc/articles/PMC10352351/ /pubmed/37460600 http://dx.doi.org/10.1038/s41467-023-39923-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Lin, Yingxin
Cao, Yue
Willie, Elijah
Patrick, Ellis
Yang, Jean Y. H.
Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2
title Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2
title_full Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2
title_fullStr Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2
title_full_unstemmed Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2
title_short Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2
title_sort atlas-scale single-cell multi-sample multi-condition data integration using scmerge2
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10352351/
https://www.ncbi.nlm.nih.gov/pubmed/37460600
http://dx.doi.org/10.1038/s41467-023-39923-2
work_keys_str_mv AT linyingxin atlasscalesinglecellmultisamplemulticonditiondataintegrationusingscmerge2
AT caoyue atlasscalesinglecellmultisamplemulticonditiondataintegrationusingscmerge2
AT willieelijah atlasscalesinglecellmultisamplemulticonditiondataintegrationusingscmerge2
AT patrickellis atlasscalesinglecellmultisamplemulticonditiondataintegrationusingscmerge2
AT yangjeanyh atlasscalesinglecellmultisamplemulticonditiondataintegrationusingscmerge2