Cargando…

Fast, sensitive, and accurate integration of single cell data with Harmony

The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologie...

Descripción completa

Detalles Bibliográficos
Autores principales: Korsunsky, Ilya, Millard, Nghia, Fan, Jean, Slowikowski, Kamil, Zhang, Fan, Wei, Kevin, Baglaenko, Yuriy, Brenner, Michael, Loh, Po-ru, Raychaudhuri, Soumya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6884693/
https://www.ncbi.nlm.nih.gov/pubmed/31740819
http://dx.doi.org/10.1038/s41592-019-0619-0
Descripción
Sumario:The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologies. Here, real biological differences are interspersed with technical differences. We present Harmony, an algorithm that projects cells into a shared embedding in which cells group by cell type rather than dataset-specific conditions. Harmony simultaneously accounts for multiple experimental and biological factors. In six analyses, we demonstrate the superior performance of Harmony to previously published algorithms. We show that Harmony requires dramatically fewer computational resources. It is the only currently available algorithm that makes the integration of ~10(6) cells feasible on a personal computer. We apply Harmony to PBMCs from datasets with large experimental differences, 5 studies of pancreatic islet cells, mouse embryogenesis datasets, and cross-modality spatial integration.