Cargando…

Fast, sensitive, and accurate integration of single cell data with Harmony

The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologie...

Descripción completa

Detalles Bibliográficos
Autores principales: Korsunsky, Ilya, Millard, Nghia, Fan, Jean, Slowikowski, Kamil, Zhang, Fan, Wei, Kevin, Baglaenko, Yuriy, Brenner, Michael, Loh, Po-ru, Raychaudhuri, Soumya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6884693/
https://www.ncbi.nlm.nih.gov/pubmed/31740819
http://dx.doi.org/10.1038/s41592-019-0619-0
_version_ 1783474597189910528
author Korsunsky, Ilya
Millard, Nghia
Fan, Jean
Slowikowski, Kamil
Zhang, Fan
Wei, Kevin
Baglaenko, Yuriy
Brenner, Michael
Loh, Po-ru
Raychaudhuri, Soumya
author_facet Korsunsky, Ilya
Millard, Nghia
Fan, Jean
Slowikowski, Kamil
Zhang, Fan
Wei, Kevin
Baglaenko, Yuriy
Brenner, Michael
Loh, Po-ru
Raychaudhuri, Soumya
author_sort Korsunsky, Ilya
collection PubMed
description The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologies. Here, real biological differences are interspersed with technical differences. We present Harmony, an algorithm that projects cells into a shared embedding in which cells group by cell type rather than dataset-specific conditions. Harmony simultaneously accounts for multiple experimental and biological factors. In six analyses, we demonstrate the superior performance of Harmony to previously published algorithms. We show that Harmony requires dramatically fewer computational resources. It is the only currently available algorithm that makes the integration of ~10(6) cells feasible on a personal computer. We apply Harmony to PBMCs from datasets with large experimental differences, 5 studies of pancreatic islet cells, mouse embryogenesis datasets, and cross-modality spatial integration.
format Online
Article
Text
id pubmed-6884693
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-68846932020-05-18 Fast, sensitive, and accurate integration of single cell data with Harmony Korsunsky, Ilya Millard, Nghia Fan, Jean Slowikowski, Kamil Zhang, Fan Wei, Kevin Baglaenko, Yuriy Brenner, Michael Loh, Po-ru Raychaudhuri, Soumya Nat Methods Article The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologies. Here, real biological differences are interspersed with technical differences. We present Harmony, an algorithm that projects cells into a shared embedding in which cells group by cell type rather than dataset-specific conditions. Harmony simultaneously accounts for multiple experimental and biological factors. In six analyses, we demonstrate the superior performance of Harmony to previously published algorithms. We show that Harmony requires dramatically fewer computational resources. It is the only currently available algorithm that makes the integration of ~10(6) cells feasible on a personal computer. We apply Harmony to PBMCs from datasets with large experimental differences, 5 studies of pancreatic islet cells, mouse embryogenesis datasets, and cross-modality spatial integration. 2019-11-18 2019-12 /pmc/articles/PMC6884693/ /pubmed/31740819 http://dx.doi.org/10.1038/s41592-019-0619-0 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Korsunsky, Ilya
Millard, Nghia
Fan, Jean
Slowikowski, Kamil
Zhang, Fan
Wei, Kevin
Baglaenko, Yuriy
Brenner, Michael
Loh, Po-ru
Raychaudhuri, Soumya
Fast, sensitive, and accurate integration of single cell data with Harmony
title Fast, sensitive, and accurate integration of single cell data with Harmony
title_full Fast, sensitive, and accurate integration of single cell data with Harmony
title_fullStr Fast, sensitive, and accurate integration of single cell data with Harmony
title_full_unstemmed Fast, sensitive, and accurate integration of single cell data with Harmony
title_short Fast, sensitive, and accurate integration of single cell data with Harmony
title_sort fast, sensitive, and accurate integration of single cell data with harmony
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6884693/
https://www.ncbi.nlm.nih.gov/pubmed/31740819
http://dx.doi.org/10.1038/s41592-019-0619-0
work_keys_str_mv AT korsunskyilya fastsensitiveandaccurateintegrationofsinglecelldatawithharmony
AT millardnghia fastsensitiveandaccurateintegrationofsinglecelldatawithharmony
AT fanjean fastsensitiveandaccurateintegrationofsinglecelldatawithharmony
AT slowikowskikamil fastsensitiveandaccurateintegrationofsinglecelldatawithharmony
AT zhangfan fastsensitiveandaccurateintegrationofsinglecelldatawithharmony
AT weikevin fastsensitiveandaccurateintegrationofsinglecelldatawithharmony
AT baglaenkoyuriy fastsensitiveandaccurateintegrationofsinglecelldatawithharmony
AT brennermichael fastsensitiveandaccurateintegrationofsinglecelldatawithharmony
AT lohporu fastsensitiveandaccurateintegrationofsinglecelldatawithharmony
AT raychaudhurisoumya fastsensitiveandaccurateintegrationofsinglecelldatawithharmony