Cargando…
Fast, sensitive, and accurate integration of single cell data with Harmony
The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologie...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6884693/ https://www.ncbi.nlm.nih.gov/pubmed/31740819 http://dx.doi.org/10.1038/s41592-019-0619-0 |
_version_ | 1783474597189910528 |
---|---|
author | Korsunsky, Ilya Millard, Nghia Fan, Jean Slowikowski, Kamil Zhang, Fan Wei, Kevin Baglaenko, Yuriy Brenner, Michael Loh, Po-ru Raychaudhuri, Soumya |
author_facet | Korsunsky, Ilya Millard, Nghia Fan, Jean Slowikowski, Kamil Zhang, Fan Wei, Kevin Baglaenko, Yuriy Brenner, Michael Loh, Po-ru Raychaudhuri, Soumya |
author_sort | Korsunsky, Ilya |
collection | PubMed |
description | The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologies. Here, real biological differences are interspersed with technical differences. We present Harmony, an algorithm that projects cells into a shared embedding in which cells group by cell type rather than dataset-specific conditions. Harmony simultaneously accounts for multiple experimental and biological factors. In six analyses, we demonstrate the superior performance of Harmony to previously published algorithms. We show that Harmony requires dramatically fewer computational resources. It is the only currently available algorithm that makes the integration of ~10(6) cells feasible on a personal computer. We apply Harmony to PBMCs from datasets with large experimental differences, 5 studies of pancreatic islet cells, mouse embryogenesis datasets, and cross-modality spatial integration. |
format | Online Article Text |
id | pubmed-6884693 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
record_format | MEDLINE/PubMed |
spelling | pubmed-68846932020-05-18 Fast, sensitive, and accurate integration of single cell data with Harmony Korsunsky, Ilya Millard, Nghia Fan, Jean Slowikowski, Kamil Zhang, Fan Wei, Kevin Baglaenko, Yuriy Brenner, Michael Loh, Po-ru Raychaudhuri, Soumya Nat Methods Article The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologies. Here, real biological differences are interspersed with technical differences. We present Harmony, an algorithm that projects cells into a shared embedding in which cells group by cell type rather than dataset-specific conditions. Harmony simultaneously accounts for multiple experimental and biological factors. In six analyses, we demonstrate the superior performance of Harmony to previously published algorithms. We show that Harmony requires dramatically fewer computational resources. It is the only currently available algorithm that makes the integration of ~10(6) cells feasible on a personal computer. We apply Harmony to PBMCs from datasets with large experimental differences, 5 studies of pancreatic islet cells, mouse embryogenesis datasets, and cross-modality spatial integration. 2019-11-18 2019-12 /pmc/articles/PMC6884693/ /pubmed/31740819 http://dx.doi.org/10.1038/s41592-019-0619-0 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms |
spellingShingle | Article Korsunsky, Ilya Millard, Nghia Fan, Jean Slowikowski, Kamil Zhang, Fan Wei, Kevin Baglaenko, Yuriy Brenner, Michael Loh, Po-ru Raychaudhuri, Soumya Fast, sensitive, and accurate integration of single cell data with Harmony |
title | Fast, sensitive, and accurate integration of single cell data with Harmony |
title_full | Fast, sensitive, and accurate integration of single cell data with Harmony |
title_fullStr | Fast, sensitive, and accurate integration of single cell data with Harmony |
title_full_unstemmed | Fast, sensitive, and accurate integration of single cell data with Harmony |
title_short | Fast, sensitive, and accurate integration of single cell data with Harmony |
title_sort | fast, sensitive, and accurate integration of single cell data with harmony |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6884693/ https://www.ncbi.nlm.nih.gov/pubmed/31740819 http://dx.doi.org/10.1038/s41592-019-0619-0 |
work_keys_str_mv | AT korsunskyilya fastsensitiveandaccurateintegrationofsinglecelldatawithharmony AT millardnghia fastsensitiveandaccurateintegrationofsinglecelldatawithharmony AT fanjean fastsensitiveandaccurateintegrationofsinglecelldatawithharmony AT slowikowskikamil fastsensitiveandaccurateintegrationofsinglecelldatawithharmony AT zhangfan fastsensitiveandaccurateintegrationofsinglecelldatawithharmony AT weikevin fastsensitiveandaccurateintegrationofsinglecelldatawithharmony AT baglaenkoyuriy fastsensitiveandaccurateintegrationofsinglecelldatawithharmony AT brennermichael fastsensitiveandaccurateintegrationofsinglecelldatawithharmony AT lohporu fastsensitiveandaccurateintegrationofsinglecelldatawithharmony AT raychaudhurisoumya fastsensitiveandaccurateintegrationofsinglecelldatawithharmony |