Cargando…

Cumulus provides cloud-based data analysis for large-scale single-cell and single-nucleus RNA-seq

Massively parallel single-cell and single-nucleus RNA-seq (sc/snRNA-seq) have opened the way to systematic tissue atlases in health and disease, but as the scale of data generation is growing, so does the need for computational pipelines for scaled analysis. Here, we developed Cumulus, a cloud-based...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Bo, Gould, Joshua, Yang, Yiming, Sarkizova, Siranush, Tabaka, Marcin, Ashenberg, Orr, Rosen, Yanay, Slyper, Michal, Kowalczyk, Monika S, Villani, Alexandra-Chloé, Tickle, Timothy, Hacohen, Nir, Rozenblatt-Rosen, Orit, Regev, Aviv
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7437817/
https://www.ncbi.nlm.nih.gov/pubmed/32719530
http://dx.doi.org/10.1038/s41592-020-0905-x
Descripción
Sumario:Massively parallel single-cell and single-nucleus RNA-seq (sc/snRNA-seq) have opened the way to systematic tissue atlases in health and disease, but as the scale of data generation is growing, so does the need for computational pipelines for scaled analysis. Here, we developed Cumulus, a cloud-based framework for analyzing large scale sc/snRNA-seq datasets. Cumulus combines the power of cloud computing with improvements in algorithm implementations to achieve high scalability, low cost, user-friendliness, and integrated support for a comprehensive set of features. We benchmark Cumulus on the Human Cell Atlas Census of Immune Cells dataset of bone marrow cells and show that it substantially improves efficiency over conventional frameworks, while maintaining or improving the quality of results, enabling large-scale studies.