Cargando…

DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data

BACKGROUND: Advances in droplet-based single-cell RNA-sequencing (scRNA-seq) have dramatically increased throughput, allowing tens of thousands of cells to be routinely sequenced in a single experiment. In addition to cells, droplets capture cell-free “ambient” RNA predominantly caused by lysis of c...

Descripción completa

Detalles Bibliográficos
Autores principales: Muskovic, Walter, Powell, Joseph E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8641258/
https://www.ncbi.nlm.nih.gov/pubmed/34857027
http://dx.doi.org/10.1186/s13059-021-02547-0
_version_ 1784609470422188032
author Muskovic, Walter
Powell, Joseph E.
author_facet Muskovic, Walter
Powell, Joseph E.
author_sort Muskovic, Walter
collection PubMed
description BACKGROUND: Advances in droplet-based single-cell RNA-sequencing (scRNA-seq) have dramatically increased throughput, allowing tens of thousands of cells to be routinely sequenced in a single experiment. In addition to cells, droplets capture cell-free “ambient” RNA predominantly caused by lysis of cells during sample preparation. Samples with high ambient RNA concentration can create challenges in accurately distinguishing cell-containing droplets and droplets containing ambient RNA. Current methods to separate these groups often retain a significant number of droplets that do not contain cells or empty droplets. Additionally, there are currently no methods available to detect droplets containing damaged cells, which comprise partially lysed cells, the original source of the ambient RNA. RESULTS: Here, we describe DropletQC, a new method that is able to detect empty droplets, damaged, and intact cells, and accurately distinguish them from one another. This approach is based on a novel quality control metric, the nuclear fraction, which quantifies for each droplet the fraction of RNA originating from unspliced, nuclear pre-mRNA. We demonstrate how DropletQC provides a powerful extension to existing computational methods for identifying empty droplets such as EmptyDrops. CONCLUSIONS: We implement DropletQC as an R package, which can be easily integrated into existing single-cell analysis workflows. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02547-0.
format Online
Article
Text
id pubmed-8641258
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-86412582021-12-06 DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data Muskovic, Walter Powell, Joseph E. Genome Biol Short Report BACKGROUND: Advances in droplet-based single-cell RNA-sequencing (scRNA-seq) have dramatically increased throughput, allowing tens of thousands of cells to be routinely sequenced in a single experiment. In addition to cells, droplets capture cell-free “ambient” RNA predominantly caused by lysis of cells during sample preparation. Samples with high ambient RNA concentration can create challenges in accurately distinguishing cell-containing droplets and droplets containing ambient RNA. Current methods to separate these groups often retain a significant number of droplets that do not contain cells or empty droplets. Additionally, there are currently no methods available to detect droplets containing damaged cells, which comprise partially lysed cells, the original source of the ambient RNA. RESULTS: Here, we describe DropletQC, a new method that is able to detect empty droplets, damaged, and intact cells, and accurately distinguish them from one another. This approach is based on a novel quality control metric, the nuclear fraction, which quantifies for each droplet the fraction of RNA originating from unspliced, nuclear pre-mRNA. We demonstrate how DropletQC provides a powerful extension to existing computational methods for identifying empty droplets such as EmptyDrops. CONCLUSIONS: We implement DropletQC as an R package, which can be easily integrated into existing single-cell analysis workflows. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02547-0. BioMed Central 2021-12-02 /pmc/articles/PMC8641258/ /pubmed/34857027 http://dx.doi.org/10.1186/s13059-021-02547-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Short Report
Muskovic, Walter
Powell, Joseph E.
DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_full DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_fullStr DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_full_unstemmed DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_short DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_sort dropletqc: improved identification of empty droplets and damaged cells in single-cell rna-seq data
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8641258/
https://www.ncbi.nlm.nih.gov/pubmed/34857027
http://dx.doi.org/10.1186/s13059-021-02547-0
work_keys_str_mv AT muskovicwalter dropletqcimprovedidentificationofemptydropletsanddamagedcellsinsinglecellrnaseqdata
AT powelljosephe dropletqcimprovedidentificationofemptydropletsanddamagedcellsinsinglecellrnaseqdata