Cargando…

demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models

MOTIVATION: Droplet-based single-cell RNA sequencing (scRNA-seq) is widely used in biomedical research for interrogating the transcriptomes of single cells on a large scale. Pooling and processing cells from different samples together can reduce costs and batch effects. To pool cells, they are often...

Descripción completa

Detalles Bibliográficos
Autor principal: Klein, Hans-Ulrich
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10412409/
https://www.ncbi.nlm.nih.gov/pubmed/37527018
http://dx.doi.org/10.1093/bioinformatics/btad481
_version_ 1785086899590791168
author Klein, Hans-Ulrich
author_facet Klein, Hans-Ulrich
author_sort Klein, Hans-Ulrich
collection PubMed
description MOTIVATION: Droplet-based single-cell RNA sequencing (scRNA-seq) is widely used in biomedical research for interrogating the transcriptomes of single cells on a large scale. Pooling and processing cells from different samples together can reduce costs and batch effects. To pool cells, they are often first labeled with hashtag oligonucleotides (HTOs). These HTOs are sequenced alongside the cells’ RNA in the droplets and subsequently used to computationally assign each droplet to its sample of origin, a process referred to as demultiplexing. Accurate demultiplexing is crucial but can be challenging due to background HTOs, low-quality cells/cell debris, and multiplets. RESULTS: A new demultiplexing method based on negative binomial regression mixture models is introduced. The method, called demuxmix, implements two significant improvements. First, demuxmix’s probabilistic classification framework provides error probabilities for droplet assignments that can be used to discard uncertain droplets and inform about the quality of the HTO data and the success of the demultiplexing process. Second, demuxmix utilizes the positive association between detected genes in the RNA library and HTO counts to explain parts of the variance in the HTO data resulting in improved droplet assignments. The improved performance of demuxmix compared with existing demultiplexing methods is assessed using real and simulated data. Finally, the feasibility of accurately demultiplexing experimental designs where non-labeled cells are pooled with labeled cells is demonstrated. AVAILABILITY AND IMPLEMENTATION: R/Bioconductor package demuxmix (https://doi.org/doi:10.18129/B9.bioc.demuxmix)
format Online
Article
Text
id pubmed-10412409
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104124092023-08-11 demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models Klein, Hans-Ulrich Bioinformatics Original Paper MOTIVATION: Droplet-based single-cell RNA sequencing (scRNA-seq) is widely used in biomedical research for interrogating the transcriptomes of single cells on a large scale. Pooling and processing cells from different samples together can reduce costs and batch effects. To pool cells, they are often first labeled with hashtag oligonucleotides (HTOs). These HTOs are sequenced alongside the cells’ RNA in the droplets and subsequently used to computationally assign each droplet to its sample of origin, a process referred to as demultiplexing. Accurate demultiplexing is crucial but can be challenging due to background HTOs, low-quality cells/cell debris, and multiplets. RESULTS: A new demultiplexing method based on negative binomial regression mixture models is introduced. The method, called demuxmix, implements two significant improvements. First, demuxmix’s probabilistic classification framework provides error probabilities for droplet assignments that can be used to discard uncertain droplets and inform about the quality of the HTO data and the success of the demultiplexing process. Second, demuxmix utilizes the positive association between detected genes in the RNA library and HTO counts to explain parts of the variance in the HTO data resulting in improved droplet assignments. The improved performance of demuxmix compared with existing demultiplexing methods is assessed using real and simulated data. Finally, the feasibility of accurately demultiplexing experimental designs where non-labeled cells are pooled with labeled cells is demonstrated. AVAILABILITY AND IMPLEMENTATION: R/Bioconductor package demuxmix (https://doi.org/doi:10.18129/B9.bioc.demuxmix) Oxford University Press 2023-08-01 /pmc/articles/PMC10412409/ /pubmed/37527018 http://dx.doi.org/10.1093/bioinformatics/btad481 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Klein, Hans-Ulrich
demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models
title demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models
title_full demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models
title_fullStr demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models
title_full_unstemmed demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models
title_short demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models
title_sort demuxmix: demultiplexing oligonucleotide-barcoded single-cell rna sequencing data with regression mixture models
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10412409/
https://www.ncbi.nlm.nih.gov/pubmed/37527018
http://dx.doi.org/10.1093/bioinformatics/btad481
work_keys_str_mv AT kleinhansulrich demuxmixdemultiplexingoligonucleotidebarcodedsinglecellrnasequencingdatawithregressionmixturemodels