Cargando…

deMULTIplex2: robust sample demultiplexing for scRNA-seq

Single-cell sample multiplexing technologies function by associating sample-specific barcode tags with cell-specific barcode tags, thereby increasing sample throughput, reducing batch effects, and decreasing reagent costs. Computational methods must then correctly associate cell-tags with sample-tag...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Qin, Conrad, Daniel N., Gartner, Zev J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120618/
https://www.ncbi.nlm.nih.gov/pubmed/37090649
http://dx.doi.org/10.1101/2023.04.11.536275
_version_ 1785029213722509312
author Zhu, Qin
Conrad, Daniel N.
Gartner, Zev J.
author_facet Zhu, Qin
Conrad, Daniel N.
Gartner, Zev J.
author_sort Zhu, Qin
collection PubMed
description Single-cell sample multiplexing technologies function by associating sample-specific barcode tags with cell-specific barcode tags, thereby increasing sample throughput, reducing batch effects, and decreasing reagent costs. Computational methods must then correctly associate cell-tags with sample-tags, but their performance deteriorates rapidly when working with datasets that are large, have imbalanced cell numbers across samples, or are noisy due to cross-contamination among sample tags - unavoidable features of many real-world experiments. Here we introduce deMULTIplex2, a mechanism-guided classification algorithm for multiplexed scRNA-seq data that successfully recovers many more cells across a spectrum of challenging datasets compared to existing methods. deMULTIplex2 is built on a statistical model of tag read counts derived from the physical mechanism of tag cross-contamination. Using generalized linear models and expectation-maximization, deMULTIplex2 probabilistically infers the sample identity of each cell and classifies singlets with high accuracy. Using Randomized Quantile Residuals, we show the model fits both simulated and real datasets. Benchmarking analysis suggests that deMULTIplex2 outperforms existing algorithms, especially when handling large and noisy single-cell datasets or those with unbalanced sample compositions.
format Online
Article
Text
id pubmed-10120618
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-101206182023-04-22 deMULTIplex2: robust sample demultiplexing for scRNA-seq Zhu, Qin Conrad, Daniel N. Gartner, Zev J. bioRxiv Article Single-cell sample multiplexing technologies function by associating sample-specific barcode tags with cell-specific barcode tags, thereby increasing sample throughput, reducing batch effects, and decreasing reagent costs. Computational methods must then correctly associate cell-tags with sample-tags, but their performance deteriorates rapidly when working with datasets that are large, have imbalanced cell numbers across samples, or are noisy due to cross-contamination among sample tags - unavoidable features of many real-world experiments. Here we introduce deMULTIplex2, a mechanism-guided classification algorithm for multiplexed scRNA-seq data that successfully recovers many more cells across a spectrum of challenging datasets compared to existing methods. deMULTIplex2 is built on a statistical model of tag read counts derived from the physical mechanism of tag cross-contamination. Using generalized linear models and expectation-maximization, deMULTIplex2 probabilistically infers the sample identity of each cell and classifies singlets with high accuracy. Using Randomized Quantile Residuals, we show the model fits both simulated and real datasets. Benchmarking analysis suggests that deMULTIplex2 outperforms existing algorithms, especially when handling large and noisy single-cell datasets or those with unbalanced sample compositions. Cold Spring Harbor Laboratory 2023-04-12 /pmc/articles/PMC10120618/ /pubmed/37090649 http://dx.doi.org/10.1101/2023.04.11.536275 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Zhu, Qin
Conrad, Daniel N.
Gartner, Zev J.
deMULTIplex2: robust sample demultiplexing for scRNA-seq
title deMULTIplex2: robust sample demultiplexing for scRNA-seq
title_full deMULTIplex2: robust sample demultiplexing for scRNA-seq
title_fullStr deMULTIplex2: robust sample demultiplexing for scRNA-seq
title_full_unstemmed deMULTIplex2: robust sample demultiplexing for scRNA-seq
title_short deMULTIplex2: robust sample demultiplexing for scRNA-seq
title_sort demultiplex2: robust sample demultiplexing for scrna-seq
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120618/
https://www.ncbi.nlm.nih.gov/pubmed/37090649
http://dx.doi.org/10.1101/2023.04.11.536275
work_keys_str_mv AT zhuqin demultiplex2robustsampledemultiplexingforscrnaseq
AT conraddanieln demultiplex2robustsampledemultiplexingforscrnaseq
AT gartnerzevj demultiplex2robustsampledemultiplexingforscrnaseq