Cargando…

GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing

Identifying and removing multiplets are essential to improving the scalability and the reliability of single cell RNA sequencing (scRNA-seq). Multiplets create artificial cell types in the dataset. We propose a Gaussian mixture model-based multiplet identification method, GMM-Demux. GMM-Demux accura...

Descripción completa

Detalles Bibliográficos
Autores principales: Xin, Hongyi, Lian, Qiuyu, Jiang, Yale, Luo, Jiadi, Wang, Xinjun, Erb, Carla, Xu, Zhongli, Zhang, Xiaoyi, Heidrich-O’Hare, Elisa, Yan, Qi, Duerr, Richard H., Chen, Kong, Chen, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7393741/
https://www.ncbi.nlm.nih.gov/pubmed/32731885
http://dx.doi.org/10.1186/s13059-020-02084-2
_version_ 1783565096312635392
author Xin, Hongyi
Lian, Qiuyu
Jiang, Yale
Luo, Jiadi
Wang, Xinjun
Erb, Carla
Xu, Zhongli
Zhang, Xiaoyi
Heidrich-O’Hare, Elisa
Yan, Qi
Duerr, Richard H.
Chen, Kong
Chen, Wei
author_facet Xin, Hongyi
Lian, Qiuyu
Jiang, Yale
Luo, Jiadi
Wang, Xinjun
Erb, Carla
Xu, Zhongli
Zhang, Xiaoyi
Heidrich-O’Hare, Elisa
Yan, Qi
Duerr, Richard H.
Chen, Kong
Chen, Wei
author_sort Xin, Hongyi
collection PubMed
description Identifying and removing multiplets are essential to improving the scalability and the reliability of single cell RNA sequencing (scRNA-seq). Multiplets create artificial cell types in the dataset. We propose a Gaussian mixture model-based multiplet identification method, GMM-Demux. GMM-Demux accurately identifies and removes multiplets through sample barcoding, including cell hashing and MULTI-seq. GMM-Demux uses a droplet formation model to authenticate putative cell types discovered from a scRNA-seq dataset. We generate two in-house cell-hashing datasets and compared GMM-Demux against three state-of-the-art sample barcoding classifiers. We show that GMM-Demux is stable and highly accurate and recognizes 9 multiplet-induced fake cell types in a PBMC dataset.
format Online
Article
Text
id pubmed-7393741
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73937412020-08-04 GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing Xin, Hongyi Lian, Qiuyu Jiang, Yale Luo, Jiadi Wang, Xinjun Erb, Carla Xu, Zhongli Zhang, Xiaoyi Heidrich-O’Hare, Elisa Yan, Qi Duerr, Richard H. Chen, Kong Chen, Wei Genome Biol Method Identifying and removing multiplets are essential to improving the scalability and the reliability of single cell RNA sequencing (scRNA-seq). Multiplets create artificial cell types in the dataset. We propose a Gaussian mixture model-based multiplet identification method, GMM-Demux. GMM-Demux accurately identifies and removes multiplets through sample barcoding, including cell hashing and MULTI-seq. GMM-Demux uses a droplet formation model to authenticate putative cell types discovered from a scRNA-seq dataset. We generate two in-house cell-hashing datasets and compared GMM-Demux against three state-of-the-art sample barcoding classifiers. We show that GMM-Demux is stable and highly accurate and recognizes 9 multiplet-induced fake cell types in a PBMC dataset. BioMed Central 2020-07-30 /pmc/articles/PMC7393741/ /pubmed/32731885 http://dx.doi.org/10.1186/s13059-020-02084-2 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Method
Xin, Hongyi
Lian, Qiuyu
Jiang, Yale
Luo, Jiadi
Wang, Xinjun
Erb, Carla
Xu, Zhongli
Zhang, Xiaoyi
Heidrich-O’Hare, Elisa
Yan, Qi
Duerr, Richard H.
Chen, Kong
Chen, Wei
GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing
title GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing
title_full GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing
title_fullStr GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing
title_full_unstemmed GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing
title_short GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing
title_sort gmm-demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7393741/
https://www.ncbi.nlm.nih.gov/pubmed/32731885
http://dx.doi.org/10.1186/s13059-020-02084-2
work_keys_str_mv AT xinhongyi gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT lianqiuyu gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT jiangyale gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT luojiadi gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT wangxinjun gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT erbcarla gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT xuzhongli gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT zhangxiaoyi gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT heidrichohareelisa gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT yanqi gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT duerrrichardh gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT chenkong gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing
AT chenwei gmmdemuxsampledemultiplexingmultipletdetectionexperimentplanningandnovelcelltypeverificationinsinglecellsequencing