Cargando…

Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro

Assays such as CITE-seq can measure the abundance of cell surface proteins on individual cells using antibody derived tags (ADTs). However, many ADTs have high levels of background noise that can obfuscate down-stream analyses. Using an exploratory analysis of PBMC datasets, we find that some drople...

Descripción completa

Detalles Bibliográficos
Autores principales: Yin, Yuan, Yajima, Masanao, Campbell, Joshua D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979990/
https://www.ncbi.nlm.nih.gov/pubmed/36865227
http://dx.doi.org/10.1101/2023.01.27.525964
_version_ 1784899829988589568
author Yin, Yuan
Yajima, Masanao
Campbell, Joshua D.
author_facet Yin, Yuan
Yajima, Masanao
Campbell, Joshua D.
author_sort Yin, Yuan
collection PubMed
description Assays such as CITE-seq can measure the abundance of cell surface proteins on individual cells using antibody derived tags (ADTs). However, many ADTs have high levels of background noise that can obfuscate down-stream analyses. Using an exploratory analysis of PBMC datasets, we find that some droplets that were originally called “empty” due to low levels of RNA contained high levels of ADTs and likely corresponded to neutrophils. We identified a novel type of artifact in the empty droplets called a “spongelet” which has medium levels of ADT expression and is distinct from ambient noise. ADT expression levels in the spongelets correlate to ADT expression levels in the background peak of true cells in several datasets suggesting that they can contribute to background noise along with ambient ADTs. We then developed DecontPro, a novel Bayesian hierarchical model that can decontaminate ADT data by estimating and removing contamination from these sources. DecontPro outperforms other decontamination tools in removing aberrantly expressed ADTs while retaining native ADTs and in improving clustering specificity. Overall, these results suggest that identification of empty drops should be performed separately for RNA and ADT data and that DecontPro can be incorporated into CITE-seq workflows to improve the quality of downstream analyses.
format Online
Article
Text
id pubmed-9979990
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-99799902023-03-03 Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro Yin, Yuan Yajima, Masanao Campbell, Joshua D. bioRxiv Article Assays such as CITE-seq can measure the abundance of cell surface proteins on individual cells using antibody derived tags (ADTs). However, many ADTs have high levels of background noise that can obfuscate down-stream analyses. Using an exploratory analysis of PBMC datasets, we find that some droplets that were originally called “empty” due to low levels of RNA contained high levels of ADTs and likely corresponded to neutrophils. We identified a novel type of artifact in the empty droplets called a “spongelet” which has medium levels of ADT expression and is distinct from ambient noise. ADT expression levels in the spongelets correlate to ADT expression levels in the background peak of true cells in several datasets suggesting that they can contribute to background noise along with ambient ADTs. We then developed DecontPro, a novel Bayesian hierarchical model that can decontaminate ADT data by estimating and removing contamination from these sources. DecontPro outperforms other decontamination tools in removing aberrantly expressed ADTs while retaining native ADTs and in improving clustering specificity. Overall, these results suggest that identification of empty drops should be performed separately for RNA and ADT data and that DecontPro can be incorporated into CITE-seq workflows to improve the quality of downstream analyses. Cold Spring Harbor Laboratory 2023-02-24 /pmc/articles/PMC9979990/ /pubmed/36865227 http://dx.doi.org/10.1101/2023.01.27.525964 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Yin, Yuan
Yajima, Masanao
Campbell, Joshua D.
Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro
title Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro
title_full Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro
title_fullStr Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro
title_full_unstemmed Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro
title_short Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro
title_sort characterization and decontamination of background noise in droplet-based single-cell protein expression data with decontpro
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979990/
https://www.ncbi.nlm.nih.gov/pubmed/36865227
http://dx.doi.org/10.1101/2023.01.27.525964
work_keys_str_mv AT yinyuan characterizationanddecontaminationofbackgroundnoiseindropletbasedsinglecellproteinexpressiondatawithdecontpro
AT yajimamasanao characterizationanddecontaminationofbackgroundnoiseindropletbasedsinglecellproteinexpressiondatawithdecontpro
AT campbelljoshuad characterizationanddecontaminationofbackgroundnoiseindropletbasedsinglecellproteinexpressiondatawithdecontpro