Cargando…
Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro
Assays such as CITE-seq can measure the abundance of cell surface proteins on individual cells using antibody derived tags (ADTs). However, many ADTs have high levels of background noise that can obfuscate down-stream analyses. Using an exploratory analysis of PBMC datasets, we find that some drople...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979990/ https://www.ncbi.nlm.nih.gov/pubmed/36865227 http://dx.doi.org/10.1101/2023.01.27.525964 |
_version_ | 1784899829988589568 |
---|---|
author | Yin, Yuan Yajima, Masanao Campbell, Joshua D. |
author_facet | Yin, Yuan Yajima, Masanao Campbell, Joshua D. |
author_sort | Yin, Yuan |
collection | PubMed |
description | Assays such as CITE-seq can measure the abundance of cell surface proteins on individual cells using antibody derived tags (ADTs). However, many ADTs have high levels of background noise that can obfuscate down-stream analyses. Using an exploratory analysis of PBMC datasets, we find that some droplets that were originally called “empty” due to low levels of RNA contained high levels of ADTs and likely corresponded to neutrophils. We identified a novel type of artifact in the empty droplets called a “spongelet” which has medium levels of ADT expression and is distinct from ambient noise. ADT expression levels in the spongelets correlate to ADT expression levels in the background peak of true cells in several datasets suggesting that they can contribute to background noise along with ambient ADTs. We then developed DecontPro, a novel Bayesian hierarchical model that can decontaminate ADT data by estimating and removing contamination from these sources. DecontPro outperforms other decontamination tools in removing aberrantly expressed ADTs while retaining native ADTs and in improving clustering specificity. Overall, these results suggest that identification of empty drops should be performed separately for RNA and ADT data and that DecontPro can be incorporated into CITE-seq workflows to improve the quality of downstream analyses. |
format | Online Article Text |
id | pubmed-9979990 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-99799902023-03-03 Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro Yin, Yuan Yajima, Masanao Campbell, Joshua D. bioRxiv Article Assays such as CITE-seq can measure the abundance of cell surface proteins on individual cells using antibody derived tags (ADTs). However, many ADTs have high levels of background noise that can obfuscate down-stream analyses. Using an exploratory analysis of PBMC datasets, we find that some droplets that were originally called “empty” due to low levels of RNA contained high levels of ADTs and likely corresponded to neutrophils. We identified a novel type of artifact in the empty droplets called a “spongelet” which has medium levels of ADT expression and is distinct from ambient noise. ADT expression levels in the spongelets correlate to ADT expression levels in the background peak of true cells in several datasets suggesting that they can contribute to background noise along with ambient ADTs. We then developed DecontPro, a novel Bayesian hierarchical model that can decontaminate ADT data by estimating and removing contamination from these sources. DecontPro outperforms other decontamination tools in removing aberrantly expressed ADTs while retaining native ADTs and in improving clustering specificity. Overall, these results suggest that identification of empty drops should be performed separately for RNA and ADT data and that DecontPro can be incorporated into CITE-seq workflows to improve the quality of downstream analyses. Cold Spring Harbor Laboratory 2023-02-24 /pmc/articles/PMC9979990/ /pubmed/36865227 http://dx.doi.org/10.1101/2023.01.27.525964 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Yin, Yuan Yajima, Masanao Campbell, Joshua D. Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro |
title | Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro |
title_full | Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro |
title_fullStr | Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro |
title_full_unstemmed | Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro |
title_short | Characterization and decontamination of background noise in droplet-based single-cell protein expression data with DecontPro |
title_sort | characterization and decontamination of background noise in droplet-based single-cell protein expression data with decontpro |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979990/ https://www.ncbi.nlm.nih.gov/pubmed/36865227 http://dx.doi.org/10.1101/2023.01.27.525964 |
work_keys_str_mv | AT yinyuan characterizationanddecontaminationofbackgroundnoiseindropletbasedsinglecellproteinexpressiondatawithdecontpro AT yajimamasanao characterizationanddecontaminationofbackgroundnoiseindropletbasedsinglecellproteinexpressiondatawithdecontpro AT campbelljoshuad characterizationanddecontaminationofbackgroundnoiseindropletbasedsinglecellproteinexpressiondatawithdecontpro |