Cargando…

CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data

A key challenge in single-cell RNA-sequencing (scRNA-seq) data analysis is batch effects that can obscure the biological signal of interest. Although there are various tools and methods to correct for batch effects, their performance can vary. Therefore, it is important to understand how batch effec...

Descripción completa

Detalles Bibliográficos
Autores principales: Lütge, Almut, Zyprych-Walczak, Joanna, Brykczynska Kunzmann, Urszula, Crowell, Helena L, Calini, Daniela, Malhotra, Dheeraj, Soneson, Charlotte, Robinson, Mark D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Life Science Alliance LLC 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7994321/
https://www.ncbi.nlm.nih.gov/pubmed/33758076
http://dx.doi.org/10.26508/lsa.202001004
_version_ 1783669730861645824
author Lütge, Almut
Zyprych-Walczak, Joanna
Brykczynska Kunzmann, Urszula
Crowell, Helena L
Calini, Daniela
Malhotra, Dheeraj
Soneson, Charlotte
Robinson, Mark D
author_facet Lütge, Almut
Zyprych-Walczak, Joanna
Brykczynska Kunzmann, Urszula
Crowell, Helena L
Calini, Daniela
Malhotra, Dheeraj
Soneson, Charlotte
Robinson, Mark D
author_sort Lütge, Almut
collection PubMed
description A key challenge in single-cell RNA-sequencing (scRNA-seq) data analysis is batch effects that can obscure the biological signal of interest. Although there are various tools and methods to correct for batch effects, their performance can vary. Therefore, it is important to understand how batch effects manifest to adjust for them. Here, we systematically explore batch effects across various scRNA-seq datasets according to magnitude, cell type specificity, and complexity. We developed a cell-specific mixing score (cms) that quantifies mixing of cells from multiple batches. By considering distance distributions, the score is able to detect local batch bias as well as differentiate between unbalanced batches and systematic differences between cells of the same cell type. We compare metrics in scRNA-seq data using real and synthetic datasets and whereas these metrics target the same question and are used interchangeably, we find differences in scalability, sensitivity, and ability to handle differentially abundant cell types. We find that cell-specific metrics outperform cell type–specific and global metrics and recommend them for both method benchmarks and batch exploration.
format Online
Article
Text
id pubmed-7994321
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Life Science Alliance LLC
record_format MEDLINE/PubMed
spelling pubmed-79943212021-04-01 CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data Lütge, Almut Zyprych-Walczak, Joanna Brykczynska Kunzmann, Urszula Crowell, Helena L Calini, Daniela Malhotra, Dheeraj Soneson, Charlotte Robinson, Mark D Life Sci Alliance Research Articles A key challenge in single-cell RNA-sequencing (scRNA-seq) data analysis is batch effects that can obscure the biological signal of interest. Although there are various tools and methods to correct for batch effects, their performance can vary. Therefore, it is important to understand how batch effects manifest to adjust for them. Here, we systematically explore batch effects across various scRNA-seq datasets according to magnitude, cell type specificity, and complexity. We developed a cell-specific mixing score (cms) that quantifies mixing of cells from multiple batches. By considering distance distributions, the score is able to detect local batch bias as well as differentiate between unbalanced batches and systematic differences between cells of the same cell type. We compare metrics in scRNA-seq data using real and synthetic datasets and whereas these metrics target the same question and are used interchangeably, we find differences in scalability, sensitivity, and ability to handle differentially abundant cell types. We find that cell-specific metrics outperform cell type–specific and global metrics and recommend them for both method benchmarks and batch exploration. Life Science Alliance LLC 2021-03-23 /pmc/articles/PMC7994321/ /pubmed/33758076 http://dx.doi.org/10.26508/lsa.202001004 Text en © 2021 Lütge et al. https://creativecommons.org/licenses/by/4.0/This article is available under a Creative Commons License (Attribution 4.0 International, as described at https://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Articles
Lütge, Almut
Zyprych-Walczak, Joanna
Brykczynska Kunzmann, Urszula
Crowell, Helena L
Calini, Daniela
Malhotra, Dheeraj
Soneson, Charlotte
Robinson, Mark D
CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data
title CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data
title_full CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data
title_fullStr CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data
title_full_unstemmed CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data
title_short CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data
title_sort cellmixs: quantifying and visualizing batch effects in single-cell rna-seq data
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7994321/
https://www.ncbi.nlm.nih.gov/pubmed/33758076
http://dx.doi.org/10.26508/lsa.202001004
work_keys_str_mv AT lutgealmut cellmixsquantifyingandvisualizingbatcheffectsinsinglecellrnaseqdata
AT zyprychwalczakjoanna cellmixsquantifyingandvisualizingbatcheffectsinsinglecellrnaseqdata
AT brykczynskakunzmannurszula cellmixsquantifyingandvisualizingbatcheffectsinsinglecellrnaseqdata
AT crowellhelenal cellmixsquantifyingandvisualizingbatcheffectsinsinglecellrnaseqdata
AT calinidaniela cellmixsquantifyingandvisualizingbatcheffectsinsinglecellrnaseqdata
AT malhotradheeraj cellmixsquantifyingandvisualizingbatcheffectsinsinglecellrnaseqdata
AT sonesoncharlotte cellmixsquantifyingandvisualizingbatcheffectsinsinglecellrnaseqdata
AT robinsonmarkd cellmixsquantifyingandvisualizingbatcheffectsinsinglecellrnaseqdata