Cargando…

Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching

Single-cell RNA sequencing (scRNA-seq) technology is poised to replace bulk cell RNA sequencing for many biological and medical applications as it allows users to measure gene expression levels in a cell type–specific manner. However, data produced by scRNA-seq often exhibit batch effects that can b...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Mengjie, Zhan, Qi, Mu, Zepeng, Wang, Lili, Zheng, Zhaohui, Miao, Jinlin, Zhu, Ping, Li, Yang I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015859/
https://www.ncbi.nlm.nih.gov/pubmed/33741686
http://dx.doi.org/10.1101/gr.261115.120
_version_ 1783673761179893760
author Chen, Mengjie
Zhan, Qi
Mu, Zepeng
Wang, Lili
Zheng, Zhaohui
Miao, Jinlin
Zhu, Ping
Li, Yang I.
author_facet Chen, Mengjie
Zhan, Qi
Mu, Zepeng
Wang, Lili
Zheng, Zhaohui
Miao, Jinlin
Zhu, Ping
Li, Yang I.
author_sort Chen, Mengjie
collection PubMed
description Single-cell RNA sequencing (scRNA-seq) technology is poised to replace bulk cell RNA sequencing for many biological and medical applications as it allows users to measure gene expression levels in a cell type–specific manner. However, data produced by scRNA-seq often exhibit batch effects that can be specific to a cell type, to a sample, or to an experiment, which prevent integration or comparisons across multiple experiments. Here, we present Dmatch, a method that leverages an external expression atlas of human primary cells and kernel density matching to align multiple scRNA-seq experiments for downstream biological analysis. Dmatch facilitates alignment of scRNA-seq data sets with cell types that may overlap only partially and thus allows integration of multiple distinct scRNA-seq experiments to extract biological insights. In simulation, Dmatch compares favorably to other alignment methods, both in terms of reducing sample-specific clustering and in terms of avoiding overcorrection. When applied to scRNA-seq data collected from clinical samples in a healthy individual and five autoimmune disease patients, Dmatch enabled cell type–specific differential gene expression comparisons across biopsy sites and disease conditions and uncovered a shared population of pro-inflammatory monocytes across biopsy sites in RA patients. We further show that Dmatch increases the number of eQTLs mapped from population scRNA-seq data. Dmatch is fast, scalable, and improves the utility of scRNA-seq for several important applications. Dmatch is freely available online.
format Online
Article
Text
id pubmed-8015859
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-80158592021-04-21 Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching Chen, Mengjie Zhan, Qi Mu, Zepeng Wang, Lili Zheng, Zhaohui Miao, Jinlin Zhu, Ping Li, Yang I. Genome Res Method Single-cell RNA sequencing (scRNA-seq) technology is poised to replace bulk cell RNA sequencing for many biological and medical applications as it allows users to measure gene expression levels in a cell type–specific manner. However, data produced by scRNA-seq often exhibit batch effects that can be specific to a cell type, to a sample, or to an experiment, which prevent integration or comparisons across multiple experiments. Here, we present Dmatch, a method that leverages an external expression atlas of human primary cells and kernel density matching to align multiple scRNA-seq experiments for downstream biological analysis. Dmatch facilitates alignment of scRNA-seq data sets with cell types that may overlap only partially and thus allows integration of multiple distinct scRNA-seq experiments to extract biological insights. In simulation, Dmatch compares favorably to other alignment methods, both in terms of reducing sample-specific clustering and in terms of avoiding overcorrection. When applied to scRNA-seq data collected from clinical samples in a healthy individual and five autoimmune disease patients, Dmatch enabled cell type–specific differential gene expression comparisons across biopsy sites and disease conditions and uncovered a shared population of pro-inflammatory monocytes across biopsy sites in RA patients. We further show that Dmatch increases the number of eQTLs mapped from population scRNA-seq data. Dmatch is fast, scalable, and improves the utility of scRNA-seq for several important applications. Dmatch is freely available online. Cold Spring Harbor Laboratory Press 2021-04 /pmc/articles/PMC8015859/ /pubmed/33741686 http://dx.doi.org/10.1101/gr.261115.120 Text en © 2021 Chen et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Chen, Mengjie
Zhan, Qi
Mu, Zepeng
Wang, Lili
Zheng, Zhaohui
Miao, Jinlin
Zhu, Ping
Li, Yang I.
Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching
title Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching
title_full Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching
title_fullStr Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching
title_full_unstemmed Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching
title_short Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching
title_sort alignment of single-cell rna-seq samples without overcorrection using kernel density matching
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015859/
https://www.ncbi.nlm.nih.gov/pubmed/33741686
http://dx.doi.org/10.1101/gr.261115.120
work_keys_str_mv AT chenmengjie alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching
AT zhanqi alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching
AT muzepeng alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching
AT wanglili alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching
AT zhengzhaohui alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching
AT miaojinlin alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching
AT zhuping alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching
AT liyangi alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching