Cargando…
Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching
Single-cell RNA sequencing (scRNA-seq) technology is poised to replace bulk cell RNA sequencing for many biological and medical applications as it allows users to measure gene expression levels in a cell type–specific manner. However, data produced by scRNA-seq often exhibit batch effects that can b...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015859/ https://www.ncbi.nlm.nih.gov/pubmed/33741686 http://dx.doi.org/10.1101/gr.261115.120 |
_version_ | 1783673761179893760 |
---|---|
author | Chen, Mengjie Zhan, Qi Mu, Zepeng Wang, Lili Zheng, Zhaohui Miao, Jinlin Zhu, Ping Li, Yang I. |
author_facet | Chen, Mengjie Zhan, Qi Mu, Zepeng Wang, Lili Zheng, Zhaohui Miao, Jinlin Zhu, Ping Li, Yang I. |
author_sort | Chen, Mengjie |
collection | PubMed |
description | Single-cell RNA sequencing (scRNA-seq) technology is poised to replace bulk cell RNA sequencing for many biological and medical applications as it allows users to measure gene expression levels in a cell type–specific manner. However, data produced by scRNA-seq often exhibit batch effects that can be specific to a cell type, to a sample, or to an experiment, which prevent integration or comparisons across multiple experiments. Here, we present Dmatch, a method that leverages an external expression atlas of human primary cells and kernel density matching to align multiple scRNA-seq experiments for downstream biological analysis. Dmatch facilitates alignment of scRNA-seq data sets with cell types that may overlap only partially and thus allows integration of multiple distinct scRNA-seq experiments to extract biological insights. In simulation, Dmatch compares favorably to other alignment methods, both in terms of reducing sample-specific clustering and in terms of avoiding overcorrection. When applied to scRNA-seq data collected from clinical samples in a healthy individual and five autoimmune disease patients, Dmatch enabled cell type–specific differential gene expression comparisons across biopsy sites and disease conditions and uncovered a shared population of pro-inflammatory monocytes across biopsy sites in RA patients. We further show that Dmatch increases the number of eQTLs mapped from population scRNA-seq data. Dmatch is fast, scalable, and improves the utility of scRNA-seq for several important applications. Dmatch is freely available online. |
format | Online Article Text |
id | pubmed-8015859 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-80158592021-04-21 Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching Chen, Mengjie Zhan, Qi Mu, Zepeng Wang, Lili Zheng, Zhaohui Miao, Jinlin Zhu, Ping Li, Yang I. Genome Res Method Single-cell RNA sequencing (scRNA-seq) technology is poised to replace bulk cell RNA sequencing for many biological and medical applications as it allows users to measure gene expression levels in a cell type–specific manner. However, data produced by scRNA-seq often exhibit batch effects that can be specific to a cell type, to a sample, or to an experiment, which prevent integration or comparisons across multiple experiments. Here, we present Dmatch, a method that leverages an external expression atlas of human primary cells and kernel density matching to align multiple scRNA-seq experiments for downstream biological analysis. Dmatch facilitates alignment of scRNA-seq data sets with cell types that may overlap only partially and thus allows integration of multiple distinct scRNA-seq experiments to extract biological insights. In simulation, Dmatch compares favorably to other alignment methods, both in terms of reducing sample-specific clustering and in terms of avoiding overcorrection. When applied to scRNA-seq data collected from clinical samples in a healthy individual and five autoimmune disease patients, Dmatch enabled cell type–specific differential gene expression comparisons across biopsy sites and disease conditions and uncovered a shared population of pro-inflammatory monocytes across biopsy sites in RA patients. We further show that Dmatch increases the number of eQTLs mapped from population scRNA-seq data. Dmatch is fast, scalable, and improves the utility of scRNA-seq for several important applications. Dmatch is freely available online. Cold Spring Harbor Laboratory Press 2021-04 /pmc/articles/PMC8015859/ /pubmed/33741686 http://dx.doi.org/10.1101/gr.261115.120 Text en © 2021 Chen et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Method Chen, Mengjie Zhan, Qi Mu, Zepeng Wang, Lili Zheng, Zhaohui Miao, Jinlin Zhu, Ping Li, Yang I. Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching |
title | Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching |
title_full | Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching |
title_fullStr | Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching |
title_full_unstemmed | Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching |
title_short | Alignment of single-cell RNA-seq samples without overcorrection using kernel density matching |
title_sort | alignment of single-cell rna-seq samples without overcorrection using kernel density matching |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8015859/ https://www.ncbi.nlm.nih.gov/pubmed/33741686 http://dx.doi.org/10.1101/gr.261115.120 |
work_keys_str_mv | AT chenmengjie alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching AT zhanqi alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching AT muzepeng alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching AT wanglili alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching AT zhengzhaohui alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching AT miaojinlin alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching AT zhuping alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching AT liyangi alignmentofsinglecellrnaseqsampleswithoutovercorrectionusingkerneldensitymatching |