Cargando…

A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa

Single-cell RNA sequencing (scRNA-seq) has been widely applied to discover new cell types by detecting sub-populations in a heterogeneous group of cells. Since scRNA-seq experiments have lower read coverage/tag counts and introduce more technical biases compared to bulk RNA-seq experiments, the limi...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Huanan, Lee, Catherine A. A., Li, Zhuliu, Garbe, John R., Eide, Cindy R., Petegrosso, Raphael, Kuang, Rui, Tolar, Jakub
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5908193/
https://www.ncbi.nlm.nih.gov/pubmed/29630593
http://dx.doi.org/10.1371/journal.pcbi.1006053
_version_ 1783315679017959424
author Zhang, Huanan
Lee, Catherine A. A.
Li, Zhuliu
Garbe, John R.
Eide, Cindy R.
Petegrosso, Raphael
Kuang, Rui
Tolar, Jakub
author_facet Zhang, Huanan
Lee, Catherine A. A.
Li, Zhuliu
Garbe, John R.
Eide, Cindy R.
Petegrosso, Raphael
Kuang, Rui
Tolar, Jakub
author_sort Zhang, Huanan
collection PubMed
description Single-cell RNA sequencing (scRNA-seq) has been widely applied to discover new cell types by detecting sub-populations in a heterogeneous group of cells. Since scRNA-seq experiments have lower read coverage/tag counts and introduce more technical biases compared to bulk RNA-seq experiments, the limited number of sampled cells combined with the experimental biases and other dataset specific variations presents a challenge to cross-dataset analysis and discovery of relevant biological variations across multiple cell populations. In this paper, we introduce a method of variance-driven multitask clustering of single-cell RNA-seq data (scVDMC) that utilizes multiple single-cell populations from biological replicates or different samples. scVDMC clusters single cells in multiple scRNA-seq experiments of similar cell types and markers but varying expression patterns such that the scRNA-seq data are better integrated than typical pooled analyses which only increase the sample size. By controlling the variance among the cell clusters within each dataset and across all the datasets, scVDMC detects cell sub-populations in each individual experiment with shared cell-type markers but varying cluster centers among all the experiments. Applied to two real scRNA-seq datasets with several replicates and one large-scale droplet-based dataset on three patient samples, scVDMC more accurately detected cell populations and known cell markers than pooled clustering and other recently proposed scRNA-seq clustering methods. In the case study applied to in-house Recessive Dystrophic Epidermolysis Bullosa (RDEB) scRNA-seq data, scVDMC revealed several new cell types and unknown markers validated by flow cytometry. MATLAB/Octave code available at https://github.com/kuanglab/scVDMC.
format Online
Article
Text
id pubmed-5908193
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-59081932018-05-04 A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa Zhang, Huanan Lee, Catherine A. A. Li, Zhuliu Garbe, John R. Eide, Cindy R. Petegrosso, Raphael Kuang, Rui Tolar, Jakub PLoS Comput Biol Research Article Single-cell RNA sequencing (scRNA-seq) has been widely applied to discover new cell types by detecting sub-populations in a heterogeneous group of cells. Since scRNA-seq experiments have lower read coverage/tag counts and introduce more technical biases compared to bulk RNA-seq experiments, the limited number of sampled cells combined with the experimental biases and other dataset specific variations presents a challenge to cross-dataset analysis and discovery of relevant biological variations across multiple cell populations. In this paper, we introduce a method of variance-driven multitask clustering of single-cell RNA-seq data (scVDMC) that utilizes multiple single-cell populations from biological replicates or different samples. scVDMC clusters single cells in multiple scRNA-seq experiments of similar cell types and markers but varying expression patterns such that the scRNA-seq data are better integrated than typical pooled analyses which only increase the sample size. By controlling the variance among the cell clusters within each dataset and across all the datasets, scVDMC detects cell sub-populations in each individual experiment with shared cell-type markers but varying cluster centers among all the experiments. Applied to two real scRNA-seq datasets with several replicates and one large-scale droplet-based dataset on three patient samples, scVDMC more accurately detected cell populations and known cell markers than pooled clustering and other recently proposed scRNA-seq clustering methods. In the case study applied to in-house Recessive Dystrophic Epidermolysis Bullosa (RDEB) scRNA-seq data, scVDMC revealed several new cell types and unknown markers validated by flow cytometry. MATLAB/Octave code available at https://github.com/kuanglab/scVDMC. Public Library of Science 2018-04-09 /pmc/articles/PMC5908193/ /pubmed/29630593 http://dx.doi.org/10.1371/journal.pcbi.1006053 Text en © 2018 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Huanan
Lee, Catherine A. A.
Li, Zhuliu
Garbe, John R.
Eide, Cindy R.
Petegrosso, Raphael
Kuang, Rui
Tolar, Jakub
A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa
title A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa
title_full A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa
title_fullStr A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa
title_full_unstemmed A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa
title_short A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa
title_sort multitask clustering approach for single-cell rna-seq analysis in recessive dystrophic epidermolysis bullosa
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5908193/
https://www.ncbi.nlm.nih.gov/pubmed/29630593
http://dx.doi.org/10.1371/journal.pcbi.1006053
work_keys_str_mv AT zhanghuanan amultitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT leecatherineaa amultitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT lizhuliu amultitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT garbejohnr amultitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT eidecindyr amultitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT petegrossoraphael amultitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT kuangrui amultitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT tolarjakub amultitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT zhanghuanan multitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT leecatherineaa multitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT lizhuliu multitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT garbejohnr multitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT eidecindyr multitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT petegrossoraphael multitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT kuangrui multitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa
AT tolarjakub multitaskclusteringapproachforsinglecellrnaseqanalysisinrecessivedystrophicepidermolysisbullosa