Cargando…

Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis

Single-cell RNA sequencing (scRNA-seq) can characterize cell types and states through unsupervised clustering, but the ever increasing number of cells and batch effect impose computational challenges. We present DESC, an unsupervised deep embedding algorithm that clusters scRNA-seq data by iterative...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Xiangjie, Wang, Kui, Lyu, Yafei, Pan, Huize, Zhang, Jingxiao, Stambolian, Dwight, Susztak, Katalin, Reilly, Muredach P., Hu, Gang, Li, Mingyao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7214470/
https://www.ncbi.nlm.nih.gov/pubmed/32393754
http://dx.doi.org/10.1038/s41467-020-15851-3
_version_ 1783531978678599680
author Li, Xiangjie
Wang, Kui
Lyu, Yafei
Pan, Huize
Zhang, Jingxiao
Stambolian, Dwight
Susztak, Katalin
Reilly, Muredach P.
Hu, Gang
Li, Mingyao
author_facet Li, Xiangjie
Wang, Kui
Lyu, Yafei
Pan, Huize
Zhang, Jingxiao
Stambolian, Dwight
Susztak, Katalin
Reilly, Muredach P.
Hu, Gang
Li, Mingyao
author_sort Li, Xiangjie
collection PubMed
description Single-cell RNA sequencing (scRNA-seq) can characterize cell types and states through unsupervised clustering, but the ever increasing number of cells and batch effect impose computational challenges. We present DESC, an unsupervised deep embedding algorithm that clusters scRNA-seq data by iteratively optimizing a clustering objective function. Through iterative self-learning, DESC gradually removes batch effects, as long as technical differences across batches are smaller than true biological variations. As a soft clustering algorithm, cluster assignment probabilities from DESC are biologically interpretable and can reveal both discrete and pseudotemporal structure of cells. Comprehensive evaluations show that DESC offers a proper balance of clustering accuracy and stability, has a small footprint on memory, does not explicitly require batch information for batch effect removal, and can utilize GPU when available. As the scale of single-cell studies continues to grow, we believe DESC will offer a valuable tool for biomedical researchers to disentangle complex cellular heterogeneity.
format Online
Article
Text
id pubmed-7214470
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72144702020-05-14 Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis Li, Xiangjie Wang, Kui Lyu, Yafei Pan, Huize Zhang, Jingxiao Stambolian, Dwight Susztak, Katalin Reilly, Muredach P. Hu, Gang Li, Mingyao Nat Commun Article Single-cell RNA sequencing (scRNA-seq) can characterize cell types and states through unsupervised clustering, but the ever increasing number of cells and batch effect impose computational challenges. We present DESC, an unsupervised deep embedding algorithm that clusters scRNA-seq data by iteratively optimizing a clustering objective function. Through iterative self-learning, DESC gradually removes batch effects, as long as technical differences across batches are smaller than true biological variations. As a soft clustering algorithm, cluster assignment probabilities from DESC are biologically interpretable and can reveal both discrete and pseudotemporal structure of cells. Comprehensive evaluations show that DESC offers a proper balance of clustering accuracy and stability, has a small footprint on memory, does not explicitly require batch information for batch effect removal, and can utilize GPU when available. As the scale of single-cell studies continues to grow, we believe DESC will offer a valuable tool for biomedical researchers to disentangle complex cellular heterogeneity. Nature Publishing Group UK 2020-05-11 /pmc/articles/PMC7214470/ /pubmed/32393754 http://dx.doi.org/10.1038/s41467-020-15851-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Li, Xiangjie
Wang, Kui
Lyu, Yafei
Pan, Huize
Zhang, Jingxiao
Stambolian, Dwight
Susztak, Katalin
Reilly, Muredach P.
Hu, Gang
Li, Mingyao
Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
title Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
title_full Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
title_fullStr Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
title_full_unstemmed Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
title_short Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
title_sort deep learning enables accurate clustering with batch effect removal in single-cell rna-seq analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7214470/
https://www.ncbi.nlm.nih.gov/pubmed/32393754
http://dx.doi.org/10.1038/s41467-020-15851-3
work_keys_str_mv AT lixiangjie deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis
AT wangkui deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis
AT lyuyafei deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis
AT panhuize deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis
AT zhangjingxiao deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis
AT stamboliandwight deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis
AT susztakkatalin deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis
AT reillymuredachp deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis
AT hugang deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis
AT limingyao deeplearningenablesaccurateclusteringwithbatcheffectremovalinsinglecellrnaseqanalysis