Cargando…

scCAN: single-cell clustering using autoencoder and network fusion

Unsupervised clustering of single-cell RNA sequencing data (scRNA-seq) is important because it allows us to identify putative cell types. However, the large number of cells (up to millions), the high-dimensionality of the data (tens of thousands of genes), and the high dropout rates all present subs...

Descripción completa

Detalles Bibliográficos
Autores principales: Tran, Bang, Tran, Duc, Nguyen, Hung, Ro, Seungil, Nguyen, Tin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9206025/
https://www.ncbi.nlm.nih.gov/pubmed/35715568
http://dx.doi.org/10.1038/s41598-022-14218-6
_version_ 1784729254737477632
author Tran, Bang
Tran, Duc
Nguyen, Hung
Ro, Seungil
Nguyen, Tin
author_facet Tran, Bang
Tran, Duc
Nguyen, Hung
Ro, Seungil
Nguyen, Tin
author_sort Tran, Bang
collection PubMed
description Unsupervised clustering of single-cell RNA sequencing data (scRNA-seq) is important because it allows us to identify putative cell types. However, the large number of cells (up to millions), the high-dimensionality of the data (tens of thousands of genes), and the high dropout rates all present substantial challenges in single-cell analysis. Here we introduce a new method, named single-cell Clustering using Autoencoder and Network fusion (scCAN), that can overcome these challenges to accurately segregate different cell types in large and sparse scRNA-seq data. In an extensive analysis using 28 real scRNA-seq datasets (more than three million cells) and 243 simulated datasets, we validate that scCAN: (1) correctly estimates the number of true cell types, (2) accurately segregates cells of different types, (3) is robust against dropouts, and (4) is fast and memory efficient. We also compare scCAN with CIDR, SEURAT3, Monocle3, SHARP, and SCANPY. scCAN outperforms these state-of-the-art methods in terms of both accuracy and scalability. The scCAN package is available at https://cran.r-project.org/package=scCAN. Data and R scripts are available at http://sccan.tinnguyen-lab.com/
format Online
Article
Text
id pubmed-9206025
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92060252022-06-19 scCAN: single-cell clustering using autoencoder and network fusion Tran, Bang Tran, Duc Nguyen, Hung Ro, Seungil Nguyen, Tin Sci Rep Article Unsupervised clustering of single-cell RNA sequencing data (scRNA-seq) is important because it allows us to identify putative cell types. However, the large number of cells (up to millions), the high-dimensionality of the data (tens of thousands of genes), and the high dropout rates all present substantial challenges in single-cell analysis. Here we introduce a new method, named single-cell Clustering using Autoencoder and Network fusion (scCAN), that can overcome these challenges to accurately segregate different cell types in large and sparse scRNA-seq data. In an extensive analysis using 28 real scRNA-seq datasets (more than three million cells) and 243 simulated datasets, we validate that scCAN: (1) correctly estimates the number of true cell types, (2) accurately segregates cells of different types, (3) is robust against dropouts, and (4) is fast and memory efficient. We also compare scCAN with CIDR, SEURAT3, Monocle3, SHARP, and SCANPY. scCAN outperforms these state-of-the-art methods in terms of both accuracy and scalability. The scCAN package is available at https://cran.r-project.org/package=scCAN. Data and R scripts are available at http://sccan.tinnguyen-lab.com/ Nature Publishing Group UK 2022-06-17 /pmc/articles/PMC9206025/ /pubmed/35715568 http://dx.doi.org/10.1038/s41598-022-14218-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Tran, Bang
Tran, Duc
Nguyen, Hung
Ro, Seungil
Nguyen, Tin
scCAN: single-cell clustering using autoencoder and network fusion
title scCAN: single-cell clustering using autoencoder and network fusion
title_full scCAN: single-cell clustering using autoencoder and network fusion
title_fullStr scCAN: single-cell clustering using autoencoder and network fusion
title_full_unstemmed scCAN: single-cell clustering using autoencoder and network fusion
title_short scCAN: single-cell clustering using autoencoder and network fusion
title_sort sccan: single-cell clustering using autoencoder and network fusion
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9206025/
https://www.ncbi.nlm.nih.gov/pubmed/35715568
http://dx.doi.org/10.1038/s41598-022-14218-6
work_keys_str_mv AT tranbang sccansinglecellclusteringusingautoencoderandnetworkfusion
AT tranduc sccansinglecellclusteringusingautoencoderandnetworkfusion
AT nguyenhung sccansinglecellclusteringusingautoencoderandnetworkfusion
AT roseungil sccansinglecellclusteringusingautoencoderandnetworkfusion
AT nguyentin sccansinglecellclusteringusingautoencoderandnetworkfusion