Cargando…
scCAN: single-cell clustering using autoencoder and network fusion
Unsupervised clustering of single-cell RNA sequencing data (scRNA-seq) is important because it allows us to identify putative cell types. However, the large number of cells (up to millions), the high-dimensionality of the data (tens of thousands of genes), and the high dropout rates all present subs...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9206025/ https://www.ncbi.nlm.nih.gov/pubmed/35715568 http://dx.doi.org/10.1038/s41598-022-14218-6 |
_version_ | 1784729254737477632 |
---|---|
author | Tran, Bang Tran, Duc Nguyen, Hung Ro, Seungil Nguyen, Tin |
author_facet | Tran, Bang Tran, Duc Nguyen, Hung Ro, Seungil Nguyen, Tin |
author_sort | Tran, Bang |
collection | PubMed |
description | Unsupervised clustering of single-cell RNA sequencing data (scRNA-seq) is important because it allows us to identify putative cell types. However, the large number of cells (up to millions), the high-dimensionality of the data (tens of thousands of genes), and the high dropout rates all present substantial challenges in single-cell analysis. Here we introduce a new method, named single-cell Clustering using Autoencoder and Network fusion (scCAN), that can overcome these challenges to accurately segregate different cell types in large and sparse scRNA-seq data. In an extensive analysis using 28 real scRNA-seq datasets (more than three million cells) and 243 simulated datasets, we validate that scCAN: (1) correctly estimates the number of true cell types, (2) accurately segregates cells of different types, (3) is robust against dropouts, and (4) is fast and memory efficient. We also compare scCAN with CIDR, SEURAT3, Monocle3, SHARP, and SCANPY. scCAN outperforms these state-of-the-art methods in terms of both accuracy and scalability. The scCAN package is available at https://cran.r-project.org/package=scCAN. Data and R scripts are available at http://sccan.tinnguyen-lab.com/ |
format | Online Article Text |
id | pubmed-9206025 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-92060252022-06-19 scCAN: single-cell clustering using autoencoder and network fusion Tran, Bang Tran, Duc Nguyen, Hung Ro, Seungil Nguyen, Tin Sci Rep Article Unsupervised clustering of single-cell RNA sequencing data (scRNA-seq) is important because it allows us to identify putative cell types. However, the large number of cells (up to millions), the high-dimensionality of the data (tens of thousands of genes), and the high dropout rates all present substantial challenges in single-cell analysis. Here we introduce a new method, named single-cell Clustering using Autoencoder and Network fusion (scCAN), that can overcome these challenges to accurately segregate different cell types in large and sparse scRNA-seq data. In an extensive analysis using 28 real scRNA-seq datasets (more than three million cells) and 243 simulated datasets, we validate that scCAN: (1) correctly estimates the number of true cell types, (2) accurately segregates cells of different types, (3) is robust against dropouts, and (4) is fast and memory efficient. We also compare scCAN with CIDR, SEURAT3, Monocle3, SHARP, and SCANPY. scCAN outperforms these state-of-the-art methods in terms of both accuracy and scalability. The scCAN package is available at https://cran.r-project.org/package=scCAN. Data and R scripts are available at http://sccan.tinnguyen-lab.com/ Nature Publishing Group UK 2022-06-17 /pmc/articles/PMC9206025/ /pubmed/35715568 http://dx.doi.org/10.1038/s41598-022-14218-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Tran, Bang Tran, Duc Nguyen, Hung Ro, Seungil Nguyen, Tin scCAN: single-cell clustering using autoencoder and network fusion |
title | scCAN: single-cell clustering using autoencoder and network fusion |
title_full | scCAN: single-cell clustering using autoencoder and network fusion |
title_fullStr | scCAN: single-cell clustering using autoencoder and network fusion |
title_full_unstemmed | scCAN: single-cell clustering using autoencoder and network fusion |
title_short | scCAN: single-cell clustering using autoencoder and network fusion |
title_sort | sccan: single-cell clustering using autoencoder and network fusion |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9206025/ https://www.ncbi.nlm.nih.gov/pubmed/35715568 http://dx.doi.org/10.1038/s41598-022-14218-6 |
work_keys_str_mv | AT tranbang sccansinglecellclusteringusingautoencoderandnetworkfusion AT tranduc sccansinglecellclusteringusingautoencoderandnetworkfusion AT nguyenhung sccansinglecellclusteringusingautoencoderandnetworkfusion AT roseungil sccansinglecellclusteringusingautoencoderandnetworkfusion AT nguyentin sccansinglecellclusteringusingautoencoderandnetworkfusion |