Cargando…

A reference-free approach for cell type classification with scRNA-seq

Single-cell RNA sequencing (scRNA-seq) has become a revolutionary technology to characterize cells under different biological conditions. Unlike bulk RNA-seq, gene expression from scRNA-seq is highly sparse due to limited sequencing depth per cell. This is worsened by tossing away a significant port...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Qi, Peng, Yifan, Liu, Jinze
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8335627/
https://www.ncbi.nlm.nih.gov/pubmed/34381979
http://dx.doi.org/10.1016/j.isci.2021.102855
_version_ 1783733150196695040
author Sun, Qi
Peng, Yifan
Liu, Jinze
author_facet Sun, Qi
Peng, Yifan
Liu, Jinze
author_sort Sun, Qi
collection PubMed
description Single-cell RNA sequencing (scRNA-seq) has become a revolutionary technology to characterize cells under different biological conditions. Unlike bulk RNA-seq, gene expression from scRNA-seq is highly sparse due to limited sequencing depth per cell. This is worsened by tossing away a significant portion of reads that attribute to gene quantification. To overcome data sparsity and fully utilize original reads, we propose scSimClassify, a reference-free and alignment-free approach to classify cell types with k-mer level features. The compressed k-mer groups (CKGs), identified by the simhash method, contain k-mers with similar abundance profiles and serve as the cells’ features. Our experiments demonstrate that CKG features lend themselves to better performance than gene expression features in scRNA-seq classification accuracy in the majority of experimental cases. Because CKGs are derived from raw reads without alignment to reference genome, scSimClassify offers an effective alternative to existing methods especially when reference genome is incomplete or insufficient to represent subject genomes.
format Online
Article
Text
id pubmed-8335627
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-83356272021-08-10 A reference-free approach for cell type classification with scRNA-seq Sun, Qi Peng, Yifan Liu, Jinze iScience Article Single-cell RNA sequencing (scRNA-seq) has become a revolutionary technology to characterize cells under different biological conditions. Unlike bulk RNA-seq, gene expression from scRNA-seq is highly sparse due to limited sequencing depth per cell. This is worsened by tossing away a significant portion of reads that attribute to gene quantification. To overcome data sparsity and fully utilize original reads, we propose scSimClassify, a reference-free and alignment-free approach to classify cell types with k-mer level features. The compressed k-mer groups (CKGs), identified by the simhash method, contain k-mers with similar abundance profiles and serve as the cells’ features. Our experiments demonstrate that CKG features lend themselves to better performance than gene expression features in scRNA-seq classification accuracy in the majority of experimental cases. Because CKGs are derived from raw reads without alignment to reference genome, scSimClassify offers an effective alternative to existing methods especially when reference genome is incomplete or insufficient to represent subject genomes. Elsevier 2021-07-14 /pmc/articles/PMC8335627/ /pubmed/34381979 http://dx.doi.org/10.1016/j.isci.2021.102855 Text en © 2021 The Authors. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Sun, Qi
Peng, Yifan
Liu, Jinze
A reference-free approach for cell type classification with scRNA-seq
title A reference-free approach for cell type classification with scRNA-seq
title_full A reference-free approach for cell type classification with scRNA-seq
title_fullStr A reference-free approach for cell type classification with scRNA-seq
title_full_unstemmed A reference-free approach for cell type classification with scRNA-seq
title_short A reference-free approach for cell type classification with scRNA-seq
title_sort reference-free approach for cell type classification with scrna-seq
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8335627/
https://www.ncbi.nlm.nih.gov/pubmed/34381979
http://dx.doi.org/10.1016/j.isci.2021.102855
work_keys_str_mv AT sunqi areferencefreeapproachforcelltypeclassificationwithscrnaseq
AT pengyifan areferencefreeapproachforcelltypeclassificationwithscrnaseq
AT liujinze areferencefreeapproachforcelltypeclassificationwithscrnaseq
AT sunqi referencefreeapproachforcelltypeclassificationwithscrnaseq
AT pengyifan referencefreeapproachforcelltypeclassificationwithscrnaseq
AT liujinze referencefreeapproachforcelltypeclassificationwithscrnaseq