Cargando…

Single cell RNA-seq data clustering using TF-IDF based methods

BACKGROUND: Single cell transcriptomics is critical for understanding cellular heterogeneity and identification of novel cell types. Leveraging the recent advances in single cell RNA sequencing (scRNA-Seq) technology requires novel unsupervised clustering algorithms that are robust to high levels of...

Descripción completa

Detalles Bibliográficos
Autores principales: Moussa, Marmar, Măndoiu, Ion I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101073/
https://www.ncbi.nlm.nih.gov/pubmed/30367575
http://dx.doi.org/10.1186/s12864-018-4922-4
Descripción
Sumario:BACKGROUND: Single cell transcriptomics is critical for understanding cellular heterogeneity and identification of novel cell types. Leveraging the recent advances in single cell RNA sequencing (scRNA-Seq) technology requires novel unsupervised clustering algorithms that are robust to high levels of technical and biological noise and scale to datasets of millions of cells. RESULTS: We present novel computational approaches for clustering scRNA-seq data based on the Term Frequency - Inverse Document Frequency (TF-IDF) transformation that has been successfully used in the field of text analysis. CONCLUSIONS: Empirical experimental results show that TF-IDF methods consistently outperform commonly used scRNA-Seq clustering approaches.