Cargando…

scROSHI: robust supervised hierarchical identification of single cells

Identifying cell types based on expression profiles is a pillar of single cell analysis. Existing machine-learning methods identify predictive features from annotated training data, which are often not available in early-stage studies. This can lead to overfitting and inferior performance when appli...

Descripción completa

Detalles Bibliográficos
Autores principales: Prummer, Michael, Bertolini, Anne, Bosshard, Lars, Barkmann, Florian, Yates, Josephine, Boeva, Valentina, Stekhoven, Daniel, Singer, Franziska
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10273189/
https://www.ncbi.nlm.nih.gov/pubmed/37332656
http://dx.doi.org/10.1093/nargab/lqad058
Descripción
Sumario:Identifying cell types based on expression profiles is a pillar of single cell analysis. Existing machine-learning methods identify predictive features from annotated training data, which are often not available in early-stage studies. This can lead to overfitting and inferior performance when applied to new data. To address these challenges we present scROSHI, which utilizes previously obtained cell type-specific gene lists and does not require training or the existence of annotated data. By respecting the hierarchical nature of cell type relationships and assigning cells consecutively to more specialized identities, excellent prediction performance is achieved. In a benchmark based on publicly available PBMC data sets, scROSHI outperforms competing methods when training data are limited or the diversity between experiments is large.