Cargando…

Fast searches of large collections of single cell data using scfind

Single cell technologies have made it possible to profile millions of cells, but for these resources to be useful they must be easy to query and access. To facilitate interactive and intuitive access to single cell data we have developed scfind, a single cell analysis tool that facilitates fast sear...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Jimmy Tsz Hang, Patikas, Nikolaos, Kiselev, Vladimir Yu, Hemberg, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7116898/
https://www.ncbi.nlm.nih.gov/pubmed/33649586
http://dx.doi.org/10.1038/s41592-021-01076-9
_version_ 1783514256505831424
author Lee, Jimmy Tsz Hang
Patikas, Nikolaos
Kiselev, Vladimir Yu
Hemberg, Martin
author_facet Lee, Jimmy Tsz Hang
Patikas, Nikolaos
Kiselev, Vladimir Yu
Hemberg, Martin
author_sort Lee, Jimmy Tsz Hang
collection PubMed
description Single cell technologies have made it possible to profile millions of cells, but for these resources to be useful they must be easy to query and access. To facilitate interactive and intuitive access to single cell data we have developed scfind, a single cell analysis tool that facilitates fast search of biologically or clinically relevant marker genes in cell atlases. Using transcriptome data from six mouse cell atlases we show how scfind can be used to evaluate marker genes, to perform in silico gating, and to identify both cell-type specific and housekeeping genes. Moreover, we have developed a subquery optimization routine to ensure that long and complex queries return meaningful results. To make scfind more user friendly, we use indices of PubMed abstracts and techniques from natural language processing to allow for arbitrary queries. Finally, we show how scfind can be used for multi-omics analyses by combining single-cell ATAC-seq data with transcriptome data.
format Online
Article
Text
id pubmed-7116898
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-71168982021-09-01 Fast searches of large collections of single cell data using scfind Lee, Jimmy Tsz Hang Patikas, Nikolaos Kiselev, Vladimir Yu Hemberg, Martin Nat Methods Article Single cell technologies have made it possible to profile millions of cells, but for these resources to be useful they must be easy to query and access. To facilitate interactive and intuitive access to single cell data we have developed scfind, a single cell analysis tool that facilitates fast search of biologically or clinically relevant marker genes in cell atlases. Using transcriptome data from six mouse cell atlases we show how scfind can be used to evaluate marker genes, to perform in silico gating, and to identify both cell-type specific and housekeeping genes. Moreover, we have developed a subquery optimization routine to ensure that long and complex queries return meaningful results. To make scfind more user friendly, we use indices of PubMed abstracts and techniques from natural language processing to allow for arbitrary queries. Finally, we show how scfind can be used for multi-omics analyses by combining single-cell ATAC-seq data with transcriptome data. 2021-03-01 2021-03-01 /pmc/articles/PMC7116898/ /pubmed/33649586 http://dx.doi.org/10.1038/s41592-021-01076-9 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Lee, Jimmy Tsz Hang
Patikas, Nikolaos
Kiselev, Vladimir Yu
Hemberg, Martin
Fast searches of large collections of single cell data using scfind
title Fast searches of large collections of single cell data using scfind
title_full Fast searches of large collections of single cell data using scfind
title_fullStr Fast searches of large collections of single cell data using scfind
title_full_unstemmed Fast searches of large collections of single cell data using scfind
title_short Fast searches of large collections of single cell data using scfind
title_sort fast searches of large collections of single cell data using scfind
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7116898/
https://www.ncbi.nlm.nih.gov/pubmed/33649586
http://dx.doi.org/10.1038/s41592-021-01076-9
work_keys_str_mv AT leejimmytszhang fastsearchesoflargecollectionsofsinglecelldatausingscfind
AT patikasnikolaos fastsearchesoflargecollectionsofsinglecelldatausingscfind
AT kiselevvladimiryu fastsearchesoflargecollectionsofsinglecelldatausingscfind
AT hembergmartin fastsearchesoflargecollectionsofsinglecelldatausingscfind