Cargando…
Fast searches of large collections of single cell data using scfind
Single cell technologies have made it possible to profile millions of cells, but for these resources to be useful they must be easy to query and access. To facilitate interactive and intuitive access to single cell data we have developed scfind, a single cell analysis tool that facilitates fast sear...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7116898/ https://www.ncbi.nlm.nih.gov/pubmed/33649586 http://dx.doi.org/10.1038/s41592-021-01076-9 |
_version_ | 1783514256505831424 |
---|---|
author | Lee, Jimmy Tsz Hang Patikas, Nikolaos Kiselev, Vladimir Yu Hemberg, Martin |
author_facet | Lee, Jimmy Tsz Hang Patikas, Nikolaos Kiselev, Vladimir Yu Hemberg, Martin |
author_sort | Lee, Jimmy Tsz Hang |
collection | PubMed |
description | Single cell technologies have made it possible to profile millions of cells, but for these resources to be useful they must be easy to query and access. To facilitate interactive and intuitive access to single cell data we have developed scfind, a single cell analysis tool that facilitates fast search of biologically or clinically relevant marker genes in cell atlases. Using transcriptome data from six mouse cell atlases we show how scfind can be used to evaluate marker genes, to perform in silico gating, and to identify both cell-type specific and housekeeping genes. Moreover, we have developed a subquery optimization routine to ensure that long and complex queries return meaningful results. To make scfind more user friendly, we use indices of PubMed abstracts and techniques from natural language processing to allow for arbitrary queries. Finally, we show how scfind can be used for multi-omics analyses by combining single-cell ATAC-seq data with transcriptome data. |
format | Online Article Text |
id | pubmed-7116898 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
record_format | MEDLINE/PubMed |
spelling | pubmed-71168982021-09-01 Fast searches of large collections of single cell data using scfind Lee, Jimmy Tsz Hang Patikas, Nikolaos Kiselev, Vladimir Yu Hemberg, Martin Nat Methods Article Single cell technologies have made it possible to profile millions of cells, but for these resources to be useful they must be easy to query and access. To facilitate interactive and intuitive access to single cell data we have developed scfind, a single cell analysis tool that facilitates fast search of biologically or clinically relevant marker genes in cell atlases. Using transcriptome data from six mouse cell atlases we show how scfind can be used to evaluate marker genes, to perform in silico gating, and to identify both cell-type specific and housekeeping genes. Moreover, we have developed a subquery optimization routine to ensure that long and complex queries return meaningful results. To make scfind more user friendly, we use indices of PubMed abstracts and techniques from natural language processing to allow for arbitrary queries. Finally, we show how scfind can be used for multi-omics analyses by combining single-cell ATAC-seq data with transcriptome data. 2021-03-01 2021-03-01 /pmc/articles/PMC7116898/ /pubmed/33649586 http://dx.doi.org/10.1038/s41592-021-01076-9 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms |
spellingShingle | Article Lee, Jimmy Tsz Hang Patikas, Nikolaos Kiselev, Vladimir Yu Hemberg, Martin Fast searches of large collections of single cell data using scfind |
title | Fast searches of large collections of single cell data using scfind |
title_full | Fast searches of large collections of single cell data using scfind |
title_fullStr | Fast searches of large collections of single cell data using scfind |
title_full_unstemmed | Fast searches of large collections of single cell data using scfind |
title_short | Fast searches of large collections of single cell data using scfind |
title_sort | fast searches of large collections of single cell data using scfind |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7116898/ https://www.ncbi.nlm.nih.gov/pubmed/33649586 http://dx.doi.org/10.1038/s41592-021-01076-9 |
work_keys_str_mv | AT leejimmytszhang fastsearchesoflargecollectionsofsinglecelldatausingscfind AT patikasnikolaos fastsearchesoflargecollectionsofsinglecelldatausingscfind AT kiselevvladimiryu fastsearchesoflargecollectionsofsinglecelldatausingscfind AT hembergmartin fastsearchesoflargecollectionsofsinglecelldatausingscfind |