Cargando…
A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification
BACKGROUND: The advent of single cell RNA sequencing (scRNA-seq) enabled researchers to study transcriptomic activity within individual cells and identify inherent cell types in the sample. Although numerous computational tools have been developed to analyze single cell transcriptomes, there are no...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6551246/ https://www.ncbi.nlm.nih.gov/pubmed/31167661 http://dx.doi.org/10.1186/s12859-019-2817-2 |
_version_ | 1783424363682332672 |
---|---|
author | Abrams, Douglas Kumar, Parveen Karuturi, R. Krishna Murthy George, Joshy |
author_facet | Abrams, Douglas Kumar, Parveen Karuturi, R. Krishna Murthy George, Joshy |
author_sort | Abrams, Douglas |
collection | PubMed |
description | BACKGROUND: The advent of single cell RNA sequencing (scRNA-seq) enabled researchers to study transcriptomic activity within individual cells and identify inherent cell types in the sample. Although numerous computational tools have been developed to analyze single cell transcriptomes, there are no published studies and analytical packages available to guide experimental design and to devise suitable analysis procedure for cell type identification. RESULTS: We have developed an empirical methodology to address this important gap in single cell experimental design and analysis into an easy-to-use tool called SCEED (Single Cell Empirical Experimental Design and analysis). With SCEED, user can choose a variety of combinations of tools for analysis, conduct performance analysis of analytical procedures and choose the best procedure, and estimate sample size (number of cells to be profiled) required for a given analytical procedure at varying levels of cell type rarity and other experimental parameters. Using SCEED, we examined 3 single cell algorithms using 48 simulated single cell datasets that were generated for varying number of cell types and their proportions, number of genes expressed per cell, number of marker genes and their fold change, and number of single cells successfully profiled in the experiment. CONCLUSIONS: Based on our study, we found that when marker genes are expressed at fold change of 4 or more, either Seurat or SIMLR algorithm can be used to analyze single cell dataset for any number of single cells isolated (minimum 1000 single cells were tested). However, when marker genes are expected to be only up to fold change of 2, choice of the single cell algorithm is dependent on the number of single cells isolated and rarity of cell types to be identified. In conclusion, our work allows the assessment of various single cell methods and also aids in the design of single cell experiments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2817-2) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6551246 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-65512462019-06-07 A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification Abrams, Douglas Kumar, Parveen Karuturi, R. Krishna Murthy George, Joshy BMC Bioinformatics Research BACKGROUND: The advent of single cell RNA sequencing (scRNA-seq) enabled researchers to study transcriptomic activity within individual cells and identify inherent cell types in the sample. Although numerous computational tools have been developed to analyze single cell transcriptomes, there are no published studies and analytical packages available to guide experimental design and to devise suitable analysis procedure for cell type identification. RESULTS: We have developed an empirical methodology to address this important gap in single cell experimental design and analysis into an easy-to-use tool called SCEED (Single Cell Empirical Experimental Design and analysis). With SCEED, user can choose a variety of combinations of tools for analysis, conduct performance analysis of analytical procedures and choose the best procedure, and estimate sample size (number of cells to be profiled) required for a given analytical procedure at varying levels of cell type rarity and other experimental parameters. Using SCEED, we examined 3 single cell algorithms using 48 simulated single cell datasets that were generated for varying number of cell types and their proportions, number of genes expressed per cell, number of marker genes and their fold change, and number of single cells successfully profiled in the experiment. CONCLUSIONS: Based on our study, we found that when marker genes are expressed at fold change of 4 or more, either Seurat or SIMLR algorithm can be used to analyze single cell dataset for any number of single cells isolated (minimum 1000 single cells were tested). However, when marker genes are expected to be only up to fold change of 2, choice of the single cell algorithm is dependent on the number of single cells isolated and rarity of cell types to be identified. In conclusion, our work allows the assessment of various single cell methods and also aids in the design of single cell experiments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2817-2) contains supplementary material, which is available to authorized users. BioMed Central 2019-06-06 /pmc/articles/PMC6551246/ /pubmed/31167661 http://dx.doi.org/10.1186/s12859-019-2817-2 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Abrams, Douglas Kumar, Parveen Karuturi, R. Krishna Murthy George, Joshy A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification |
title | A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification |
title_full | A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification |
title_fullStr | A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification |
title_full_unstemmed | A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification |
title_short | A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification |
title_sort | computational method to aid the design and analysis of single cell rna-seq experiments for cell type identification |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6551246/ https://www.ncbi.nlm.nih.gov/pubmed/31167661 http://dx.doi.org/10.1186/s12859-019-2817-2 |
work_keys_str_mv | AT abramsdouglas acomputationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification AT kumarparveen acomputationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification AT karuturirkrishnamurthy acomputationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification AT georgejoshy acomputationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification AT abramsdouglas computationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification AT kumarparveen computationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification AT karuturirkrishnamurthy computationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification AT georgejoshy computationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification |