Cargando…

A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification

BACKGROUND: The advent of single cell RNA sequencing (scRNA-seq) enabled researchers to study transcriptomic activity within individual cells and identify inherent cell types in the sample. Although numerous computational tools have been developed to analyze single cell transcriptomes, there are no...

Descripción completa

Detalles Bibliográficos
Autores principales: Abrams, Douglas, Kumar, Parveen, Karuturi, R. Krishna Murthy, George, Joshy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6551246/
https://www.ncbi.nlm.nih.gov/pubmed/31167661
http://dx.doi.org/10.1186/s12859-019-2817-2
_version_ 1783424363682332672
author Abrams, Douglas
Kumar, Parveen
Karuturi, R. Krishna Murthy
George, Joshy
author_facet Abrams, Douglas
Kumar, Parveen
Karuturi, R. Krishna Murthy
George, Joshy
author_sort Abrams, Douglas
collection PubMed
description BACKGROUND: The advent of single cell RNA sequencing (scRNA-seq) enabled researchers to study transcriptomic activity within individual cells and identify inherent cell types in the sample. Although numerous computational tools have been developed to analyze single cell transcriptomes, there are no published studies and analytical packages available to guide experimental design and to devise suitable analysis procedure for cell type identification. RESULTS: We have developed an empirical methodology to address this important gap in single cell experimental design and analysis into an easy-to-use tool called SCEED (Single Cell Empirical Experimental Design and analysis). With SCEED, user can choose a variety of combinations of tools for analysis, conduct performance analysis of analytical procedures and choose the best procedure, and estimate sample size (number of cells to be profiled) required for a given analytical procedure at varying levels of cell type rarity and other experimental parameters. Using SCEED, we examined 3 single cell algorithms using 48 simulated single cell datasets that were generated for varying number of cell types and their proportions, number of genes expressed per cell, number of marker genes and their fold change, and number of single cells successfully profiled in the experiment. CONCLUSIONS: Based on our study, we found that when marker genes are expressed at fold change of 4 or more, either Seurat or SIMLR algorithm can be used to analyze single cell dataset for any number of single cells isolated (minimum 1000 single cells were tested). However, when marker genes are expected to be only up to fold change of 2, choice of the single cell algorithm is dependent on the number of single cells isolated and rarity of cell types to be identified. In conclusion, our work allows the assessment of various single cell methods and also aids in the design of single cell experiments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2817-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6551246
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65512462019-06-07 A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification Abrams, Douglas Kumar, Parveen Karuturi, R. Krishna Murthy George, Joshy BMC Bioinformatics Research BACKGROUND: The advent of single cell RNA sequencing (scRNA-seq) enabled researchers to study transcriptomic activity within individual cells and identify inherent cell types in the sample. Although numerous computational tools have been developed to analyze single cell transcriptomes, there are no published studies and analytical packages available to guide experimental design and to devise suitable analysis procedure for cell type identification. RESULTS: We have developed an empirical methodology to address this important gap in single cell experimental design and analysis into an easy-to-use tool called SCEED (Single Cell Empirical Experimental Design and analysis). With SCEED, user can choose a variety of combinations of tools for analysis, conduct performance analysis of analytical procedures and choose the best procedure, and estimate sample size (number of cells to be profiled) required for a given analytical procedure at varying levels of cell type rarity and other experimental parameters. Using SCEED, we examined 3 single cell algorithms using 48 simulated single cell datasets that were generated for varying number of cell types and their proportions, number of genes expressed per cell, number of marker genes and their fold change, and number of single cells successfully profiled in the experiment. CONCLUSIONS: Based on our study, we found that when marker genes are expressed at fold change of 4 or more, either Seurat or SIMLR algorithm can be used to analyze single cell dataset for any number of single cells isolated (minimum 1000 single cells were tested). However, when marker genes are expected to be only up to fold change of 2, choice of the single cell algorithm is dependent on the number of single cells isolated and rarity of cell types to be identified. In conclusion, our work allows the assessment of various single cell methods and also aids in the design of single cell experiments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2817-2) contains supplementary material, which is available to authorized users. BioMed Central 2019-06-06 /pmc/articles/PMC6551246/ /pubmed/31167661 http://dx.doi.org/10.1186/s12859-019-2817-2 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Abrams, Douglas
Kumar, Parveen
Karuturi, R. Krishna Murthy
George, Joshy
A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification
title A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification
title_full A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification
title_fullStr A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification
title_full_unstemmed A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification
title_short A computational method to aid the design and analysis of single cell RNA-seq experiments for cell type identification
title_sort computational method to aid the design and analysis of single cell rna-seq experiments for cell type identification
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6551246/
https://www.ncbi.nlm.nih.gov/pubmed/31167661
http://dx.doi.org/10.1186/s12859-019-2817-2
work_keys_str_mv AT abramsdouglas acomputationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification
AT kumarparveen acomputationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification
AT karuturirkrishnamurthy acomputationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification
AT georgejoshy acomputationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification
AT abramsdouglas computationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification
AT kumarparveen computationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification
AT karuturirkrishnamurthy computationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification
AT georgejoshy computationalmethodtoaidthedesignandanalysisofsinglecellrnaseqexperimentsforcelltypeidentification