Cargando…

Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data

Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that var...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Cheng, Hu, Yu, Kelly, Derek, Kim, Junhyong, Li, Mingyao, Zhang, Nancy R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737676/
https://www.ncbi.nlm.nih.gov/pubmed/29036714
http://dx.doi.org/10.1093/nar/gkx754
_version_ 1783287565877510144
author Jia, Cheng
Hu, Yu
Kelly, Derek
Kim, Junhyong
Li, Mingyao
Zhang, Nancy R.
author_facet Jia, Cheng
Hu, Yu
Kelly, Derek
Kim, Junhyong
Li, Mingyao
Zhang, Nancy R.
author_sort Jia, Cheng
collection PubMed
description Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that vary across cells, which can bias downstream analysis without proper adjustment. To account for cell-to-cell technical differences, we propose a statistical framework, TASC (Toolkit for Analysis of Single Cell RNA-seq), an empirical Bayes approach to reliably model the cell-specific dropout rates and amplification bias by use of external RNA spike-ins. TASC incorporates the technical parameters, which reflect cell-to-cell batch effects, into a hierarchical mixture model to estimate the biological variance of a gene and detect differentially expressed genes. More importantly, TASC is able to adjust for covariates to further eliminate confounding that may originate from cell size and cell cycle differences. In simulation and real scRNA-seq data, TASC achieves accurate Type I error control and displays competitive sensitivity and improved robustness to batch effects in differential expression analysis, compared to existing methods. TASC is programmed to be computationally efficient, taking advantage of multi-threaded parallelization. We believe that TASC will provide a robust platform for researchers to leverage the power of scRNA-seq.
format Online
Article
Text
id pubmed-5737676
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57376762018-01-04 Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data Jia, Cheng Hu, Yu Kelly, Derek Kim, Junhyong Li, Mingyao Zhang, Nancy R. Nucleic Acids Res Computational Biology Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that vary across cells, which can bias downstream analysis without proper adjustment. To account for cell-to-cell technical differences, we propose a statistical framework, TASC (Toolkit for Analysis of Single Cell RNA-seq), an empirical Bayes approach to reliably model the cell-specific dropout rates and amplification bias by use of external RNA spike-ins. TASC incorporates the technical parameters, which reflect cell-to-cell batch effects, into a hierarchical mixture model to estimate the biological variance of a gene and detect differentially expressed genes. More importantly, TASC is able to adjust for covariates to further eliminate confounding that may originate from cell size and cell cycle differences. In simulation and real scRNA-seq data, TASC achieves accurate Type I error control and displays competitive sensitivity and improved robustness to batch effects in differential expression analysis, compared to existing methods. TASC is programmed to be computationally efficient, taking advantage of multi-threaded parallelization. We believe that TASC will provide a robust platform for researchers to leverage the power of scRNA-seq. Oxford University Press 2017-11-02 2017-09-25 /pmc/articles/PMC5737676/ /pubmed/29036714 http://dx.doi.org/10.1093/nar/gkx754 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Jia, Cheng
Hu, Yu
Kelly, Derek
Kim, Junhyong
Li, Mingyao
Zhang, Nancy R.
Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data
title Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data
title_full Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data
title_fullStr Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data
title_full_unstemmed Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data
title_short Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data
title_sort accounting for technical noise in differential expression analysis of single-cell rna sequencing data
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737676/
https://www.ncbi.nlm.nih.gov/pubmed/29036714
http://dx.doi.org/10.1093/nar/gkx754
work_keys_str_mv AT jiacheng accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata
AT huyu accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata
AT kellyderek accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata
AT kimjunhyong accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata
AT limingyao accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata
AT zhangnancyr accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata