Cargando…
Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data
Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that var...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737676/ https://www.ncbi.nlm.nih.gov/pubmed/29036714 http://dx.doi.org/10.1093/nar/gkx754 |
_version_ | 1783287565877510144 |
---|---|
author | Jia, Cheng Hu, Yu Kelly, Derek Kim, Junhyong Li, Mingyao Zhang, Nancy R. |
author_facet | Jia, Cheng Hu, Yu Kelly, Derek Kim, Junhyong Li, Mingyao Zhang, Nancy R. |
author_sort | Jia, Cheng |
collection | PubMed |
description | Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that vary across cells, which can bias downstream analysis without proper adjustment. To account for cell-to-cell technical differences, we propose a statistical framework, TASC (Toolkit for Analysis of Single Cell RNA-seq), an empirical Bayes approach to reliably model the cell-specific dropout rates and amplification bias by use of external RNA spike-ins. TASC incorporates the technical parameters, which reflect cell-to-cell batch effects, into a hierarchical mixture model to estimate the biological variance of a gene and detect differentially expressed genes. More importantly, TASC is able to adjust for covariates to further eliminate confounding that may originate from cell size and cell cycle differences. In simulation and real scRNA-seq data, TASC achieves accurate Type I error control and displays competitive sensitivity and improved robustness to batch effects in differential expression analysis, compared to existing methods. TASC is programmed to be computationally efficient, taking advantage of multi-threaded parallelization. We believe that TASC will provide a robust platform for researchers to leverage the power of scRNA-seq. |
format | Online Article Text |
id | pubmed-5737676 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-57376762018-01-04 Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data Jia, Cheng Hu, Yu Kelly, Derek Kim, Junhyong Li, Mingyao Zhang, Nancy R. Nucleic Acids Res Computational Biology Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that vary across cells, which can bias downstream analysis without proper adjustment. To account for cell-to-cell technical differences, we propose a statistical framework, TASC (Toolkit for Analysis of Single Cell RNA-seq), an empirical Bayes approach to reliably model the cell-specific dropout rates and amplification bias by use of external RNA spike-ins. TASC incorporates the technical parameters, which reflect cell-to-cell batch effects, into a hierarchical mixture model to estimate the biological variance of a gene and detect differentially expressed genes. More importantly, TASC is able to adjust for covariates to further eliminate confounding that may originate from cell size and cell cycle differences. In simulation and real scRNA-seq data, TASC achieves accurate Type I error control and displays competitive sensitivity and improved robustness to batch effects in differential expression analysis, compared to existing methods. TASC is programmed to be computationally efficient, taking advantage of multi-threaded parallelization. We believe that TASC will provide a robust platform for researchers to leverage the power of scRNA-seq. Oxford University Press 2017-11-02 2017-09-25 /pmc/articles/PMC5737676/ /pubmed/29036714 http://dx.doi.org/10.1093/nar/gkx754 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Jia, Cheng Hu, Yu Kelly, Derek Kim, Junhyong Li, Mingyao Zhang, Nancy R. Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data |
title | Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data |
title_full | Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data |
title_fullStr | Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data |
title_full_unstemmed | Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data |
title_short | Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data |
title_sort | accounting for technical noise in differential expression analysis of single-cell rna sequencing data |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737676/ https://www.ncbi.nlm.nih.gov/pubmed/29036714 http://dx.doi.org/10.1093/nar/gkx754 |
work_keys_str_mv | AT jiacheng accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata AT huyu accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata AT kellyderek accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata AT kimjunhyong accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata AT limingyao accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata AT zhangnancyr accountingfortechnicalnoiseindifferentialexpressionanalysisofsinglecellrnasequencingdata |