Cargando…

Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data

An increasing number of studies are using single-cell RNA-sequencing (scRNA-seq) to characterize the gene expression profiles of individual cells. One common analysis applied to scRNA-seq data involves detecting differentially expressed (DE) genes between cells in different biological groups. Howeve...

Descripción completa

Detalles Bibliográficos
Autores principales: Lun, Aaron T. L., Marioni, John C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5862359/
https://www.ncbi.nlm.nih.gov/pubmed/28334062
http://dx.doi.org/10.1093/biostatistics/kxw055
_version_ 1783308212330561536
author Lun, Aaron T. L.
Marioni, John C.
author_facet Lun, Aaron T. L.
Marioni, John C.
author_sort Lun, Aaron T. L.
collection PubMed
description An increasing number of studies are using single-cell RNA-sequencing (scRNA-seq) to characterize the gene expression profiles of individual cells. One common analysis applied to scRNA-seq data involves detecting differentially expressed (DE) genes between cells in different biological groups. However, many experiments are designed such that the cells to be compared are processed in separate plates or chips, meaning that the groupings are confounded with systematic plate effects. This confounding aspect is frequently ignored in DE analyses of scRNA-seq data. In this article, we demonstrate that failing to consider plate effects in the statistical model results in loss of type I error control. A solution is proposed whereby counts are summed from all cells in each plate and the count sums for all plates are used in the DE analysis. This restores type I error control in the presence of plate effects without compromising detection power in simulated data. Summation is also robust to varying numbers and library sizes of cells on each plate. Similar results are observed in DE analyses of real data where the use of count sums instead of single-cell counts improves specificity and the ranking of relevant genes. This suggests that summation can assist in maintaining statistical rigour in DE analyses of scRNA-seq data with plate effects.
format Online
Article
Text
id pubmed-5862359
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58623592018-03-29 Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data Lun, Aaron T. L. Marioni, John C. Biostatistics Articles An increasing number of studies are using single-cell RNA-sequencing (scRNA-seq) to characterize the gene expression profiles of individual cells. One common analysis applied to scRNA-seq data involves detecting differentially expressed (DE) genes between cells in different biological groups. However, many experiments are designed such that the cells to be compared are processed in separate plates or chips, meaning that the groupings are confounded with systematic plate effects. This confounding aspect is frequently ignored in DE analyses of scRNA-seq data. In this article, we demonstrate that failing to consider plate effects in the statistical model results in loss of type I error control. A solution is proposed whereby counts are summed from all cells in each plate and the count sums for all plates are used in the DE analysis. This restores type I error control in the presence of plate effects without compromising detection power in simulated data. Summation is also robust to varying numbers and library sizes of cells on each plate. Similar results are observed in DE analyses of real data where the use of count sums instead of single-cell counts improves specificity and the ranking of relevant genes. This suggests that summation can assist in maintaining statistical rigour in DE analyses of scRNA-seq data with plate effects. Oxford University Press 2017-07 2017-02-06 /pmc/articles/PMC5862359/ /pubmed/28334062 http://dx.doi.org/10.1093/biostatistics/kxw055 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Lun, Aaron T. L.
Marioni, John C.
Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data
title Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data
title_full Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data
title_fullStr Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data
title_full_unstemmed Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data
title_short Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data
title_sort overcoming confounding plate effects in differential expression analyses of single-cell rna-seq data
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5862359/
https://www.ncbi.nlm.nih.gov/pubmed/28334062
http://dx.doi.org/10.1093/biostatistics/kxw055
work_keys_str_mv AT lunaarontl overcomingconfoundingplateeffectsindifferentialexpressionanalysesofsinglecellrnaseqdata
AT marionijohnc overcomingconfoundingplateeffectsindifferentialexpressionanalysesofsinglecellrnaseqdata