Cargando…
Pooling across cells to normalize single-cell RNA sequencing data with many zero counts
Normalization of single-cell RNA sequencing data is necessary to eliminate cell-specific biases prior to downstream analyses. However, this is not straightforward for noisy single-cell data where many counts are zero. We present a novel approach where expression values are summed across pools of cel...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848819/ https://www.ncbi.nlm.nih.gov/pubmed/27122128 http://dx.doi.org/10.1186/s13059-016-0947-7 |
_version_ | 1782429424927375360 |
---|---|
author | L. Lun, Aaron T. Bach, Karsten Marioni, John C. |
author_facet | L. Lun, Aaron T. Bach, Karsten Marioni, John C. |
author_sort | L. Lun, Aaron T. |
collection | PubMed |
description | Normalization of single-cell RNA sequencing data is necessary to eliminate cell-specific biases prior to downstream analyses. However, this is not straightforward for noisy single-cell data where many counts are zero. We present a novel approach where expression values are summed across pools of cells, and the summed values are used for normalization. Pool-based size factors are then deconvolved to yield cell-based factors. Our deconvolution approach outperforms existing methods for accurate normalization of cell-specific biases in simulated data. Similar behavior is observed in real data, where deconvolution improves the relevance of results of downstream analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-016-0947-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4848819 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48488192016-04-29 Pooling across cells to normalize single-cell RNA sequencing data with many zero counts L. Lun, Aaron T. Bach, Karsten Marioni, John C. Genome Biol Method Normalization of single-cell RNA sequencing data is necessary to eliminate cell-specific biases prior to downstream analyses. However, this is not straightforward for noisy single-cell data where many counts are zero. We present a novel approach where expression values are summed across pools of cells, and the summed values are used for normalization. Pool-based size factors are then deconvolved to yield cell-based factors. Our deconvolution approach outperforms existing methods for accurate normalization of cell-specific biases in simulated data. Similar behavior is observed in real data, where deconvolution improves the relevance of results of downstream analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-016-0947-7) contains supplementary material, which is available to authorized users. BioMed Central 2016-04-27 /pmc/articles/PMC4848819/ /pubmed/27122128 http://dx.doi.org/10.1186/s13059-016-0947-7 Text en © Lun et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Method L. Lun, Aaron T. Bach, Karsten Marioni, John C. Pooling across cells to normalize single-cell RNA sequencing data with many zero counts |
title | Pooling across cells to normalize single-cell RNA sequencing data with many zero counts |
title_full | Pooling across cells to normalize single-cell RNA sequencing data with many zero counts |
title_fullStr | Pooling across cells to normalize single-cell RNA sequencing data with many zero counts |
title_full_unstemmed | Pooling across cells to normalize single-cell RNA sequencing data with many zero counts |
title_short | Pooling across cells to normalize single-cell RNA sequencing data with many zero counts |
title_sort | pooling across cells to normalize single-cell rna sequencing data with many zero counts |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848819/ https://www.ncbi.nlm.nih.gov/pubmed/27122128 http://dx.doi.org/10.1186/s13059-016-0947-7 |
work_keys_str_mv | AT llunaaront poolingacrosscellstonormalizesinglecellrnasequencingdatawithmanyzerocounts AT bachkarsten poolingacrosscellstonormalizesinglecellrnasequencingdatawithmanyzerocounts AT marionijohnc poolingacrosscellstonormalizesinglecellrnasequencingdatawithmanyzerocounts |