Cargando…

Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers

Single-cell RNA-seq (scRNA-seq) profiles gene expression of individual cells. Unique molecular identifiers (UMIs) remove duplicates in read counts resulting from polymerase chain reaction, a major source of noise. For scRNA-seq data lacking UMIs, we propose quasi-UMIs: quantile normalization of read...

Descripción completa

Detalles Bibliográficos
Autores principales: Townes, F. William, Irizarry, Rafael A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7333325/
https://www.ncbi.nlm.nih.gov/pubmed/32620142
http://dx.doi.org/10.1186/s13059-020-02078-0
_version_ 1783553728533495808
author Townes, F. William
Irizarry, Rafael A.
author_facet Townes, F. William
Irizarry, Rafael A.
author_sort Townes, F. William
collection PubMed
description Single-cell RNA-seq (scRNA-seq) profiles gene expression of individual cells. Unique molecular identifiers (UMIs) remove duplicates in read counts resulting from polymerase chain reaction, a major source of noise. For scRNA-seq data lacking UMIs, we propose quasi-UMIs: quantile normalization of read counts to a compound Poisson distribution empirically derived from UMI datasets. When applied to ground-truth datasets having both reads and UMIs, quasi-UMI normalization has higher accuracy than competing methods. Using quasi-UMIs enables methods designed specifically for UMI data to be applied to non-UMI scRNA-seq datasets.
format Online
Article
Text
id pubmed-7333325
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73333252020-07-06 Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers Townes, F. William Irizarry, Rafael A. Genome Biol Method Single-cell RNA-seq (scRNA-seq) profiles gene expression of individual cells. Unique molecular identifiers (UMIs) remove duplicates in read counts resulting from polymerase chain reaction, a major source of noise. For scRNA-seq data lacking UMIs, we propose quasi-UMIs: quantile normalization of read counts to a compound Poisson distribution empirically derived from UMI datasets. When applied to ground-truth datasets having both reads and UMIs, quasi-UMI normalization has higher accuracy than competing methods. Using quasi-UMIs enables methods designed specifically for UMI data to be applied to non-UMI scRNA-seq datasets. BioMed Central 2020-07-03 /pmc/articles/PMC7333325/ /pubmed/32620142 http://dx.doi.org/10.1186/s13059-020-02078-0 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Method
Townes, F. William
Irizarry, Rafael A.
Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers
title Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers
title_full Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers
title_fullStr Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers
title_full_unstemmed Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers
title_short Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers
title_sort quantile normalization of single-cell rna-seq read counts without unique molecular identifiers
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7333325/
https://www.ncbi.nlm.nih.gov/pubmed/32620142
http://dx.doi.org/10.1186/s13059-020-02078-0
work_keys_str_mv AT townesfwilliam quantilenormalizationofsinglecellrnaseqreadcountswithoutuniquemolecularidentifiers
AT irizarryrafaela quantilenormalizationofsinglecellrnaseqreadcountswithoutuniquemolecularidentifiers