Cargando…

zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs

BACKGROUND: Single-cell RNA-sequencing (scRNA-seq) experiments typically analyze hundreds or thousands of cells after amplification of the cDNA. The high throughput is made possible by the early introduction of sample-specific bar codes (BCs), and the amplification bias is alleviated by unique molec...

Descripción completa

Detalles Bibliográficos
Autores principales:	Parekh, Swati, Ziegenhain, Christoph, Vieth, Beate, Enard, Wolfgang, Hellmann, Ines
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2018
Materias:	Technical Note
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6007394/ https://www.ncbi.nlm.nih.gov/pubmed/29846586 http://dx.doi.org/10.1093/gigascience/giy059

_version_	1783333028904304640
author	Parekh, Swati Ziegenhain, Christoph Vieth, Beate Enard, Wolfgang Hellmann, Ines
author_facet	Parekh, Swati Ziegenhain, Christoph Vieth, Beate Enard, Wolfgang Hellmann, Ines
author_sort	Parekh, Swati
collection	PubMed
description	BACKGROUND: Single-cell RNA-sequencing (scRNA-seq) experiments typically analyze hundreds or thousands of cells after amplification of the cDNA. The high throughput is made possible by the early introduction of sample-specific bar codes (BCs), and the amplification bias is alleviated by unique molecular identifiers (UMIs). Thus, the ideal analysis pipeline for scRNA-seq data needs to efficiently tabulate reads according to both BC and UMI. FINDINGS: zUMIs is a pipeline that can handle both known and random BCs and also efficiently collapse UMIs, either just for exon mapping reads or for both exon and intron mapping reads. If BC annotation is missing, zUMIs can accurately detect intact cells from the distribution of sequencing reads. Another unique feature of zUMIs is the adaptive downsampling function that facilitates dealing with hugely varying library sizes but also allows the user to evaluate whether the library has been sequenced to saturation. To illustrate the utility of zUMIs, we analyzed a single-nucleus RNA-seq dataset and show that more than 35% of all reads map to introns. Also, we show that these intronic reads are informative about expression levels, significantly increasing the number of detected genes and improving the cluster resolution. CONCLUSIONS: zUMIs flexibility makes if possible to accommodate data generated with any of the major scRNA-seq protocols that use BCs and UMIs and is the most feature-rich, fast, and user-friendly pipeline to process such scRNA-seq data.
format	Online Article Text
id	pubmed-6007394
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-60073942018-07-05 zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs Parekh, Swati Ziegenhain, Christoph Vieth, Beate Enard, Wolfgang Hellmann, Ines Gigascience Technical Note BACKGROUND: Single-cell RNA-sequencing (scRNA-seq) experiments typically analyze hundreds or thousands of cells after amplification of the cDNA. The high throughput is made possible by the early introduction of sample-specific bar codes (BCs), and the amplification bias is alleviated by unique molecular identifiers (UMIs). Thus, the ideal analysis pipeline for scRNA-seq data needs to efficiently tabulate reads according to both BC and UMI. FINDINGS: zUMIs is a pipeline that can handle both known and random BCs and also efficiently collapse UMIs, either just for exon mapping reads or for both exon and intron mapping reads. If BC annotation is missing, zUMIs can accurately detect intact cells from the distribution of sequencing reads. Another unique feature of zUMIs is the adaptive downsampling function that facilitates dealing with hugely varying library sizes but also allows the user to evaluate whether the library has been sequenced to saturation. To illustrate the utility of zUMIs, we analyzed a single-nucleus RNA-seq dataset and show that more than 35% of all reads map to introns. Also, we show that these intronic reads are informative about expression levels, significantly increasing the number of detected genes and improving the cluster resolution. CONCLUSIONS: zUMIs flexibility makes if possible to accommodate data generated with any of the major scRNA-seq protocols that use BCs and UMIs and is the most feature-rich, fast, and user-friendly pipeline to process such scRNA-seq data. Oxford University Press 2018-05-26 /pmc/articles/PMC6007394/ /pubmed/29846586 http://dx.doi.org/10.1093/gigascience/giy059 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Technical Note Parekh, Swati Ziegenhain, Christoph Vieth, Beate Enard, Wolfgang Hellmann, Ines zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs
title	zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs
title_full	zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs
title_fullStr	zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs
title_full_unstemmed	zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs
title_short	zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs
title_sort	zumis - a fast and flexible pipeline to process rna sequencing data with umis
topic	Technical Note
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6007394/ https://www.ncbi.nlm.nih.gov/pubmed/29846586 http://dx.doi.org/10.1093/gigascience/giy059
work_keys_str_mv	AT parekhswati zumisafastandflexiblepipelinetoprocessrnasequencingdatawithumis AT ziegenhainchristoph zumisafastandflexiblepipelinetoprocessrnasequencingdatawithumis AT viethbeate zumisafastandflexiblepipelinetoprocessrnasequencingdatawithumis AT enardwolfgang zumisafastandflexiblepipelinetoprocessrnasequencingdatawithumis AT hellmannines zumisafastandflexiblepipelinetoprocessrnasequencingdatawithumis

zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs

Ejemplares similares