Cargando…

Impact of sequencing depth and read length on single cell RNA sequencing data of T cells

Single cell RNA sequencing (scRNA-seq) provides great potential in measuring the gene expression profiles of heterogeneous cell populations. In immunology, scRNA-seq allowed the characterisation of transcript sequence diversity of functionally relevant T cell subsets, and the identification of the f...

Descripción completa

Detalles Bibliográficos
Autores principales: Rizzetto, Simone, Eltahla, Auda A., Lin, Peijie, Bull, Rowena, Lloyd, Andrew R., Ho, Joshua W. K., Venturi, Vanessa, Luciani, Fabio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5630586/
https://www.ncbi.nlm.nih.gov/pubmed/28986563
http://dx.doi.org/10.1038/s41598-017-12989-x
_version_ 1783269247565168640
author Rizzetto, Simone
Eltahla, Auda A.
Lin, Peijie
Bull, Rowena
Lloyd, Andrew R.
Ho, Joshua W. K.
Venturi, Vanessa
Luciani, Fabio
author_facet Rizzetto, Simone
Eltahla, Auda A.
Lin, Peijie
Bull, Rowena
Lloyd, Andrew R.
Ho, Joshua W. K.
Venturi, Vanessa
Luciani, Fabio
author_sort Rizzetto, Simone
collection PubMed
description Single cell RNA sequencing (scRNA-seq) provides great potential in measuring the gene expression profiles of heterogeneous cell populations. In immunology, scRNA-seq allowed the characterisation of transcript sequence diversity of functionally relevant T cell subsets, and the identification of the full length T cell receptor (TCRαβ), which defines the specificity against cognate antigens. Several factors, e.g. RNA library capture, cell quality, and sequencing output affect the quality of scRNA-seq data. We studied the effects of read length and sequencing depth on the quality of gene expression profiles, cell type identification, and TCRαβ reconstruction, utilising 1,305 single cells from 8 publically available scRNA-seq datasets, and simulation-based analyses. Gene expression was characterised by an increased number of unique genes identified with short read lengths (<50 bp), but these featured higher technical variability compared to profiles from longer reads. Successful TCRαβ reconstruction was achieved for 6 datasets (81% − 100%) with at least 0.25 millions (PE) reads of length >50 bp, while it failed for datasets with <30 bp reads. Sufficient read length and sequencing depth can control technical noise to enable accurate identification of TCRαβ and gene expression profiles from scRNA-seq data of T cells.
format Online
Article
Text
id pubmed-5630586
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-56305862017-10-17 Impact of sequencing depth and read length on single cell RNA sequencing data of T cells Rizzetto, Simone Eltahla, Auda A. Lin, Peijie Bull, Rowena Lloyd, Andrew R. Ho, Joshua W. K. Venturi, Vanessa Luciani, Fabio Sci Rep Article Single cell RNA sequencing (scRNA-seq) provides great potential in measuring the gene expression profiles of heterogeneous cell populations. In immunology, scRNA-seq allowed the characterisation of transcript sequence diversity of functionally relevant T cell subsets, and the identification of the full length T cell receptor (TCRαβ), which defines the specificity against cognate antigens. Several factors, e.g. RNA library capture, cell quality, and sequencing output affect the quality of scRNA-seq data. We studied the effects of read length and sequencing depth on the quality of gene expression profiles, cell type identification, and TCRαβ reconstruction, utilising 1,305 single cells from 8 publically available scRNA-seq datasets, and simulation-based analyses. Gene expression was characterised by an increased number of unique genes identified with short read lengths (<50 bp), but these featured higher technical variability compared to profiles from longer reads. Successful TCRαβ reconstruction was achieved for 6 datasets (81% − 100%) with at least 0.25 millions (PE) reads of length >50 bp, while it failed for datasets with <30 bp reads. Sufficient read length and sequencing depth can control technical noise to enable accurate identification of TCRαβ and gene expression profiles from scRNA-seq data of T cells. Nature Publishing Group UK 2017-10-06 /pmc/articles/PMC5630586/ /pubmed/28986563 http://dx.doi.org/10.1038/s41598-017-12989-x Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Rizzetto, Simone
Eltahla, Auda A.
Lin, Peijie
Bull, Rowena
Lloyd, Andrew R.
Ho, Joshua W. K.
Venturi, Vanessa
Luciani, Fabio
Impact of sequencing depth and read length on single cell RNA sequencing data of T cells
title Impact of sequencing depth and read length on single cell RNA sequencing data of T cells
title_full Impact of sequencing depth and read length on single cell RNA sequencing data of T cells
title_fullStr Impact of sequencing depth and read length on single cell RNA sequencing data of T cells
title_full_unstemmed Impact of sequencing depth and read length on single cell RNA sequencing data of T cells
title_short Impact of sequencing depth and read length on single cell RNA sequencing data of T cells
title_sort impact of sequencing depth and read length on single cell rna sequencing data of t cells
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5630586/
https://www.ncbi.nlm.nih.gov/pubmed/28986563
http://dx.doi.org/10.1038/s41598-017-12989-x
work_keys_str_mv AT rizzettosimone impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells
AT eltahlaaudaa impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells
AT linpeijie impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells
AT bullrowena impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells
AT lloydandrewr impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells
AT hojoshuawk impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells
AT venturivanessa impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells
AT lucianifabio impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells