Cargando…
Impact of sequencing depth and read length on single cell RNA sequencing data of T cells
Single cell RNA sequencing (scRNA-seq) provides great potential in measuring the gene expression profiles of heterogeneous cell populations. In immunology, scRNA-seq allowed the characterisation of transcript sequence diversity of functionally relevant T cell subsets, and the identification of the f...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5630586/ https://www.ncbi.nlm.nih.gov/pubmed/28986563 http://dx.doi.org/10.1038/s41598-017-12989-x |
_version_ | 1783269247565168640 |
---|---|
author | Rizzetto, Simone Eltahla, Auda A. Lin, Peijie Bull, Rowena Lloyd, Andrew R. Ho, Joshua W. K. Venturi, Vanessa Luciani, Fabio |
author_facet | Rizzetto, Simone Eltahla, Auda A. Lin, Peijie Bull, Rowena Lloyd, Andrew R. Ho, Joshua W. K. Venturi, Vanessa Luciani, Fabio |
author_sort | Rizzetto, Simone |
collection | PubMed |
description | Single cell RNA sequencing (scRNA-seq) provides great potential in measuring the gene expression profiles of heterogeneous cell populations. In immunology, scRNA-seq allowed the characterisation of transcript sequence diversity of functionally relevant T cell subsets, and the identification of the full length T cell receptor (TCRαβ), which defines the specificity against cognate antigens. Several factors, e.g. RNA library capture, cell quality, and sequencing output affect the quality of scRNA-seq data. We studied the effects of read length and sequencing depth on the quality of gene expression profiles, cell type identification, and TCRαβ reconstruction, utilising 1,305 single cells from 8 publically available scRNA-seq datasets, and simulation-based analyses. Gene expression was characterised by an increased number of unique genes identified with short read lengths (<50 bp), but these featured higher technical variability compared to profiles from longer reads. Successful TCRαβ reconstruction was achieved for 6 datasets (81% − 100%) with at least 0.25 millions (PE) reads of length >50 bp, while it failed for datasets with <30 bp reads. Sufficient read length and sequencing depth can control technical noise to enable accurate identification of TCRαβ and gene expression profiles from scRNA-seq data of T cells. |
format | Online Article Text |
id | pubmed-5630586 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-56305862017-10-17 Impact of sequencing depth and read length on single cell RNA sequencing data of T cells Rizzetto, Simone Eltahla, Auda A. Lin, Peijie Bull, Rowena Lloyd, Andrew R. Ho, Joshua W. K. Venturi, Vanessa Luciani, Fabio Sci Rep Article Single cell RNA sequencing (scRNA-seq) provides great potential in measuring the gene expression profiles of heterogeneous cell populations. In immunology, scRNA-seq allowed the characterisation of transcript sequence diversity of functionally relevant T cell subsets, and the identification of the full length T cell receptor (TCRαβ), which defines the specificity against cognate antigens. Several factors, e.g. RNA library capture, cell quality, and sequencing output affect the quality of scRNA-seq data. We studied the effects of read length and sequencing depth on the quality of gene expression profiles, cell type identification, and TCRαβ reconstruction, utilising 1,305 single cells from 8 publically available scRNA-seq datasets, and simulation-based analyses. Gene expression was characterised by an increased number of unique genes identified with short read lengths (<50 bp), but these featured higher technical variability compared to profiles from longer reads. Successful TCRαβ reconstruction was achieved for 6 datasets (81% − 100%) with at least 0.25 millions (PE) reads of length >50 bp, while it failed for datasets with <30 bp reads. Sufficient read length and sequencing depth can control technical noise to enable accurate identification of TCRαβ and gene expression profiles from scRNA-seq data of T cells. Nature Publishing Group UK 2017-10-06 /pmc/articles/PMC5630586/ /pubmed/28986563 http://dx.doi.org/10.1038/s41598-017-12989-x Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Rizzetto, Simone Eltahla, Auda A. Lin, Peijie Bull, Rowena Lloyd, Andrew R. Ho, Joshua W. K. Venturi, Vanessa Luciani, Fabio Impact of sequencing depth and read length on single cell RNA sequencing data of T cells |
title | Impact of sequencing depth and read length on single cell RNA sequencing data of T cells |
title_full | Impact of sequencing depth and read length on single cell RNA sequencing data of T cells |
title_fullStr | Impact of sequencing depth and read length on single cell RNA sequencing data of T cells |
title_full_unstemmed | Impact of sequencing depth and read length on single cell RNA sequencing data of T cells |
title_short | Impact of sequencing depth and read length on single cell RNA sequencing data of T cells |
title_sort | impact of sequencing depth and read length on single cell rna sequencing data of t cells |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5630586/ https://www.ncbi.nlm.nih.gov/pubmed/28986563 http://dx.doi.org/10.1038/s41598-017-12989-x |
work_keys_str_mv | AT rizzettosimone impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells AT eltahlaaudaa impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells AT linpeijie impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells AT bullrowena impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells AT lloydandrewr impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells AT hojoshuawk impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells AT venturivanessa impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells AT lucianifabio impactofsequencingdepthandreadlengthonsinglecellrnasequencingdataoftcells |