Cargando…

Obstacles to detecting isoforms using full-length scRNA-seq data

BACKGROUND: Early single-cell RNA-seq (scRNA-seq) studies suggested that it was unusual to see more than one isoform being produced from a gene in a single cell, even when multiple isoforms were detected in matched bulk RNA-seq samples. However, these studies generally did not consider the impact of...

Descripción completa

Detalles Bibliográficos
Autores principales: Westoby, Jennifer, Artemov, Pavel, Hemberg, Martin, Ferguson-Smith, Anne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7087381/
https://www.ncbi.nlm.nih.gov/pubmed/32293520
http://dx.doi.org/10.1186/s13059-020-01981-w
_version_ 1783509327140618240
author Westoby, Jennifer
Artemov, Pavel
Hemberg, Martin
Ferguson-Smith, Anne
author_facet Westoby, Jennifer
Artemov, Pavel
Hemberg, Martin
Ferguson-Smith, Anne
author_sort Westoby, Jennifer
collection PubMed
description BACKGROUND: Early single-cell RNA-seq (scRNA-seq) studies suggested that it was unusual to see more than one isoform being produced from a gene in a single cell, even when multiple isoforms were detected in matched bulk RNA-seq samples. However, these studies generally did not consider the impact of dropouts or isoform quantification errors, potentially confounding the results of these analyses. RESULTS: In this study, we take a simulation based approach in which we explicitly account for dropouts and isoform quantification errors. We use our simulations to ask to what extent it is possible to study alternative splicing using scRNA-seq. Additionally, we ask what limitations must be overcome to make splicing analysis feasible. We find that the high rate of dropouts associated with scRNA-seq is a major obstacle to studying alternative splicing. In mice and other well-established model organisms, the relatively low rate of isoform quantification errors poses a lesser obstacle to splicing analysis. We find that different models of isoform choice meaningfully change our simulation results. CONCLUSIONS: To accurately study alternative splicing with single-cell RNA-seq, a better understanding of isoform choice and the errors associated with scRNA-seq is required. An increase in the capture efficiency of scRNA-seq would also be beneficial. Until some or all of the above are achieved, we do not recommend attempting to resolve isoforms in individual cells using scRNA-seq.
format Online
Article
Text
id pubmed-7087381
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70873812020-03-24 Obstacles to detecting isoforms using full-length scRNA-seq data Westoby, Jennifer Artemov, Pavel Hemberg, Martin Ferguson-Smith, Anne Genome Biol Research BACKGROUND: Early single-cell RNA-seq (scRNA-seq) studies suggested that it was unusual to see more than one isoform being produced from a gene in a single cell, even when multiple isoforms were detected in matched bulk RNA-seq samples. However, these studies generally did not consider the impact of dropouts or isoform quantification errors, potentially confounding the results of these analyses. RESULTS: In this study, we take a simulation based approach in which we explicitly account for dropouts and isoform quantification errors. We use our simulations to ask to what extent it is possible to study alternative splicing using scRNA-seq. Additionally, we ask what limitations must be overcome to make splicing analysis feasible. We find that the high rate of dropouts associated with scRNA-seq is a major obstacle to studying alternative splicing. In mice and other well-established model organisms, the relatively low rate of isoform quantification errors poses a lesser obstacle to splicing analysis. We find that different models of isoform choice meaningfully change our simulation results. CONCLUSIONS: To accurately study alternative splicing with single-cell RNA-seq, a better understanding of isoform choice and the errors associated with scRNA-seq is required. An increase in the capture efficiency of scRNA-seq would also be beneficial. Until some or all of the above are achieved, we do not recommend attempting to resolve isoforms in individual cells using scRNA-seq. BioMed Central 2020-03-23 /pmc/articles/PMC7087381/ /pubmed/32293520 http://dx.doi.org/10.1186/s13059-020-01981-w Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Westoby, Jennifer
Artemov, Pavel
Hemberg, Martin
Ferguson-Smith, Anne
Obstacles to detecting isoforms using full-length scRNA-seq data
title Obstacles to detecting isoforms using full-length scRNA-seq data
title_full Obstacles to detecting isoforms using full-length scRNA-seq data
title_fullStr Obstacles to detecting isoforms using full-length scRNA-seq data
title_full_unstemmed Obstacles to detecting isoforms using full-length scRNA-seq data
title_short Obstacles to detecting isoforms using full-length scRNA-seq data
title_sort obstacles to detecting isoforms using full-length scrna-seq data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7087381/
https://www.ncbi.nlm.nih.gov/pubmed/32293520
http://dx.doi.org/10.1186/s13059-020-01981-w
work_keys_str_mv AT westobyjennifer obstaclestodetectingisoformsusingfulllengthscrnaseqdata
AT artemovpavel obstaclestodetectingisoformsusingfulllengthscrnaseqdata
AT hembergmartin obstaclestodetectingisoformsusingfulllengthscrnaseqdata
AT fergusonsmithanne obstaclestodetectingisoformsusingfulllengthscrnaseqdata