
Transcript assembly improves expression quantification of transposable elements in single-cell RNA-seq data

Transposable elements (TEs) are an integral part of the host transcriptome. TE-containing noncoding RNAs (ncRNAs) show considerable tissue specificity and play important roles during development, including stem cell maintenance and cell differentiation. Recent advances in single-cell RNA-seq (scRNA-...


Bibliographic Details
Main authors: Shao, Wanqing; Wang, Ting
Format: Online Article Text
Language: English
Published: Cold Spring Harbor Laboratory Press 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7849386/
https://www.ncbi.nlm.nih.gov/pubmed/33355230
http://dx.doi.org/10.1101/gr.265173.120
_version_ 1783645291214274560
author Shao, Wanqing
Wang, Ting
collection PubMed
description Transposable elements (TEs) are an integral part of the host transcriptome. TE-containing noncoding RNAs (ncRNAs) show considerable tissue specificity and play important roles during development, including stem cell maintenance and cell differentiation. Recent advances in single-cell RNA-seq (scRNA-seq) revolutionized cell type–specific gene expression analysis. However, effective scRNA-seq quantification tools tailored for TEs are lacking, limiting our ability to dissect TE expression dynamics at single-cell resolution. To address this issue, we established a TE expression quantification pipeline that is compatible with scRNA-seq data generated across multiple technology platforms. We constructed TE-containing ncRNA references using bulk RNA-seq data and showed that quantifying TE expression at the transcript level effectively reduces noise. As proof of principle, we applied this strategy to mouse embryonic stem cells and successfully captured the expression profile of endogenous retroviruses in single cells. We further expanded our analysis to scRNA-seq data from early stages of mouse embryogenesis. Our results illustrated the dynamic TE expression at preimplantation stages and revealed 146 TE-containing ncRNA transcripts with substantial tissue specificity during gastrulation and early organogenesis.
format Online
Article
Text
id pubmed-7849386
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-7849386 2021-07-01 Transcript assembly improves expression quantification of transposable elements in single-cell RNA-seq data Shao, Wanqing Wang, Ting Genome Res Method Cold Spring Harbor Laboratory Press 2021-01 /pmc/articles/PMC7849386/ /pubmed/33355230 http://dx.doi.org/10.1101/gr.265173.120 Text en © 2021 Shao and Wang; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Shao, Wanqing
Wang, Ting
Transcript assembly improves expression quantification of transposable elements in single-cell RNA-seq data
title Transcript assembly improves expression quantification of transposable elements in single-cell RNA-seq data
topic Method