Cargando…

Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development

Massively parallel sequencing of the polyadenylated RNAs has played a key role in delineating transcriptome complexity, including alternative use of an exon, promoter, 5′ or 3′ splice site or polyadenylation site, and RNA modification. However, reads derived from the current RNA-seq technologies are...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Runsheng, Ren, Xiaoliang, Ding, Qiutao, Bi, Yu, Xie, Dongying, Zhao, Zhongying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050527/
https://www.ncbi.nlm.nih.gov/pubmed/32024662
http://dx.doi.org/10.1101/gr.251512.119
_version_ 1783502626864758784
author Li, Runsheng
Ren, Xiaoliang
Ding, Qiutao
Bi, Yu
Xie, Dongying
Zhao, Zhongying
author_facet Li, Runsheng
Ren, Xiaoliang
Ding, Qiutao
Bi, Yu
Xie, Dongying
Zhao, Zhongying
author_sort Li, Runsheng
collection PubMed
description Massively parallel sequencing of the polyadenylated RNAs has played a key role in delineating transcriptome complexity, including alternative use of an exon, promoter, 5′ or 3′ splice site or polyadenylation site, and RNA modification. However, reads derived from the current RNA-seq technologies are usually short and deprived of information on modification, compromising their potential in defining transcriptome complexity. Here, we applied a direct RNA sequencing method with ultralong reads using Oxford Nanopore Technologies to study the transcriptome complexity in Caenorhabditis elegans. We generated approximately six million reads using native poly(A)-tailed mRNAs from three developmental stages, with average read lengths ranging from 900 to 1100 nt. Around half of the reads represent full-length transcripts. To utilize the full-length transcripts in defining transcriptome complexity, we devised a method to classify the long reads as the same as existing transcripts or as a novel transcript using sequence mapping tracks rather than existing intron/exon structures, which allowed us to identify roughly 57,000 novel isoforms and recover at least 26,000 out of the 33,500 existing isoforms. The sets of genes with differential expression versus differential isoform usage over development are largely different, implying a fine-tuned regulation at isoform level. We also observed an unexpected increase in putative RNA modification in all bases in the coding region relative to the UTR, suggesting their possible roles in translation. The RNA reads and the method for read classification are expected to deliver new insights into RNA processing and modification and their underlying biology in the future.
format Online
Article
Text
id pubmed-7050527
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-70505272020-08-01 Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development Li, Runsheng Ren, Xiaoliang Ding, Qiutao Bi, Yu Xie, Dongying Zhao, Zhongying Genome Res Resource Massively parallel sequencing of the polyadenylated RNAs has played a key role in delineating transcriptome complexity, including alternative use of an exon, promoter, 5′ or 3′ splice site or polyadenylation site, and RNA modification. However, reads derived from the current RNA-seq technologies are usually short and deprived of information on modification, compromising their potential in defining transcriptome complexity. Here, we applied a direct RNA sequencing method with ultralong reads using Oxford Nanopore Technologies to study the transcriptome complexity in Caenorhabditis elegans. We generated approximately six million reads using native poly(A)-tailed mRNAs from three developmental stages, with average read lengths ranging from 900 to 1100 nt. Around half of the reads represent full-length transcripts. To utilize the full-length transcripts in defining transcriptome complexity, we devised a method to classify the long reads as the same as existing transcripts or as a novel transcript using sequence mapping tracks rather than existing intron/exon structures, which allowed us to identify roughly 57,000 novel isoforms and recover at least 26,000 out of the 33,500 existing isoforms. The sets of genes with differential expression versus differential isoform usage over development are largely different, implying a fine-tuned regulation at isoform level. We also observed an unexpected increase in putative RNA modification in all bases in the coding region relative to the UTR, suggesting their possible roles in translation. The RNA reads and the method for read classification are expected to deliver new insights into RNA processing and modification and their underlying biology in the future. Cold Spring Harbor Laboratory Press 2020-02 /pmc/articles/PMC7050527/ /pubmed/32024662 http://dx.doi.org/10.1101/gr.251512.119 Text en © 2020 Li et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Resource
Li, Runsheng
Ren, Xiaoliang
Ding, Qiutao
Bi, Yu
Xie, Dongying
Zhao, Zhongying
Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development
title Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development
title_full Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development
title_fullStr Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development
title_full_unstemmed Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development
title_short Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development
title_sort direct full-length rna sequencing reveals unexpected transcriptome complexity during caenorhabditis elegans development
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050527/
https://www.ncbi.nlm.nih.gov/pubmed/32024662
http://dx.doi.org/10.1101/gr.251512.119
work_keys_str_mv AT lirunsheng directfulllengthrnasequencingrevealsunexpectedtranscriptomecomplexityduringcaenorhabditiselegansdevelopment
AT renxiaoliang directfulllengthrnasequencingrevealsunexpectedtranscriptomecomplexityduringcaenorhabditiselegansdevelopment
AT dingqiutao directfulllengthrnasequencingrevealsunexpectedtranscriptomecomplexityduringcaenorhabditiselegansdevelopment
AT biyu directfulllengthrnasequencingrevealsunexpectedtranscriptomecomplexityduringcaenorhabditiselegansdevelopment
AT xiedongying directfulllengthrnasequencingrevealsunexpectedtranscriptomecomplexityduringcaenorhabditiselegansdevelopment
AT zhaozhongying directfulllengthrnasequencingrevealsunexpectedtranscriptomecomplexityduringcaenorhabditiselegansdevelopment