Cargando…

Methodologies for Transcript Profiling Using Long-Read Technologies

RNA sequencing using next-generation sequencing technologies (NGS) is currently the standard approach for gene expression profiling, particularly for large-scale high-throughput studies. NGS technologies comprise high throughput, cost efficient short-read RNA-Seq, while emerging single molecule, lon...

Descripción completa

Detalles Bibliográficos
Autores principales: Oikonomopoulos, Spyros, Bayega, Anthony, Fahiminiya, Somayyeh, Djambazian, Haig, Berube, Pierre, Ragoussis, Jiannis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7358353/
https://www.ncbi.nlm.nih.gov/pubmed/32733532
http://dx.doi.org/10.3389/fgene.2020.00606
_version_ 1783558835275825152
author Oikonomopoulos, Spyros
Bayega, Anthony
Fahiminiya, Somayyeh
Djambazian, Haig
Berube, Pierre
Ragoussis, Jiannis
author_facet Oikonomopoulos, Spyros
Bayega, Anthony
Fahiminiya, Somayyeh
Djambazian, Haig
Berube, Pierre
Ragoussis, Jiannis
author_sort Oikonomopoulos, Spyros
collection PubMed
description RNA sequencing using next-generation sequencing technologies (NGS) is currently the standard approach for gene expression profiling, particularly for large-scale high-throughput studies. NGS technologies comprise high throughput, cost efficient short-read RNA-Seq, while emerging single molecule, long-read RNA-Seq technologies have enabled new approaches to study the transcriptome and its function. The emerging single molecule, long-read technologies are currently commercially available by Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT), while new methodologies based on short-read sequencing approaches are also being developed in order to provide long range single molecule level information—for example, the ones represented by the 10x Genomics linked read methodology. The shift toward long-read sequencing technologies for transcriptome characterization is based on current increases in throughput and decreases in cost, making these attractive for de novo transcriptome assembly, isoform expression quantification, and in-depth RNA species analysis. These types of analyses were challenging with standard short sequencing approaches, due to the complex nature of the transcriptome, which consists of variable lengths of transcripts and multiple alternatively spliced isoforms for most genes, as well as the high sequence similarity of highly abundant species of RNA, such as rRNAs. Here we aim to focus on single molecule level sequencing technologies and single-cell technologies that, combined with perturbation tools, allow the analysis of complete RNA species, whether short or long, at high resolution. In parallel, these tools have opened new ways in understanding gene functions at the tissue, network, and pathway levels, as well as their detailed functional characterization. Analysis of the epi-transcriptome, including RNA methylation and modification and the effects of such modifications on biological systems is now enabled through direct RNA sequencing instead of classical indirect approaches. However, many difficulties and challenges remain, such as methodologies to generate full-length RNA or cDNA libraries from all different species of RNAs, not only poly-A containing transcripts, and the identification of allele-specific transcripts due to current error rates of single molecule technologies, while the bioinformatics analysis on long-read data for accurate identification of 5′ and 3′ UTRs is still in development.
format Online
Article
Text
id pubmed-7358353
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-73583532020-07-29 Methodologies for Transcript Profiling Using Long-Read Technologies Oikonomopoulos, Spyros Bayega, Anthony Fahiminiya, Somayyeh Djambazian, Haig Berube, Pierre Ragoussis, Jiannis Front Genet Genetics RNA sequencing using next-generation sequencing technologies (NGS) is currently the standard approach for gene expression profiling, particularly for large-scale high-throughput studies. NGS technologies comprise high throughput, cost efficient short-read RNA-Seq, while emerging single molecule, long-read RNA-Seq technologies have enabled new approaches to study the transcriptome and its function. The emerging single molecule, long-read technologies are currently commercially available by Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT), while new methodologies based on short-read sequencing approaches are also being developed in order to provide long range single molecule level information—for example, the ones represented by the 10x Genomics linked read methodology. The shift toward long-read sequencing technologies for transcriptome characterization is based on current increases in throughput and decreases in cost, making these attractive for de novo transcriptome assembly, isoform expression quantification, and in-depth RNA species analysis. These types of analyses were challenging with standard short sequencing approaches, due to the complex nature of the transcriptome, which consists of variable lengths of transcripts and multiple alternatively spliced isoforms for most genes, as well as the high sequence similarity of highly abundant species of RNA, such as rRNAs. Here we aim to focus on single molecule level sequencing technologies and single-cell technologies that, combined with perturbation tools, allow the analysis of complete RNA species, whether short or long, at high resolution. In parallel, these tools have opened new ways in understanding gene functions at the tissue, network, and pathway levels, as well as their detailed functional characterization. Analysis of the epi-transcriptome, including RNA methylation and modification and the effects of such modifications on biological systems is now enabled through direct RNA sequencing instead of classical indirect approaches. However, many difficulties and challenges remain, such as methodologies to generate full-length RNA or cDNA libraries from all different species of RNAs, not only poly-A containing transcripts, and the identification of allele-specific transcripts due to current error rates of single molecule technologies, while the bioinformatics analysis on long-read data for accurate identification of 5′ and 3′ UTRs is still in development. Frontiers Media S.A. 2020-07-07 /pmc/articles/PMC7358353/ /pubmed/32733532 http://dx.doi.org/10.3389/fgene.2020.00606 Text en Copyright © 2020 Oikonomopoulos, Bayega, Fahiminiya, Djambazian, Berube and Ragoussis. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Oikonomopoulos, Spyros
Bayega, Anthony
Fahiminiya, Somayyeh
Djambazian, Haig
Berube, Pierre
Ragoussis, Jiannis
Methodologies for Transcript Profiling Using Long-Read Technologies
title Methodologies for Transcript Profiling Using Long-Read Technologies
title_full Methodologies for Transcript Profiling Using Long-Read Technologies
title_fullStr Methodologies for Transcript Profiling Using Long-Read Technologies
title_full_unstemmed Methodologies for Transcript Profiling Using Long-Read Technologies
title_short Methodologies for Transcript Profiling Using Long-Read Technologies
title_sort methodologies for transcript profiling using long-read technologies
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7358353/
https://www.ncbi.nlm.nih.gov/pubmed/32733532
http://dx.doi.org/10.3389/fgene.2020.00606
work_keys_str_mv AT oikonomopoulosspyros methodologiesfortranscriptprofilingusinglongreadtechnologies
AT bayegaanthony methodologiesfortranscriptprofilingusinglongreadtechnologies
AT fahiminiyasomayyeh methodologiesfortranscriptprofilingusinglongreadtechnologies
AT djambazianhaig methodologiesfortranscriptprofilingusinglongreadtechnologies
AT berubepierre methodologiesfortranscriptprofilingusinglongreadtechnologies
AT ragoussisjiannis methodologiesfortranscriptprofilingusinglongreadtechnologies