Cargando…

TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes

Eukaryotic transcriptomes are complex, involving thousands of overlapping transcripts. The interleaved nature of the transcriptomes limits our ability to identify regulatory regions, and in some cases can lead to misinterpretation of gene expression. To improve the understanding of the overlapping t...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jingwen, Li, Bingnan, Marques, Sueli, Steinmetz, Lars M, Wei, Wu, Pelechano, Vicent
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7544212/
https://www.ncbi.nlm.nih.gov/pubmed/32816037
http://dx.doi.org/10.1093/nar/gkaa691
_version_ 1783591813709299712
author Wang, Jingwen
Li, Bingnan
Marques, Sueli
Steinmetz, Lars M
Wei, Wu
Pelechano, Vicent
author_facet Wang, Jingwen
Li, Bingnan
Marques, Sueli
Steinmetz, Lars M
Wei, Wu
Pelechano, Vicent
author_sort Wang, Jingwen
collection PubMed
description Eukaryotic transcriptomes are complex, involving thousands of overlapping transcripts. The interleaved nature of the transcriptomes limits our ability to identify regulatory regions, and in some cases can lead to misinterpretation of gene expression. To improve the understanding of the overlapping transcriptomes, we have developed an optimized method, TIF-Seq2, able to sequence simultaneously the 5′ and 3′ ends of individual RNA molecules at single-nucleotide resolution. We investigated the transcriptome of a well characterized human cell line (K562) and identified thousands of unannotated transcript isoforms. By focusing on transcripts which are challenging to be investigated with RNA-Seq, we accurately defined boundaries of lowly expressed unannotated and read-through transcripts putatively encoding fusion genes. We validated our results by targeted long-read sequencing and standard RNA-Seq for chronic myeloid leukaemia patient samples. Taking the advantage of TIF-Seq2, we explored transcription regulation among overlapping units and investigated their crosstalk. We show that most overlapping upstream transcripts use poly(A) sites within the first 2 kb of the downstream transcription units. Our work shows that, by paring the 5′ and 3′ end of each RNA, TIF-Seq2 can improve the annotation of complex genomes, facilitate accurate assignment of promoters to genes and easily identify transcriptionally fused genes.
format Online
Article
Text
id pubmed-7544212
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-75442122020-10-15 TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes Wang, Jingwen Li, Bingnan Marques, Sueli Steinmetz, Lars M Wei, Wu Pelechano, Vicent Nucleic Acids Res Methods Online Eukaryotic transcriptomes are complex, involving thousands of overlapping transcripts. The interleaved nature of the transcriptomes limits our ability to identify regulatory regions, and in some cases can lead to misinterpretation of gene expression. To improve the understanding of the overlapping transcriptomes, we have developed an optimized method, TIF-Seq2, able to sequence simultaneously the 5′ and 3′ ends of individual RNA molecules at single-nucleotide resolution. We investigated the transcriptome of a well characterized human cell line (K562) and identified thousands of unannotated transcript isoforms. By focusing on transcripts which are challenging to be investigated with RNA-Seq, we accurately defined boundaries of lowly expressed unannotated and read-through transcripts putatively encoding fusion genes. We validated our results by targeted long-read sequencing and standard RNA-Seq for chronic myeloid leukaemia patient samples. Taking the advantage of TIF-Seq2, we explored transcription regulation among overlapping units and investigated their crosstalk. We show that most overlapping upstream transcripts use poly(A) sites within the first 2 kb of the downstream transcription units. Our work shows that, by paring the 5′ and 3′ end of each RNA, TIF-Seq2 can improve the annotation of complex genomes, facilitate accurate assignment of promoters to genes and easily identify transcriptionally fused genes. Oxford University Press 2020-08-20 /pmc/articles/PMC7544212/ /pubmed/32816037 http://dx.doi.org/10.1093/nar/gkaa691 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Wang, Jingwen
Li, Bingnan
Marques, Sueli
Steinmetz, Lars M
Wei, Wu
Pelechano, Vicent
TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes
title TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes
title_full TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes
title_fullStr TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes
title_full_unstemmed TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes
title_short TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes
title_sort tif-seq2 disentangles overlapping isoforms in complex human transcriptomes
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7544212/
https://www.ncbi.nlm.nih.gov/pubmed/32816037
http://dx.doi.org/10.1093/nar/gkaa691
work_keys_str_mv AT wangjingwen tifseq2disentanglesoverlappingisoformsincomplexhumantranscriptomes
AT libingnan tifseq2disentanglesoverlappingisoformsincomplexhumantranscriptomes
AT marquessueli tifseq2disentanglesoverlappingisoformsincomplexhumantranscriptomes
AT steinmetzlarsm tifseq2disentanglesoverlappingisoformsincomplexhumantranscriptomes
AT weiwu tifseq2disentanglesoverlappingisoformsincomplexhumantranscriptomes
AT pelechanovicent tifseq2disentanglesoverlappingisoformsincomplexhumantranscriptomes