Cargando…

Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer

The characterization of human gene expression is limited by short read lengths, high error rates and large input requirements. Here, we used a synthetic long read (SLR) sequencing approach, LoopSeq, to generate accurate sequencing reads that span full length transcripts using standard short read dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Silvia, Wu, Indira, Yu, Yan-Ping, Balamotis, Michael, Ren, Baoguo, Ben Yehezkel, Tuval, Luo, Jian-Hua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8079361/
https://www.ncbi.nlm.nih.gov/pubmed/33907296
http://dx.doi.org/10.1038/s42003-021-02024-1
_version_ 1783685212163538944
author Liu, Silvia
Wu, Indira
Yu, Yan-Ping
Balamotis, Michael
Ren, Baoguo
Ben Yehezkel, Tuval
Luo, Jian-Hua
author_facet Liu, Silvia
Wu, Indira
Yu, Yan-Ping
Balamotis, Michael
Ren, Baoguo
Ben Yehezkel, Tuval
Luo, Jian-Hua
author_sort Liu, Silvia
collection PubMed
description The characterization of human gene expression is limited by short read lengths, high error rates and large input requirements. Here, we used a synthetic long read (SLR) sequencing approach, LoopSeq, to generate accurate sequencing reads that span full length transcripts using standard short read data. LoopSeq identified isoforms from control samples with 99.4% accuracy and a 0.01% per-base error rate, exceeding the accuracy reported for other long-read technologies. Applied to targeted transcriptome sequencing from colon cancers and their metastatic counterparts, LoopSeq revealed large scale isoform redistributions from benign colon mucosa to primary colon cancer and metastatic cancer and identified several previously unknown fusion isoforms. Strikingly, single nucleotide variants (SNVs) occurred dominantly in specific isoforms and some SNVs underwent isoform switching in cancer progression. The ability to use short reads to generate accurate long-read data as the raw unit of information holds promise as a widely accessible approach in transcriptome sequencing.
format Online
Article
Text
id pubmed-8079361
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-80793612021-05-05 Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer Liu, Silvia Wu, Indira Yu, Yan-Ping Balamotis, Michael Ren, Baoguo Ben Yehezkel, Tuval Luo, Jian-Hua Commun Biol Article The characterization of human gene expression is limited by short read lengths, high error rates and large input requirements. Here, we used a synthetic long read (SLR) sequencing approach, LoopSeq, to generate accurate sequencing reads that span full length transcripts using standard short read data. LoopSeq identified isoforms from control samples with 99.4% accuracy and a 0.01% per-base error rate, exceeding the accuracy reported for other long-read technologies. Applied to targeted transcriptome sequencing from colon cancers and their metastatic counterparts, LoopSeq revealed large scale isoform redistributions from benign colon mucosa to primary colon cancer and metastatic cancer and identified several previously unknown fusion isoforms. Strikingly, single nucleotide variants (SNVs) occurred dominantly in specific isoforms and some SNVs underwent isoform switching in cancer progression. The ability to use short reads to generate accurate long-read data as the raw unit of information holds promise as a widely accessible approach in transcriptome sequencing. Nature Publishing Group UK 2021-04-27 /pmc/articles/PMC8079361/ /pubmed/33907296 http://dx.doi.org/10.1038/s42003-021-02024-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Liu, Silvia
Wu, Indira
Yu, Yan-Ping
Balamotis, Michael
Ren, Baoguo
Ben Yehezkel, Tuval
Luo, Jian-Hua
Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer
title Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer
title_full Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer
title_fullStr Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer
title_full_unstemmed Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer
title_short Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer
title_sort targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8079361/
https://www.ncbi.nlm.nih.gov/pubmed/33907296
http://dx.doi.org/10.1038/s42003-021-02024-1
work_keys_str_mv AT liusilvia targetedtranscriptomeanalysisusingsyntheticlongreadsequencinguncoversisoformreprogramingintheprogressionofcoloncancer
AT wuindira targetedtranscriptomeanalysisusingsyntheticlongreadsequencinguncoversisoformreprogramingintheprogressionofcoloncancer
AT yuyanping targetedtranscriptomeanalysisusingsyntheticlongreadsequencinguncoversisoformreprogramingintheprogressionofcoloncancer
AT balamotismichael targetedtranscriptomeanalysisusingsyntheticlongreadsequencinguncoversisoformreprogramingintheprogressionofcoloncancer
AT renbaoguo targetedtranscriptomeanalysisusingsyntheticlongreadsequencinguncoversisoformreprogramingintheprogressionofcoloncancer
AT benyehezkeltuval targetedtranscriptomeanalysisusingsyntheticlongreadsequencinguncoversisoformreprogramingintheprogressionofcoloncancer
AT luojianhua targetedtranscriptomeanalysisusingsyntheticlongreadsequencinguncoversisoformreprogramingintheprogressionofcoloncancer