
Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads

Single-molecule long-read sequencing has been used to improve mRNA isoform identification. However, not all single-molecule long reads represent full transcripts due to incomplete cDNA synthesis and sequencing length limits. This drives a need for long-read transcript assembly. By adding long-read-s...


Bibliographic Details
Main authors: Tung, Laura H., Shao, Mingfu, Kingsford, Carl
Format: Online Article Text
Language: English
Published: BioMed Central 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6918626/
https://www.ncbi.nlm.nih.gov/pubmed/31849338
http://dx.doi.org/10.1186/s13059-019-1883-0
author Tung, Laura H.
Shao, Mingfu
Kingsford, Carl
collection PubMed
description Single-molecule long-read sequencing has been used to improve mRNA isoform identification. However, not all single-molecule long reads represent full transcripts due to incomplete cDNA synthesis and sequencing length limits. This drives a need for long-read transcript assembly. By adding long-read-specific optimizations to Scallop, we developed Scallop-LR, a reference-based long-read transcript assembler. Analyzing 26 PacBio samples, we quantified the benefit of performing transcript assembly on long reads. We demonstrate Scallop-LR identifies more known transcripts and potentially novel isoforms for the human transcriptome than Iso-Seq Analysis and StringTie, indicating that long-read transcript assembly by Scallop-LR can reveal a more complete human transcriptome.
format Online
Article
Text
id pubmed-6918626
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-6918626 2019-12-18 Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads Tung, Laura H. Shao, Mingfu Kingsford, Carl Genome Biol Method Single-molecule long-read sequencing has been used to improve mRNA isoform identification. However, not all single-molecule long reads represent full transcripts due to incomplete cDNA synthesis and sequencing length limits. This drives a need for long-read transcript assembly. By adding long-read-specific optimizations to Scallop, we developed Scallop-LR, a reference-based long-read transcript assembler. Analyzing 26 PacBio samples, we quantified the benefit of performing transcript assembly on long reads. We demonstrate Scallop-LR identifies more known transcripts and potentially novel isoforms for the human transcriptome than Iso-Seq Analysis and StringTie, indicating that long-read transcript assembly by Scallop-LR can reveal a more complete human transcriptome. BioMed Central 2019-12-18 /pmc/articles/PMC6918626/ /pubmed/31849338 http://dx.doi.org/10.1186/s13059-019-1883-0 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6918626/
https://www.ncbi.nlm.nih.gov/pubmed/31849338
http://dx.doi.org/10.1186/s13059-019-1883-0