Cargando…
Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads
Single-molecule long-read sequencing has been used to improve mRNA isoform identification. However, not all single-molecule long reads represent full transcripts due to incomplete cDNA synthesis and sequencing length limits. This drives a need for long-read transcript assembly. By adding long-read-s...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6918626/ https://www.ncbi.nlm.nih.gov/pubmed/31849338 http://dx.doi.org/10.1186/s13059-019-1883-0 |
_version_ | 1783480629270151168 |
---|---|
author | Tung, Laura H. Shao, Mingfu Kingsford, Carl |
author_facet | Tung, Laura H. Shao, Mingfu Kingsford, Carl |
author_sort | Tung, Laura H. |
collection | PubMed |
description | Single-molecule long-read sequencing has been used to improve mRNA isoform identification. However, not all single-molecule long reads represent full transcripts due to incomplete cDNA synthesis and sequencing length limits. This drives a need for long-read transcript assembly. By adding long-read-specific optimizations to Scallop, we developed Scallop-LR, a reference-based long-read transcript assembler. Analyzing 26 PacBio samples, we quantified the benefit of performing transcript assembly on long reads. We demonstrate Scallop-LR identifies more known transcripts and potentially novel isoforms for the human transcriptome than Iso-Seq Analysis and StringTie, indicating that long-read transcript assembly by Scallop-LR can reveal a more complete human transcriptome. |
format | Online Article Text |
id | pubmed-6918626 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69186262019-12-20 Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads Tung, Laura H. Shao, Mingfu Kingsford, Carl Genome Biol Method Single-molecule long-read sequencing has been used to improve mRNA isoform identification. However, not all single-molecule long reads represent full transcripts due to incomplete cDNA synthesis and sequencing length limits. This drives a need for long-read transcript assembly. By adding long-read-specific optimizations to Scallop, we developed Scallop-LR, a reference-based long-read transcript assembler. Analyzing 26 PacBio samples, we quantified the benefit of performing transcript assembly on long reads. We demonstrate Scallop-LR identifies more known transcripts and potentially novel isoforms for the human transcriptome than Iso-Seq Analysis and StringTie, indicating that long-read transcript assembly by Scallop-LR can reveal a more complete human transcriptome. BioMed Central 2019-12-18 /pmc/articles/PMC6918626/ /pubmed/31849338 http://dx.doi.org/10.1186/s13059-019-1883-0 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Method Tung, Laura H. Shao, Mingfu Kingsford, Carl Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads |
title | Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads |
title_full | Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads |
title_fullStr | Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads |
title_full_unstemmed | Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads |
title_short | Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads |
title_sort | quantifying the benefit offered by transcript assembly with scallop-lr on single-molecule long reads |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6918626/ https://www.ncbi.nlm.nih.gov/pubmed/31849338 http://dx.doi.org/10.1186/s13059-019-1883-0 |
work_keys_str_mv | AT tunglaurah quantifyingthebenefitofferedbytranscriptassemblywithscalloplronsinglemoleculelongreads AT shaomingfu quantifyingthebenefitofferedbytranscriptassemblywithscalloplronsinglemoleculelongreads AT kingsfordcarl quantifyingthebenefitofferedbytranscriptassemblywithscalloplronsinglemoleculelongreads |