Cargando…

Accurate assembly of transcripts through phase-preserving graph decomposition

We introduce Scallop, an accurate reference-based transcript assembler that improves reconstruction of multi-exon and lowly expressed transcripts. Scallop preserves long-range phasing paths extracted from reads, while producing a parsimonious set of transcripts and minimizing coverage deviation. On...

Descripción completa

Detalles Bibliográficos
Autores principales: Shao, Mingfu, Kingsford, Carl
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5722698/
https://www.ncbi.nlm.nih.gov/pubmed/29131147
http://dx.doi.org/10.1038/nbt.4020
_version_ 1783285064744828928
author Shao, Mingfu
Kingsford, Carl
author_facet Shao, Mingfu
Kingsford, Carl
author_sort Shao, Mingfu
collection PubMed
description We introduce Scallop, an accurate reference-based transcript assembler that improves reconstruction of multi-exon and lowly expressed transcripts. Scallop preserves long-range phasing paths extracted from reads, while producing a parsimonious set of transcripts and minimizing coverage deviation. On 10 human RNA-seq samples, Scallop produces 34.5% and 36.3% more correct multi-exon transcripts than StringTie and TransComb, and respectively identifies 67.5% and 52.3% more lowly expressed transcripts. Scallop achieves higher sensitivity and precision than previous approaches over a wide range of coverage thresholds.
format Online
Article
Text
id pubmed-5722698
institution National Center for Biotechnology Information
language English
publishDate 2017
record_format MEDLINE/PubMed
spelling pubmed-57226982018-05-13 Accurate assembly of transcripts through phase-preserving graph decomposition Shao, Mingfu Kingsford, Carl Nat Biotechnol Article We introduce Scallop, an accurate reference-based transcript assembler that improves reconstruction of multi-exon and lowly expressed transcripts. Scallop preserves long-range phasing paths extracted from reads, while producing a parsimonious set of transcripts and minimizing coverage deviation. On 10 human RNA-seq samples, Scallop produces 34.5% and 36.3% more correct multi-exon transcripts than StringTie and TransComb, and respectively identifies 67.5% and 52.3% more lowly expressed transcripts. Scallop achieves higher sensitivity and precision than previous approaches over a wide range of coverage thresholds. 2017-11-13 2017-12 /pmc/articles/PMC5722698/ /pubmed/29131147 http://dx.doi.org/10.1038/nbt.4020 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Shao, Mingfu
Kingsford, Carl
Accurate assembly of transcripts through phase-preserving graph decomposition
title Accurate assembly of transcripts through phase-preserving graph decomposition
title_full Accurate assembly of transcripts through phase-preserving graph decomposition
title_fullStr Accurate assembly of transcripts through phase-preserving graph decomposition
title_full_unstemmed Accurate assembly of transcripts through phase-preserving graph decomposition
title_short Accurate assembly of transcripts through phase-preserving graph decomposition
title_sort accurate assembly of transcripts through phase-preserving graph decomposition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5722698/
https://www.ncbi.nlm.nih.gov/pubmed/29131147
http://dx.doi.org/10.1038/nbt.4020
work_keys_str_mv AT shaomingfu accurateassemblyoftranscriptsthroughphasepreservinggraphdecomposition
AT kingsfordcarl accurateassemblyoftranscriptsthroughphasepreservinggraphdecomposition