RefShannon: A genome-guided transcriptome assembler using sparse flow decomposition

High-throughput sequencing of RNA (RNA-Seq) has become a staple of modern molecular biology, with applications not only in quantifying gene expression but also in isoform-level analysis of RNA transcripts. To enable such isoform-level analysis, a transcriptome assembly algorithm is utilized t...

Bibliographic Details
Main authors: Mao, Shunfu; Pachter, Lior; Tse, David; Kannan, Sreeram
Format: Online Article Text
Language: English
Published: Public Library of Science 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7266320/
https://www.ncbi.nlm.nih.gov/pubmed/32484809
http://dx.doi.org/10.1371/journal.pone.0232946
_version_ 1783541285186961408
author Mao, Shunfu
Pachter, Lior
Tse, David
Kannan, Sreeram
collection PubMed
description High-throughput sequencing of RNA (RNA-Seq) has become a staple of modern molecular biology, with applications not only in quantifying gene expression but also in isoform-level analysis of RNA transcripts. To enable such isoform-level analysis, a transcriptome assembly algorithm is used to stitch the observed short reads together into the corresponding transcripts. This task is complicated by alternative splicing, a mechanism by which the same gene may generate multiple distinct RNA transcripts. We develop a novel genome-guided transcriptome assembler, RefShannon, that exploits the varying abundances of the different transcripts to enable accurate reconstruction of the transcripts. Our evaluation shows that RefShannon improves sensitivity by up to 22% at a given specificity in comparison with other state-of-the-art assemblers. RefShannon is written in Python and is available from GitHub (https://github.com/shunfumao/RefShannon).
format Online
Article
Text
id pubmed-7266320
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-7266320 2020-06-10 PLoS One Research Article Public Library of Science 2020-06-02 /pmc/articles/PMC7266320/ /pubmed/32484809 http://dx.doi.org/10.1371/journal.pone.0232946 Text en © 2020 Mao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title RefShannon: A genome-guided transcriptome assembler using sparse flow decomposition
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7266320/
https://www.ncbi.nlm.nih.gov/pubmed/32484809
http://dx.doi.org/10.1371/journal.pone.0232946
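
The abstract's central idea, explaining the read coverage of a gene with a small number of transcripts of differing abundance, can be illustrated with a generic flow decomposition on a splice graph. The sketch below is not RefShannon's algorithm, which this record does not describe in detail; it swaps in a simple greedy widest-path heuristic for illustration. The function names, the toy graph, and its coverage values are all hypothetical.

```python
import heapq
from collections import defaultdict


def widest_path(residual, source, sink):
    """Return (path, bottleneck) for the source-to-sink path whose smallest
    edge flow is as large as possible, or (None, 0) if no path remains."""
    adj = defaultdict(list)
    for (u, v), flow in residual.items():
        if flow > 0:
            adj[u].append((v, flow))

    best = {source: float("inf")}   # best bottleneck found so far per node
    parent = {}
    heap = [(-best[source], source)]  # max-heap via negated bottleneck
    while heap:
        neg_width, u = heapq.heappop(heap)
        width = -neg_width
        if width < best.get(u, 0):
            continue                # stale heap entry
        if u == sink:
            break
        for v, flow in adj[u]:
            cand = min(width, flow)
            if cand > best.get(v, 0):
                best[v] = cand
                parent[v] = u
                heapq.heappush(heap, (-cand, v))

    if sink not in best:
        return None, 0
    path = [sink]
    while path[-1] != source:
        path.append(parent[path[-1]])
    path.reverse()
    return path, best[sink]


def greedy_flow_decomposition(edge_flow, source, sink):
    """Greedily split edge flows (e.g. read coverage) into weighted paths.

    Hypothetical helper: repeatedly peel off the widest remaining path and
    subtract its bottleneck, so abundant transcripts are extracted first.
    """
    residual = dict(edge_flow)
    paths = []
    while True:
        path, bottleneck = widest_path(residual, source, sink)
        if path is None or len(path) < 2:
            break
        for u, v in zip(path, path[1:]):
            residual[(u, v)] -= bottleneck
        paths.append((path, bottleneck))
    return paths


# Toy splice graph (made-up values): two isoforms share exon "a".
coverage = {
    ("s", "a"): 13,
    ("a", "b"): 10, ("b", "t"): 10,
    ("a", "c"): 3, ("c", "t"): 3,
}
print(greedy_flow_decomposition(coverage, "s", "t"))
# [(['s', 'a', 'b', 't'], 10), (['s', 'a', 'c', 't'], 3)]
```

In this toy case the differing abundances (10 and 3) make the decomposition unambiguous, which is the intuition behind using abundance as a signal. A greedy heuristic like this one does not guarantee the sparsest decomposition in general.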