Cargando…

BOAssembler: A Bayesian Optimization Framework to Improve RNA-Seq Assembly Performance

High throughput sequencing of RNA (RNA-Seq) can provide us with millions of short fragments of RNA transcripts from a sample. How to better recover the original RNA transcripts from those fragments (RNA-Seq assembly) is still a difficult task. For example, RNA-Seq assembly tools typically require hy...

Descripción completa

Detalles Bibliográficos
Autores principales: Mao, Shunfu, Jiang, Yihan, Mathew, Edwin Basil, Kannan, Sreeram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7197064/
http://dx.doi.org/10.1007/978-3-030-42266-0_15
_version_ 1783528810075914240
author Mao, Shunfu
Jiang, Yihan
Mathew, Edwin Basil
Kannan, Sreeram
author_facet Mao, Shunfu
Jiang, Yihan
Mathew, Edwin Basil
Kannan, Sreeram
author_sort Mao, Shunfu
collection PubMed
description High throughput sequencing of RNA (RNA-Seq) can provide us with millions of short fragments of RNA transcripts from a sample. How to better recover the original RNA transcripts from those fragments (RNA-Seq assembly) is still a difficult task. For example, RNA-Seq assembly tools typically require hyper-parameter tuning to achieve good performance for particular datasets. This kind of tuning is usually unintuitive and time-consuming. Consequently, users often resort to default parameters, which do not guarantee consistent good performance for various datasets. Results: Here we propose BOAssembler, a framework that enables end-to-end automatic tuning of RNA-Seq assemblers, based on Bayesian Optimization principles. Experiments show this data-driven approach is effective to improve the overall assembly performance. The approach would be helpful for downstream (e.g. gene, protein, cell) analysis, and more broadly, for future bioinformatics benchmark studies. Availability: https://github.com/shunfumao/boassembler.
format Online
Article
Text
id pubmed-7197064
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-71970642020-05-04 BOAssembler: A Bayesian Optimization Framework to Improve RNA-Seq Assembly Performance Mao, Shunfu Jiang, Yihan Mathew, Edwin Basil Kannan, Sreeram Algorithms for Computational Biology Article High throughput sequencing of RNA (RNA-Seq) can provide us with millions of short fragments of RNA transcripts from a sample. How to better recover the original RNA transcripts from those fragments (RNA-Seq assembly) is still a difficult task. For example, RNA-Seq assembly tools typically require hyper-parameter tuning to achieve good performance for particular datasets. This kind of tuning is usually unintuitive and time-consuming. Consequently, users often resort to default parameters, which do not guarantee consistent good performance for various datasets. Results: Here we propose BOAssembler, a framework that enables end-to-end automatic tuning of RNA-Seq assemblers, based on Bayesian Optimization principles. Experiments show this data-driven approach is effective to improve the overall assembly performance. The approach would be helpful for downstream (e.g. gene, protein, cell) analysis, and more broadly, for future bioinformatics benchmark studies. Availability: https://github.com/shunfumao/boassembler. 2020-02-01 /pmc/articles/PMC7197064/ http://dx.doi.org/10.1007/978-3-030-42266-0_15 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Mao, Shunfu
Jiang, Yihan
Mathew, Edwin Basil
Kannan, Sreeram
BOAssembler: A Bayesian Optimization Framework to Improve RNA-Seq Assembly Performance
title BOAssembler: A Bayesian Optimization Framework to Improve RNA-Seq Assembly Performance
title_full BOAssembler: A Bayesian Optimization Framework to Improve RNA-Seq Assembly Performance
title_fullStr BOAssembler: A Bayesian Optimization Framework to Improve RNA-Seq Assembly Performance
title_full_unstemmed BOAssembler: A Bayesian Optimization Framework to Improve RNA-Seq Assembly Performance
title_short BOAssembler: A Bayesian Optimization Framework to Improve RNA-Seq Assembly Performance
title_sort boassembler: a bayesian optimization framework to improve rna-seq assembly performance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7197064/
http://dx.doi.org/10.1007/978-3-030-42266-0_15
work_keys_str_mv AT maoshunfu boassemblerabayesianoptimizationframeworktoimprovernaseqassemblyperformance
AT jiangyihan boassemblerabayesianoptimizationframeworktoimprovernaseqassemblyperformance
AT mathewedwinbasil boassemblerabayesianoptimizationframeworktoimprovernaseqassemblyperformance
AT kannansreeram boassemblerabayesianoptimizationframeworktoimprovernaseqassemblyperformance