Cargando…

BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data

High-throughput RNA-seq technology has provided an unprecedented opportunity to reveal the very complex structures of transcriptomes. However, it is an important and highly challenging task to assemble vast amounts of short RNA-seq reads into transcriptomes with alternative splicing isoforms. In thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Juntao, Li, Guojun, Chang, Zheng, Yu, Ting, Liu, Bingqiang, McMullen, Rick, Chen, Pengyin, Huang, Xiuzhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4760927/
https://www.ncbi.nlm.nih.gov/pubmed/26894997
http://dx.doi.org/10.1371/journal.pcbi.1004772
_version_ 1782416908673351680
author Liu, Juntao
Li, Guojun
Chang, Zheng
Yu, Ting
Liu, Bingqiang
McMullen, Rick
Chen, Pengyin
Huang, Xiuzhen
author_facet Liu, Juntao
Li, Guojun
Chang, Zheng
Yu, Ting
Liu, Bingqiang
McMullen, Rick
Chen, Pengyin
Huang, Xiuzhen
author_sort Liu, Juntao
collection PubMed
description High-throughput RNA-seq technology has provided an unprecedented opportunity to reveal the very complex structures of transcriptomes. However, it is an important and highly challenging task to assemble vast amounts of short RNA-seq reads into transcriptomes with alternative splicing isoforms. In this study, we present a novel de novo assembler, BinPacker, by modeling the transcriptome assembly problem as tracking a set of trajectories of items with their sizes representing coverage of their corresponding isoforms by solving a series of bin-packing problems. This approach, which subtly integrates coverage information into the procedure, has two exclusive features: 1) only splicing junctions are involved in the assembling procedure; 2) massive pell-mell reads are assembled seemingly by moving a comb along junction edges on a splicing graph. Being tested on both real and simulated RNA-seq datasets, it outperforms almost all the existing de novo assemblers on all the tested datasets, and even outperforms those ab initio assemblers on the real dog dataset. In addition, it runs substantially faster and requires less memory space than most of the assemblers. BinPacker is published under GNU GENERAL PUBLIC LICENSE and the source is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_1.0.tar.gz/download. Quick installation version is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_binary.tar.gz/download.
format Online
Article
Text
id pubmed-4760927
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-47609272016-03-07 BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data Liu, Juntao Li, Guojun Chang, Zheng Yu, Ting Liu, Bingqiang McMullen, Rick Chen, Pengyin Huang, Xiuzhen PLoS Comput Biol Research Article High-throughput RNA-seq technology has provided an unprecedented opportunity to reveal the very complex structures of transcriptomes. However, it is an important and highly challenging task to assemble vast amounts of short RNA-seq reads into transcriptomes with alternative splicing isoforms. In this study, we present a novel de novo assembler, BinPacker, by modeling the transcriptome assembly problem as tracking a set of trajectories of items with their sizes representing coverage of their corresponding isoforms by solving a series of bin-packing problems. This approach, which subtly integrates coverage information into the procedure, has two exclusive features: 1) only splicing junctions are involved in the assembling procedure; 2) massive pell-mell reads are assembled seemingly by moving a comb along junction edges on a splicing graph. Being tested on both real and simulated RNA-seq datasets, it outperforms almost all the existing de novo assemblers on all the tested datasets, and even outperforms those ab initio assemblers on the real dog dataset. In addition, it runs substantially faster and requires less memory space than most of the assemblers. BinPacker is published under GNU GENERAL PUBLIC LICENSE and the source is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_1.0.tar.gz/download. Quick installation version is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_binary.tar.gz/download. Public Library of Science 2016-02-19 /pmc/articles/PMC4760927/ /pubmed/26894997 http://dx.doi.org/10.1371/journal.pcbi.1004772 Text en © 2016 Liu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Liu, Juntao
Li, Guojun
Chang, Zheng
Yu, Ting
Liu, Bingqiang
McMullen, Rick
Chen, Pengyin
Huang, Xiuzhen
BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data
title BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data
title_full BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data
title_fullStr BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data
title_full_unstemmed BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data
title_short BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data
title_sort binpacker: packing-based de novo transcriptome assembly from rna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4760927/
https://www.ncbi.nlm.nih.gov/pubmed/26894997
http://dx.doi.org/10.1371/journal.pcbi.1004772
work_keys_str_mv AT liujuntao binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata
AT liguojun binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata
AT changzheng binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata
AT yuting binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata
AT liubingqiang binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata
AT mcmullenrick binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata
AT chenpengyin binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata
AT huangxiuzhen binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata