Cargando…
BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data
High-throughput RNA-seq technology has provided an unprecedented opportunity to reveal the very complex structures of transcriptomes. However, it is an important and highly challenging task to assemble vast amounts of short RNA-seq reads into transcriptomes with alternative splicing isoforms. In thi...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4760927/ https://www.ncbi.nlm.nih.gov/pubmed/26894997 http://dx.doi.org/10.1371/journal.pcbi.1004772 |
_version_ | 1782416908673351680 |
---|---|
author | Liu, Juntao Li, Guojun Chang, Zheng Yu, Ting Liu, Bingqiang McMullen, Rick Chen, Pengyin Huang, Xiuzhen |
author_facet | Liu, Juntao Li, Guojun Chang, Zheng Yu, Ting Liu, Bingqiang McMullen, Rick Chen, Pengyin Huang, Xiuzhen |
author_sort | Liu, Juntao |
collection | PubMed |
description | High-throughput RNA-seq technology has provided an unprecedented opportunity to reveal the very complex structures of transcriptomes. However, it is an important and highly challenging task to assemble vast amounts of short RNA-seq reads into transcriptomes with alternative splicing isoforms. In this study, we present a novel de novo assembler, BinPacker, by modeling the transcriptome assembly problem as tracking a set of trajectories of items with their sizes representing coverage of their corresponding isoforms by solving a series of bin-packing problems. This approach, which subtly integrates coverage information into the procedure, has two exclusive features: 1) only splicing junctions are involved in the assembling procedure; 2) massive pell-mell reads are assembled seemingly by moving a comb along junction edges on a splicing graph. Being tested on both real and simulated RNA-seq datasets, it outperforms almost all the existing de novo assemblers on all the tested datasets, and even outperforms those ab initio assemblers on the real dog dataset. In addition, it runs substantially faster and requires less memory space than most of the assemblers. BinPacker is published under GNU GENERAL PUBLIC LICENSE and the source is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_1.0.tar.gz/download. Quick installation version is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_binary.tar.gz/download. |
format | Online Article Text |
id | pubmed-4760927 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-47609272016-03-07 BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data Liu, Juntao Li, Guojun Chang, Zheng Yu, Ting Liu, Bingqiang McMullen, Rick Chen, Pengyin Huang, Xiuzhen PLoS Comput Biol Research Article High-throughput RNA-seq technology has provided an unprecedented opportunity to reveal the very complex structures of transcriptomes. However, it is an important and highly challenging task to assemble vast amounts of short RNA-seq reads into transcriptomes with alternative splicing isoforms. In this study, we present a novel de novo assembler, BinPacker, by modeling the transcriptome assembly problem as tracking a set of trajectories of items with their sizes representing coverage of their corresponding isoforms by solving a series of bin-packing problems. This approach, which subtly integrates coverage information into the procedure, has two exclusive features: 1) only splicing junctions are involved in the assembling procedure; 2) massive pell-mell reads are assembled seemingly by moving a comb along junction edges on a splicing graph. Being tested on both real and simulated RNA-seq datasets, it outperforms almost all the existing de novo assemblers on all the tested datasets, and even outperforms those ab initio assemblers on the real dog dataset. In addition, it runs substantially faster and requires less memory space than most of the assemblers. BinPacker is published under GNU GENERAL PUBLIC LICENSE and the source is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_1.0.tar.gz/download. Quick installation version is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_binary.tar.gz/download. Public Library of Science 2016-02-19 /pmc/articles/PMC4760927/ /pubmed/26894997 http://dx.doi.org/10.1371/journal.pcbi.1004772 Text en © 2016 Liu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Liu, Juntao Li, Guojun Chang, Zheng Yu, Ting Liu, Bingqiang McMullen, Rick Chen, Pengyin Huang, Xiuzhen BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data |
title | BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data |
title_full | BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data |
title_fullStr | BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data |
title_full_unstemmed | BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data |
title_short | BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data |
title_sort | binpacker: packing-based de novo transcriptome assembly from rna-seq data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4760927/ https://www.ncbi.nlm.nih.gov/pubmed/26894997 http://dx.doi.org/10.1371/journal.pcbi.1004772 |
work_keys_str_mv | AT liujuntao binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata AT liguojun binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata AT changzheng binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata AT yuting binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata AT liubingqiang binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata AT mcmullenrick binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata AT chenpengyin binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata AT huangxiuzhen binpackerpackingbaseddenovotranscriptomeassemblyfromrnaseqdata |