Cargando…
Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels
Motivation: High-throughput sequencing has made the analysis of new model organisms more affordable. Although assembling a new genome can still be costly and difficult, it is possible to use RNA-seq to sequence mRNA. In the absence of a known genome, it is necessary to assemble these sequences de no...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3324515/ https://www.ncbi.nlm.nih.gov/pubmed/22368243 http://dx.doi.org/10.1093/bioinformatics/bts094 |
_version_ | 1782229324511838208 |
---|---|
author | Schulz, Marcel H. Zerbino, Daniel R. Vingron, Martin Birney, Ewan |
author_facet | Schulz, Marcel H. Zerbino, Daniel R. Vingron, Martin Birney, Ewan |
author_sort | Schulz, Marcel H. |
collection | PubMed |
description | Motivation: High-throughput sequencing has made the analysis of new model organisms more affordable. Although assembling a new genome can still be costly and difficult, it is possible to use RNA-seq to sequence mRNA. In the absence of a known genome, it is necessary to assemble these sequences de novo, taking into account possible alternative isoforms and the dynamic range of expression values. Results: We present a software package named Oases designed to heuristically assemble RNA-seq reads in the absence of a reference genome, across a broad spectrum of expression values and in presence of alternative isoforms. It achieves this by using an array of hash lengths, a dynamic filtering of noise, a robust resolution of alternative splicing events and the efficient merging of multiple assemblies. It was tested on human and mouse RNA-seq data and is shown to improve significantly on the transABySS and Trinity de novo transcriptome assemblers. Availability and implementation: Oases is freely available under the GPL license at www.ebi.ac.uk/~zerbino/oases/ Contact: dzerbino@ucsc.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3324515 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-33245152012-04-12 Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels Schulz, Marcel H. Zerbino, Daniel R. Vingron, Martin Birney, Ewan Bioinformatics Original Papers Motivation: High-throughput sequencing has made the analysis of new model organisms more affordable. Although assembling a new genome can still be costly and difficult, it is possible to use RNA-seq to sequence mRNA. In the absence of a known genome, it is necessary to assemble these sequences de novo, taking into account possible alternative isoforms and the dynamic range of expression values. Results: We present a software package named Oases designed to heuristically assemble RNA-seq reads in the absence of a reference genome, across a broad spectrum of expression values and in presence of alternative isoforms. It achieves this by using an array of hash lengths, a dynamic filtering of noise, a robust resolution of alternative splicing events and the efficient merging of multiple assemblies. It was tested on human and mouse RNA-seq data and is shown to improve significantly on the transABySS and Trinity de novo transcriptome assemblers. Availability and implementation: Oases is freely available under the GPL license at www.ebi.ac.uk/~zerbino/oases/ Contact: dzerbino@ucsc.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-04-15 2012-02-24 /pmc/articles/PMC3324515/ /pubmed/22368243 http://dx.doi.org/10.1093/bioinformatics/bts094 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Schulz, Marcel H. Zerbino, Daniel R. Vingron, Martin Birney, Ewan Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels |
title | Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels |
title_full | Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels |
title_fullStr | Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels |
title_full_unstemmed | Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels |
title_short | Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels |
title_sort | oases: robust de novo rna-seq assembly across the dynamic range of expression levels |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3324515/ https://www.ncbi.nlm.nih.gov/pubmed/22368243 http://dx.doi.org/10.1093/bioinformatics/bts094 |
work_keys_str_mv | AT schulzmarcelh oasesrobustdenovornaseqassemblyacrossthedynamicrangeofexpressionlevels AT zerbinodanielr oasesrobustdenovornaseqassemblyacrossthedynamicrangeofexpressionlevels AT vingronmartin oasesrobustdenovornaseqassemblyacrossthedynamicrangeofexpressionlevels AT birneyewan oasesrobustdenovornaseqassemblyacrossthedynamicrangeofexpressionlevels |