Cargando…

A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms

BACKGROUND: With increased availability of de novo assembly algorithms, it is feasible to study entire transcriptomes of non-model organisms. While algorithms are available that are specifically designed for performing transcriptome assembly from high-throughput sequencing data, they are very memory...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sze, Sing-Hoi, Pimsler, Meaghan L., Tomberlin, Jeffery K., Jones, Corbin D., Tarone, Aaron M.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2017
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5461550/ https://www.ncbi.nlm.nih.gov/pubmed/28589866 http://dx.doi.org/10.1186/s12864-017-3735-1

Descripción
Sumario:	BACKGROUND: With increased availability of de novo assembly algorithms, it is feasible to study entire transcriptomes of non-model organisms. While algorithms are available that are specifically designed for performing transcriptome assembly from high-throughput sequencing data, they are very memory-intensive, limiting their applications to small data sets with few libraries. RESULTS: We develop a transcriptome assembly algorithm that recovers alternatively spliced isoforms and expression levels while utilizing as many RNA-Seq libraries as possible that contain hundreds of gigabases of data. New techniques are developed so that computations can be performed on a computing cluster with moderate amount of physical memory. CONCLUSIONS: Our strategy minimizes memory consumption while simultaneously obtaining comparable or improved accuracy over existing algorithms. It provides support for incremental updates of assemblies when new libraries become available.

A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms

Ejemplares similares