Cargando…

Exploiting sparseness in de novo genome assembly

BACKGROUND: The very large memory requirements for the construction of assembly graphs for de novo genome assembly limit current algorithms to super-computing environments. METHODS: In this paper, we demonstrate that constructing a sparse assembly graph which stores only a small fraction of the obse...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ye, Chengxi, Ma, Zhanshan Sam, Cannon, Charles H, Pop, Mihai, Yu, Douglas W
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3369186/ https://www.ncbi.nlm.nih.gov/pubmed/22537038 http://dx.doi.org/10.1186/1471-2105-13-S6-S1

Descripción
Sumario:	BACKGROUND: The very large memory requirements for the construction of assembly graphs for de novo genome assembly limit current algorithms to super-computing environments. METHODS: In this paper, we demonstrate that constructing a sparse assembly graph which stores only a small fraction of the observed k-mers as nodes and the links between these nodes allows the de novo assembly of even moderately-sized genomes (~500 M) on a typical laptop computer. RESULTS: We implement this sparse graph concept in a proof-of-principle software package, SparseAssembler, utilizing a new sparse k-mer graph structure evolved from the de Bruijn graph. We test our SparseAssembler with both simulated and real data, achieving ~90% memory savings and retaining high assembly accuracy, without sacrificing speed in comparison to existing de novo assemblers.

Exploiting sparseness in de novo genome assembly

Ejemplares similares