LRSim: A Linked-Reads Simulator Generating Insights for Better Genome Partitioning

Bibliographic Details
Main authors: Luo, Ruibang; Sedlazeck, Fritz J.; Darby, Charlotte A.; Kelly, Stephen M.; Schatz, Michael C.
Format: Online Article Text
Language: English
Published: Research Network of Computational and Structural Biotechnology, 2017
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5711661/
https://www.ncbi.nlm.nih.gov/pubmed/29213995
http://dx.doi.org/10.1016/j.csbj.2017.10.002
Description
Summary: Linked-read sequencing, using highly multiplexed genome partitioning and barcoding, can span hundreds of kilobases to improve de novo assembly, haplotype phasing, and other applications. Based on our analysis of 14 datasets, we introduce LRSim, which simulates linked-reads by emulating the library preparation and sequencing process with fine control over variants, linked-read characteristics, and the short-read profile. From the phasing and assembly of multiple datasets, we derive recommendations on coverage, fragment length, and partitioning when sequencing genomes of different sizes and complexities. These optimizations improve results by orders of magnitude and enable the development of novel methods. LRSim is available at https://github.com/aquaskyline/LRSIM.