Cargando…

Modelling BioNano optical data and simulation study of genome map assembly

MOTIVATION: The launch of the BioNano next-generation mapping system has greatly enhanced the performance of physical map construction, thus rapidly expanding the application of optical mapping in genome research. Data biases have profound implications for downstream applications. However, very litt...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Ping, Jing, Xinyun, Ren, Jian, Cao, Han, Hao, Pei, Li, Xuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6247929/
https://www.ncbi.nlm.nih.gov/pubmed/29893801
http://dx.doi.org/10.1093/bioinformatics/bty456
Descripción
Sumario:MOTIVATION: The launch of the BioNano next-generation mapping system has greatly enhanced the performance of physical map construction, thus rapidly expanding the application of optical mapping in genome research. Data biases have profound implications for downstream applications. However, very little is known about the properties and biases of BioNano data, and the very factors that contribute to whole-genome optical map assembly. RESULTS: We generated BioNano molecule data from eight organisms with diverse base compositions. We first characterized the properties/biases of BioNano molecule data, i.e. molecule length distribution, false labelling signal, variation of optical resolution and coverage distribution bias, and their inducing factors such as chimeric molecules, fragile sites and DNA molecule stretching. Second, we developed the BioNano Molecule SIMulator (BMSIM), a novel computer simulation program for optical data. BMSIM, is of great use for future genome mapping projects. Third, we evaluated the experimental variables that impact whole-genome optical map assembly. Specifically, the effects of coverage depth, molecule length, false-positive and false-negative labelling signals, chimeric molecules and nicking enzyme and nick site density were investigated. Our simulation study provides the empirical findings on how to control experimental variables and gauge analytical parameters to maximize benefit and minimize cost on whole-genome optical map assembly. AVAILABILITY AND IMPLEMENTATION: BMSIM is freely available on: https://github.com/pingchen09990102/BMSIM. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.