Cargando…

Optimized distributed systems achieve significant performance improvement on sorted merging of massive VCF files

BACKGROUND: Sorted merging of genomic data is a common data operation necessary in many sequencing-based studies. It involves sorting and merging genomic data from different subjects by their genomic locations. In particular, merging a large number of variant call format (VCF) files is frequently re...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Xiaobo, Gao, Jingjing, Jin, Peng, Eng, Celeste, Burchard, Esteban G, Beaty, Terri H, Ruczinski, Ingo, Mathias, Rasika A, Barnes, Kathleen, Wang, Fusheng, Qin, Zhaohui S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6007233/
https://www.ncbi.nlm.nih.gov/pubmed/29762754
http://dx.doi.org/10.1093/gigascience/giy052