Cargando…

genozip: a fast and efficient compression tool for VCF files

MOTIVATION: genozip is a new lossless compression tool for Variant Call Format (VCF) files. By applying field-specific algorithms and fully utilizing the available computational hardware, genozip achieves the highest compression ratios amongst existing lossless compression tools known to the authors...

Descripción completa

Detalles Bibliográficos
Autores principales: Lan, Divon, Tobler, Raymond, Souilmi, Yassine, Llamas, Bastien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7332572/
https://www.ncbi.nlm.nih.gov/pubmed/32407471
http://dx.doi.org/10.1093/bioinformatics/btaa290
Descripción
Sumario:MOTIVATION: genozip is a new lossless compression tool for Variant Call Format (VCF) files. By applying field-specific algorithms and fully utilizing the available computational hardware, genozip achieves the highest compression ratios amongst existing lossless compression tools known to the authors, at speeds comparable with the fastest multi-threaded compressors. AVAILABILITY AND IMPLEMENTATION: genozip is freely available to non-commercial users. It can be installed via conda-forge, Docker Hub, or downloaded from github.com/divonlan/genozip. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.