Cargando…
Nucleotide Archival Format (NAF) enables efficient lossless reference-free compression of DNA sequences
SUMMARY: DNA sequence databases use compression such as gzip to reduce the required storage space and network transmission time. We describe Nucleotide Archival Format (NAF)—a new file format for lossless reference-free compression of FASTA and FASTQ-formatted nucleotide sequences. Nucleotide Archiv...
Autores principales: | Kryukov, Kirill, Ueda, Mahoko Takahashi, Nakagawa, So, Imanishi, Tadashi |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761962/ https://www.ncbi.nlm.nih.gov/pubmed/30799504 http://dx.doi.org/10.1093/bioinformatics/btz144 |
Ejemplares similares
-
Sequence Compression Benchmark (SCB) database—A comprehensive evaluation of reference-free compressors for FASTA-formatted sequences
por: Kryukov, Kirill, et al.
Publicado: (2020) -
Efficient compression of SARS-CoV-2 genome data using Nucleotide Archival Format
por: Kryukov, Kirill, et al.
Publicado: (2022) -
Comprehensive genomic analysis reveals dynamic evolution of endogenous retroviruses that code for retroviral-like protein domains
por: Ueda, Mahoko Takahashi, et al.
Publicado: (2020) -
A SARS-CoV-2 sequence submission tool for the European Nucleotide Archive
por: Roncoroni, Miguel, et al.
Publicado: (2021) -
Crumble: reference free lossy compression of sequence quality values
por: Bonfield, James K, et al.
Publicado: (2019)