Cargando…
Sequence Compression Benchmark (SCB) database—A comprehensive evaluation of reference-free compressors for FASTA-formatted sequences
BACKGROUND: Nearly all molecular sequence databases currently use gzip for data compression. Ongoing rapid accumulation of stored data calls for a more efficient compression tool. Although numerous compressors exist, both specialized and general-purpose, choosing one of them was difficult because no...
Autores principales: | Kryukov, Kirill, Ueda, Mahoko Takahashi, Nakagawa, So, Imanishi, Tadashi |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7336184/ https://www.ncbi.nlm.nih.gov/pubmed/32627830 http://dx.doi.org/10.1093/gigascience/giaa072 |
Ejemplares similares
-
Nucleotide Archival Format (NAF) enables efficient lossless reference-free compression of DNA sequences
por: Kryukov, Kirill, et al.
Publicado: (2019) -
FastaValidator: an open-source Java library to parse and validate FASTA formatted sequences
por: Waldmann, Jost, et al.
Publicado: (2014) -
MFCompress: a compression tool for FASTA and multi-FASTA data
por: Pinho, Armando J., et al.
Publicado: (2014) -
Comprehensive genomic analysis reveals dynamic evolution of endogenous retroviruses that code for retroviral-like protein domains
por: Ueda, Mahoko Takahashi, et al.
Publicado: (2020) -
FASTA/Q data compressors for MapReduce-Hadoop genomics: space and time savings made easy
por: Ferraro Petrillo, Umberto, et al.
Publicado: (2021)