Data structures based on k-mers for querying large collections of sequencing data sets
High-throughput sequencing data sets are usually deposited in public repositories (e.g., the European Nucleotide Archive) to ensure reproducibility. As the amount of data has reached petabyte scale, repositories do not allow one to perform online sequence searches, yet, such a feature would be highl...
Main Authors: | Marchet, Camille; Boucher, Christina; Puglisi, Simon J.; Medvedev, Paul; Salson, Mikaël; Chikhi, Rayan |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Cold Spring Harbor Laboratory Press, 2021 |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7849385/ https://www.ncbi.nlm.nih.gov/pubmed/33328168 http://dx.doi.org/10.1101/gr.260604.119 |
Similar Items
- REINDEER: efficient indexing of k-mer presence and abundance in sequencing datasets
  by: Marchet, Camille, et al.
  Published: (2020)
- Disk compression of k-mer sets
  by: Rahman, Amatur, et al.
  Published: (2021)
- DE-kupl: exhaustive capture of biological variation in RNA-seq data through k-mer decomposition
  by: Audoux, Jérôme, et al.
  Published: (2017)
- Querying large read collections in main memory: a versatile data structure
  by: Philippe, Nicolas, et al.
  Published: (2011)
- The K-mer File Format: a standardized and compact disk representation of sets of k-mers
  by: Dufresne, Yoann, et al.
  Published: (2022)