RabbitTClust: enabling fast clustering analysis of millions of bacteria genomes with MinHash sketches
We present RabbitTClust, a fast and memory-efficient genome clustering tool based on sketch-based distance estimation. Our approach enables efficient processing of large-scale datasets by combining dimensionality reduction techniques with streaming and parallelization on modern multi-core platforms....
Main authors: | Xu, Xiaoming; Yin, Zekun; Yan, Lifeng; Zhang, Hao; Xu, Borui; Wei, Yanjie; Niu, Beifang; Schmidt, Bertil; Liu, Weiguo |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10190105/ https://www.ncbi.nlm.nih.gov/pubmed/37198663 http://dx.doi.org/10.1186/s13059-023-02961-6 |
Similar items
- Viral coinfection analysis using a MinHash toolkit
  by: Dawson, Eric T., et al.
  Published: (2019)
- Mash: fast genome and metagenome distance estimation using MinHash
  by: Ondov, Brian D., et al.
  Published: (2016)
- On the transformation of MinHash-based uncorrected distances into proper evolutionary distances for phylogenetic inference
  by: Criscuolo, Alexis
  Published: (2020)
- A hybrid cloud read aligner based on MinHash and kmer voting that preserves privacy
  by: Popic, Victoria, et al.
  Published: (2017)
- Scalable phylogenetic profiling using MinHash uncovers likely eukaryotic sexual reproduction genes
  by: Moi, David, et al.
  Published: (2020)