Alignment-free clustering of large data sets of unannotated protein conserved regions using minhashing
BACKGROUND: Clustering of protein sequences is of key importance in predicting the structure and function of newly sequenced proteins and is also of use for their annotation. With the advent of multiple high-throughput sequencing technologies, new protein sequences are becoming available at an extra...
Main Authors: | Abnousi, Armen; Broschat, Shira L.; Kalyanaraman, Ananth |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2018 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5838936/ https://www.ncbi.nlm.nih.gov/pubmed/29506470 http://dx.doi.org/10.1186/s12859-018-2080-y |
Similar Items
- A Fast Alignment-Free Approach for De Novo Detection of Protein Conserved Regions
  by: Abnousi, Armen, et al.
  Published: (2016)
- A hybrid cloud read aligner based on MinHash and kmer voting that preserves privacy
  by: Popic, Victoria, et al.
  Published: (2017)
- RabbitTClust: enabling fast clustering analysis of millions of bacteria genomes with MinHash sketches
  by: Xu, Xiaoming, et al.
  Published: (2023)
- Viral coinfection analysis using a MinHash toolkit
  by: Dawson, Eric T., et al.
  Published: (2019)
- Mash: fast genome and metagenome distance estimation using MinHash
  by: Ondov, Brian D., et al.
  Published: (2016)