Cargando…
Alignment-free clustering of large data sets of unannotated protein conserved regions using minhashing
BACKGROUND: Clustering of protein sequences is of key importance in predicting the structure and function of newly sequenced proteins and is also of use for their annotation. With the advent of multiple high-throughput sequencing technologies, new protein sequences are becoming available at an extra...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5838936/ https://www.ncbi.nlm.nih.gov/pubmed/29506470 http://dx.doi.org/10.1186/s12859-018-2080-y |