Cargando…
Complet+: a computationally scalable method to improve completeness of large-scale protein sequence clustering
A major challenge for clustering algorithms is to balance the trade-off between homogeneity, i.e., the degree to which an individual cluster includes only related sequences, and completeness, the degree to which related sequences are broken up into multiple clusters. Most algorithms are conservative...
Autores principales: | Nguyen, Rachel, Sokhansanj, Bahrad A., Polikar, Robi, Rosen, Gail L. |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9921987/ https://www.ncbi.nlm.nih.gov/pubmed/36785708 http://dx.doi.org/10.7717/peerj.14779 |
Ejemplares similares
-
Metagenome Fragment Classification Using N-Mer Frequency Profiles
por: Rosen, Gail, et al.
Publicado: (2008) -
Discovering the Unknown: Improving Detection of Novel Species and Genera from Short Reads
por: Rosen, Gail L., et al.
Publicado: (2011) -
Signal Processing for Metagenomics: Extracting Information from the Soup
por: Rosen, Gail L., et al.
Publicado: (2009) -
Mapping Data to Deep Understanding: Making the Most of the Deluge of SARS-CoV-2 Genome Sequences
por: Sokhansanj, Bahrad A., et al.
Publicado: (2022) -
Predicting COVID-19 disease severity from SARS-CoV-2 spike protein sequence by mixed effects machine learning
por: Sokhansanj, Bahrad A., et al.
Publicado: (2022)