Cargando…

RBLOSUM performs better than CorBLOSUM with lesser error per query

OBJECTIVE: BLOSUM matrices serve as standard matrices for many protein sequence alignment programs. BLOSUM matrices have been constructed using BLOCKS version(5.0) with 27,102 BLOCKS, whereas the latest updated version(14.3) has 6,739,916 BLOCKS. We read with interest the research article by Hess et...

Descripción completa

Detalles Bibliográficos
Autores principales: Govindarajan, Renganayaki, Leela, Biji Christopher, Nair, Achuthsankar S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5963171/
https://www.ncbi.nlm.nih.gov/pubmed/29784028
http://dx.doi.org/10.1186/s13104-018-3415-5
Descripción
Sumario:OBJECTIVE: BLOSUM matrices serve as standard matrices for many protein sequence alignment programs. BLOSUM matrices have been constructed using BLOCKS version(5.0) with 27,102 BLOCKS, whereas the latest updated version(14.3) has 6,739,916 BLOCKS. We read with interest the research article by Hess et al. (BMC Bioinform 17:189, 2016) on CorBLOSUM, wherein it is argued that an inaccuracy in the BLOSUM code affects the cluster memberships of sequences. They show that replacing the integer based clustering threshold to floating point arguably improves the performances of CorBLOSUM over BLOSUM and RBLOSUM matrices. They compare BLOSUM62(14.3) against RBLOSUM69, with relative entropies of 0.2685 and 0.2662 respectively. The present work attempts to repeat the computation to verify the respective analog matrices. RESULTS: In our attempt to repeat the computation, we observed that the relative entropy of BLOSUM62(14.3) is 0.2360 and BLOSUM50(14.3) is 0.1198. As only matrices of similar entropies can be compared, BLOSUM62 can be compared only with RBLOSUM66 and BLOSUM50 can be compared only with RBLOSUM56. We conducted experiments with Astral data sets, and demonstrated the improved accuracy in the coverage. Our results imply that RBLOSUM performs statistically better than CorBLOSUM and BLOSUM matrices. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13104-018-3415-5) contains supplementary material, which is available to authorized users.