Cargando…
RBLOSUM performs better than CorBLOSUM with lesser error per query
OBJECTIVE: BLOSUM matrices serve as standard matrices for many protein sequence alignment programs. BLOSUM matrices have been constructed using BLOCKS version(5.0) with 27,102 BLOCKS, whereas the latest updated version(14.3) has 6,739,916 BLOCKS. We read with interest the research article by Hess et...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5963171/ https://www.ncbi.nlm.nih.gov/pubmed/29784028 http://dx.doi.org/10.1186/s13104-018-3415-5 |
_version_ | 1783325006809268224 |
---|---|
author | Govindarajan, Renganayaki Leela, Biji Christopher Nair, Achuthsankar S. |
author_facet | Govindarajan, Renganayaki Leela, Biji Christopher Nair, Achuthsankar S. |
author_sort | Govindarajan, Renganayaki |
collection | PubMed |
description | OBJECTIVE: BLOSUM matrices serve as standard matrices for many protein sequence alignment programs. BLOSUM matrices have been constructed using BLOCKS version(5.0) with 27,102 BLOCKS, whereas the latest updated version(14.3) has 6,739,916 BLOCKS. We read with interest the research article by Hess et al. (BMC Bioinform 17:189, 2016) on CorBLOSUM, wherein it is argued that an inaccuracy in the BLOSUM code affects the cluster memberships of sequences. They show that replacing the integer based clustering threshold to floating point arguably improves the performances of CorBLOSUM over BLOSUM and RBLOSUM matrices. They compare BLOSUM62(14.3) against RBLOSUM69, with relative entropies of 0.2685 and 0.2662 respectively. The present work attempts to repeat the computation to verify the respective analog matrices. RESULTS: In our attempt to repeat the computation, we observed that the relative entropy of BLOSUM62(14.3) is 0.2360 and BLOSUM50(14.3) is 0.1198. As only matrices of similar entropies can be compared, BLOSUM62 can be compared only with RBLOSUM66 and BLOSUM50 can be compared only with RBLOSUM56. We conducted experiments with Astral data sets, and demonstrated the improved accuracy in the coverage. Our results imply that RBLOSUM performs statistically better than CorBLOSUM and BLOSUM matrices. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13104-018-3415-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5963171 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-59631712018-05-24 RBLOSUM performs better than CorBLOSUM with lesser error per query Govindarajan, Renganayaki Leela, Biji Christopher Nair, Achuthsankar S. BMC Res Notes Research Note OBJECTIVE: BLOSUM matrices serve as standard matrices for many protein sequence alignment programs. BLOSUM matrices have been constructed using BLOCKS version(5.0) with 27,102 BLOCKS, whereas the latest updated version(14.3) has 6,739,916 BLOCKS. We read with interest the research article by Hess et al. (BMC Bioinform 17:189, 2016) on CorBLOSUM, wherein it is argued that an inaccuracy in the BLOSUM code affects the cluster memberships of sequences. They show that replacing the integer based clustering threshold to floating point arguably improves the performances of CorBLOSUM over BLOSUM and RBLOSUM matrices. They compare BLOSUM62(14.3) against RBLOSUM69, with relative entropies of 0.2685 and 0.2662 respectively. The present work attempts to repeat the computation to verify the respective analog matrices. RESULTS: In our attempt to repeat the computation, we observed that the relative entropy of BLOSUM62(14.3) is 0.2360 and BLOSUM50(14.3) is 0.1198. As only matrices of similar entropies can be compared, BLOSUM62 can be compared only with RBLOSUM66 and BLOSUM50 can be compared only with RBLOSUM56. We conducted experiments with Astral data sets, and demonstrated the improved accuracy in the coverage. Our results imply that RBLOSUM performs statistically better than CorBLOSUM and BLOSUM matrices. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13104-018-3415-5) contains supplementary material, which is available to authorized users. BioMed Central 2018-05-21 /pmc/articles/PMC5963171/ /pubmed/29784028 http://dx.doi.org/10.1186/s13104-018-3415-5 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Note Govindarajan, Renganayaki Leela, Biji Christopher Nair, Achuthsankar S. RBLOSUM performs better than CorBLOSUM with lesser error per query |
title | RBLOSUM performs better than CorBLOSUM with lesser error per query |
title_full | RBLOSUM performs better than CorBLOSUM with lesser error per query |
title_fullStr | RBLOSUM performs better than CorBLOSUM with lesser error per query |
title_full_unstemmed | RBLOSUM performs better than CorBLOSUM with lesser error per query |
title_short | RBLOSUM performs better than CorBLOSUM with lesser error per query |
title_sort | rblosum performs better than corblosum with lesser error per query |
topic | Research Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5963171/ https://www.ncbi.nlm.nih.gov/pubmed/29784028 http://dx.doi.org/10.1186/s13104-018-3415-5 |
work_keys_str_mv | AT govindarajanrenganayaki rblosumperformsbetterthancorblosumwithlessererrorperquery AT leelabijichristopher rblosumperformsbetterthancorblosumwithlessererrorperquery AT nairachuthsankars rblosumperformsbetterthancorblosumwithlessererrorperquery |