Splitting the BLOSUM Score into Numbers of Biological Significance

Mathematical tools developed in the context of Shannon information theory were used to analyze the meaning of the BLOSUM score, which was split into three components termed as the BLOSUM spectrum (or BLOSpectrum). These relate respectively to the sequence convergence (the stochastic similarity of th...

Bibliographic Details
Main Authors: Fabris, Francesco; Sgarro, Andrea; Tossi, Alessandro
Format: Online Article Text
Language: English
Published: Springer 2007
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171334/
https://www.ncbi.nlm.nih.gov/pubmed/18369412
http://dx.doi.org/10.1155/2007/31450
_version_ 1782211739138392064
author Fabris, Francesco
Sgarro, Andrea
Tossi, Alessandro
author_facet Fabris, Francesco
Sgarro, Andrea
Tossi, Alessandro
author_sort Fabris, Francesco
collection PubMed
description Mathematical tools developed in the context of Shannon information theory were used to analyze the meaning of the BLOSUM score, which was split into three components termed as the BLOSUM spectrum (or BLOSpectrum). These relate respectively to the sequence convergence (the stochastic similarity of the two protein sequences), to the background frequency divergence (typicality of the amino acid probability distribution in each sequence), and to the target frequency divergence (compliance of the amino acid variations between the two sequences to the protein model implicit in the BLOCKS database). This treatment sharpens the protein sequence comparison, providing a rationale for the biological significance of the obtained score, and helps to identify weakly related sequences. Moreover, the BLOSpectrum can guide the choice of the most appropriate scoring matrix, tailoring it to the evolutionary divergence associated with the two sequences, or indicate if a compositionally adjusted matrix could perform better.
format Online
Article
Text
id pubmed-3171334
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Springer
record_format MEDLINE/PubMed
spelling pubmed-3171334 2011-09-13 Splitting the BLOSUM Score into Numbers of Biological Significance Fabris, Francesco Sgarro, Andrea Tossi, Alessandro EURASIP J Bioinform Syst Biol Research Article Mathematical tools developed in the context of Shannon information theory were used to analyze the meaning of the BLOSUM score, which was split into three components termed as the BLOSUM spectrum (or BLOSpectrum). These relate respectively to the sequence convergence (the stochastic similarity of the two protein sequences), to the background frequency divergence (typicality of the amino acid probability distribution in each sequence), and to the target frequency divergence (compliance of the amino acid variations between the two sequences to the protein model implicit in the BLOCKS database). This treatment sharpens the protein sequence comparison, providing a rationale for the biological significance of the obtained score, and helps to identify weakly related sequences. Moreover, the BLOSpectrum can guide the choice of the most appropriate scoring matrix, tailoring it to the evolutionary divergence associated with the two sequences, or indicate if a compositionally adjusted matrix could perform better. Springer 2007-06-04 /pmc/articles/PMC3171334/ /pubmed/18369412 http://dx.doi.org/10.1155/2007/31450 Text en Copyright © 2007 Francesco Fabris et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Fabris, Francesco
Sgarro, Andrea
Tossi, Alessandro
Splitting the BLOSUM Score into Numbers of Biological Significance
title Splitting the BLOSUM Score into Numbers of Biological Significance
title_full Splitting the BLOSUM Score into Numbers of Biological Significance
title_fullStr Splitting the BLOSUM Score into Numbers of Biological Significance
title_full_unstemmed Splitting the BLOSUM Score into Numbers of Biological Significance
title_short Splitting the BLOSUM Score into Numbers of Biological Significance
title_sort splitting the blosum score into numbers of biological significance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171334/
https://www.ncbi.nlm.nih.gov/pubmed/18369412
http://dx.doi.org/10.1155/2007/31450
work_keys_str_mv AT fabrisfrancesco splittingtheblosumscoreintonumbersofbiologicalsignificance
AT sgarroandrea splittingtheblosumscoreintonumbersofbiologicalsignificance
AT tossialessandro splittingtheblosumscoreintonumbersofbiologicalsignificance