Cargando…

Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues

The adequacy of substitution matrices to model evolutionary relationships between amino acid sequences can be numerically evaluated by checking the mathematical property of triangle inequality for all triplets of residues. By converting substitution scores into distances, one can verify that a direc...

Descripción completa

Detalles Bibliográficos
Autores principales: Baussand, J., Carbone, A.
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2614183/
https://www.ncbi.nlm.nih.gov/pubmed/19204823
_version_ 1782163218216517632
author Baussand, J.
Carbone, A.
author_facet Baussand, J.
Carbone, A.
author_sort Baussand, J.
collection PubMed
description The adequacy of substitution matrices to model evolutionary relationships between amino acid sequences can be numerically evaluated by checking the mathematical property of triangle inequality for all triplets of residues. By converting substitution scores into distances, one can verify that a direct path between two amino acids is shorter than a path passing through a third amino acid in the amino acid space modeled by the matrix. If the triangle inequality is not verified, the intuition is that the evolutionary signal is not well modeled by the matrix, that the space is locally inconsistent and that the matrix construction was probably based on insufficient biological data. Previous analysis on several substitution matrices revealed that the number of triplets violating the triangle inequality increases with sequence divergence. Here, we compare matrices which are dedicated to the alignment of highly divergent proteins. The triangle inequality is tested on several classical substitution matrices as well as in a pair of “complementary” substitution matrices recording the evolutionary pressures inside and outside hydrophobic blocks in protein sequences. The analysis proves the crucial role of hydrophobic residues in substitution matrices dedicated to the alignment of distantly related proteins.
format Text
id pubmed-2614183
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26141832009-02-09 Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues Baussand, J. Carbone, A. Evol Bioinform Online Original Research The adequacy of substitution matrices to model evolutionary relationships between amino acid sequences can be numerically evaluated by checking the mathematical property of triangle inequality for all triplets of residues. By converting substitution scores into distances, one can verify that a direct path between two amino acids is shorter than a path passing through a third amino acid in the amino acid space modeled by the matrix. If the triangle inequality is not verified, the intuition is that the evolutionary signal is not well modeled by the matrix, that the space is locally inconsistent and that the matrix construction was probably based on insufficient biological data. Previous analysis on several substitution matrices revealed that the number of triplets violating the triangle inequality increases with sequence divergence. Here, we compare matrices which are dedicated to the alignment of highly divergent proteins. The triangle inequality is tested on several classical substitution matrices as well as in a pair of “complementary” substitution matrices recording the evolutionary pressures inside and outside hydrophobic blocks in protein sequences. The analysis proves the crucial role of hydrophobic residues in substitution matrices dedicated to the alignment of distantly related proteins. Libertas Academica 2008-10-09 /pmc/articles/PMC2614183/ /pubmed/19204823 Text en Copyright © 2008 The authors. http://creativecommons.org/licenses/by/3.0 This article is published under the Creative Commons Attribution By licence. For further information go to: http://creativecommons.org/licenses/by/3.0. (http://creativecommons.org/licenses/by/3.0)
spellingShingle Original Research
Baussand, J.
Carbone, A.
Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues
title Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues
title_full Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues
title_fullStr Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues
title_full_unstemmed Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues
title_short Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues
title_sort inconsistent distances in substitution matrices can be avoided by properly handling hydrophobic residues
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2614183/
https://www.ncbi.nlm.nih.gov/pubmed/19204823
work_keys_str_mv AT baussandj inconsistentdistancesinsubstitutionmatricescanbeavoidedbyproperlyhandlinghydrophobicresidues
AT carbonea inconsistentdistancesinsubstitutionmatricescanbeavoidedbyproperlyhandlinghydrophobicresidues