Cargando…
Optimizing amino acid substitution matrices with a local alignment kernel
BACKGROUND: Detecting remote homologies by direct comparison of protein sequences remains a challenging task. We had previously developed a similarity score between sequences, called a local alignment kernel, that exhibits good performance for this task in combination with a support vector machine....
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1513605/ https://www.ncbi.nlm.nih.gov/pubmed/16677385 http://dx.doi.org/10.1186/1471-2105-7-246 |
_version_ | 1782128513798635520 |
---|---|
author | Saigo, Hiroto Vert, Jean-Philippe Akutsu, Tatsuya |
author_facet | Saigo, Hiroto Vert, Jean-Philippe Akutsu, Tatsuya |
author_sort | Saigo, Hiroto |
collection | PubMed |
description | BACKGROUND: Detecting remote homologies by direct comparison of protein sequences remains a challenging task. We had previously developed a similarity score between sequences, called a local alignment kernel, that exhibits good performance for this task in combination with a support vector machine. The local alignment kernel depends on an amino acid substitution matrix. Since commonly used BLOSUM or PAM matrices for scoring amino acid matches have been optimized to be used in combination with the Smith-Waterman algorithm, the matrices optimal for the local alignment kernel can be different. RESULTS: Contrary to the local alignment score computed by the Smith-Waterman algorithm, the local alignment kernel is differentiable with respect to the amino acid substitution and its derivative can be computed efficiently by dynamic programming. We optimized the substitution matrix by classical gradient descent by setting an objective function that measures how well the local alignment kernel discriminates homologs from non-homologs in the COG database. The local alignment kernel exhibits better performance when it uses the matrices and gap parameters optimized by this procedure than when it uses the matrices optimized for the Smith-Waterman algorithm. Furthermore, the matrices and gap parameters optimized for the local alignment kernel can also be used successfully by the Smith-Waterman algorithm. CONCLUSION: This optimization procedure leads to useful substitution matrices, both for the local alignment kernel and the Smith-Waterman algorithm. The best performance for homology detection is obtained by the local alignment kernel. |
format | Text |
id | pubmed-1513605 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15136052006-07-24 Optimizing amino acid substitution matrices with a local alignment kernel Saigo, Hiroto Vert, Jean-Philippe Akutsu, Tatsuya BMC Bioinformatics Methodology Article BACKGROUND: Detecting remote homologies by direct comparison of protein sequences remains a challenging task. We had previously developed a similarity score between sequences, called a local alignment kernel, that exhibits good performance for this task in combination with a support vector machine. The local alignment kernel depends on an amino acid substitution matrix. Since commonly used BLOSUM or PAM matrices for scoring amino acid matches have been optimized to be used in combination with the Smith-Waterman algorithm, the matrices optimal for the local alignment kernel can be different. RESULTS: Contrary to the local alignment score computed by the Smith-Waterman algorithm, the local alignment kernel is differentiable with respect to the amino acid substitution and its derivative can be computed efficiently by dynamic programming. We optimized the substitution matrix by classical gradient descent by setting an objective function that measures how well the local alignment kernel discriminates homologs from non-homologs in the COG database. The local alignment kernel exhibits better performance when it uses the matrices and gap parameters optimized by this procedure than when it uses the matrices optimized for the Smith-Waterman algorithm. Furthermore, the matrices and gap parameters optimized for the local alignment kernel can also be used successfully by the Smith-Waterman algorithm. CONCLUSION: This optimization procedure leads to useful substitution matrices, both for the local alignment kernel and the Smith-Waterman algorithm. The best performance for homology detection is obtained by the local alignment kernel. BioMed Central 2006-05-05 /pmc/articles/PMC1513605/ /pubmed/16677385 http://dx.doi.org/10.1186/1471-2105-7-246 Text en Copyright © 2006 Saigo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Saigo, Hiroto Vert, Jean-Philippe Akutsu, Tatsuya Optimizing amino acid substitution matrices with a local alignment kernel |
title | Optimizing amino acid substitution matrices with a local alignment kernel |
title_full | Optimizing amino acid substitution matrices with a local alignment kernel |
title_fullStr | Optimizing amino acid substitution matrices with a local alignment kernel |
title_full_unstemmed | Optimizing amino acid substitution matrices with a local alignment kernel |
title_short | Optimizing amino acid substitution matrices with a local alignment kernel |
title_sort | optimizing amino acid substitution matrices with a local alignment kernel |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1513605/ https://www.ncbi.nlm.nih.gov/pubmed/16677385 http://dx.doi.org/10.1186/1471-2105-7-246 |
work_keys_str_mv | AT saigohiroto optimizingaminoacidsubstitutionmatriceswithalocalalignmentkernel AT vertjeanphilippe optimizingaminoacidsubstitutionmatriceswithalocalalignmentkernel AT akutsutatsuya optimizingaminoacidsubstitutionmatriceswithalocalalignmentkernel |