Cargando…

Optimizing amino acid substitution matrices with a local alignment kernel

BACKGROUND: Detecting remote homologies by direct comparison of protein sequences remains a challenging task. We had previously developed a similarity score between sequences, called a local alignment kernel, that exhibits good performance for this task in combination with a support vector machine....

Descripción completa

Detalles Bibliográficos
Autores principales: Saigo, Hiroto, Vert, Jean-Philippe, Akutsu, Tatsuya
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1513605/
https://www.ncbi.nlm.nih.gov/pubmed/16677385
http://dx.doi.org/10.1186/1471-2105-7-246
_version_ 1782128513798635520
author Saigo, Hiroto
Vert, Jean-Philippe
Akutsu, Tatsuya
author_facet Saigo, Hiroto
Vert, Jean-Philippe
Akutsu, Tatsuya
author_sort Saigo, Hiroto
collection PubMed
description BACKGROUND: Detecting remote homologies by direct comparison of protein sequences remains a challenging task. We had previously developed a similarity score between sequences, called a local alignment kernel, that exhibits good performance for this task in combination with a support vector machine. The local alignment kernel depends on an amino acid substitution matrix. Since commonly used BLOSUM or PAM matrices for scoring amino acid matches have been optimized to be used in combination with the Smith-Waterman algorithm, the matrices optimal for the local alignment kernel can be different. RESULTS: Contrary to the local alignment score computed by the Smith-Waterman algorithm, the local alignment kernel is differentiable with respect to the amino acid substitution and its derivative can be computed efficiently by dynamic programming. We optimized the substitution matrix by classical gradient descent by setting an objective function that measures how well the local alignment kernel discriminates homologs from non-homologs in the COG database. The local alignment kernel exhibits better performance when it uses the matrices and gap parameters optimized by this procedure than when it uses the matrices optimized for the Smith-Waterman algorithm. Furthermore, the matrices and gap parameters optimized for the local alignment kernel can also be used successfully by the Smith-Waterman algorithm. CONCLUSION: This optimization procedure leads to useful substitution matrices, both for the local alignment kernel and the Smith-Waterman algorithm. The best performance for homology detection is obtained by the local alignment kernel.
format Text
id pubmed-1513605
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15136052006-07-24 Optimizing amino acid substitution matrices with a local alignment kernel Saigo, Hiroto Vert, Jean-Philippe Akutsu, Tatsuya BMC Bioinformatics Methodology Article BACKGROUND: Detecting remote homologies by direct comparison of protein sequences remains a challenging task. We had previously developed a similarity score between sequences, called a local alignment kernel, that exhibits good performance for this task in combination with a support vector machine. The local alignment kernel depends on an amino acid substitution matrix. Since commonly used BLOSUM or PAM matrices for scoring amino acid matches have been optimized to be used in combination with the Smith-Waterman algorithm, the matrices optimal for the local alignment kernel can be different. RESULTS: Contrary to the local alignment score computed by the Smith-Waterman algorithm, the local alignment kernel is differentiable with respect to the amino acid substitution and its derivative can be computed efficiently by dynamic programming. We optimized the substitution matrix by classical gradient descent by setting an objective function that measures how well the local alignment kernel discriminates homologs from non-homologs in the COG database. The local alignment kernel exhibits better performance when it uses the matrices and gap parameters optimized by this procedure than when it uses the matrices optimized for the Smith-Waterman algorithm. Furthermore, the matrices and gap parameters optimized for the local alignment kernel can also be used successfully by the Smith-Waterman algorithm. CONCLUSION: This optimization procedure leads to useful substitution matrices, both for the local alignment kernel and the Smith-Waterman algorithm. The best performance for homology detection is obtained by the local alignment kernel. BioMed Central 2006-05-05 /pmc/articles/PMC1513605/ /pubmed/16677385 http://dx.doi.org/10.1186/1471-2105-7-246 Text en Copyright © 2006 Saigo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Saigo, Hiroto
Vert, Jean-Philippe
Akutsu, Tatsuya
Optimizing amino acid substitution matrices with a local alignment kernel
title Optimizing amino acid substitution matrices with a local alignment kernel
title_full Optimizing amino acid substitution matrices with a local alignment kernel
title_fullStr Optimizing amino acid substitution matrices with a local alignment kernel
title_full_unstemmed Optimizing amino acid substitution matrices with a local alignment kernel
title_short Optimizing amino acid substitution matrices with a local alignment kernel
title_sort optimizing amino acid substitution matrices with a local alignment kernel
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1513605/
https://www.ncbi.nlm.nih.gov/pubmed/16677385
http://dx.doi.org/10.1186/1471-2105-7-246
work_keys_str_mv AT saigohiroto optimizingaminoacidsubstitutionmatriceswithalocalalignmentkernel
AT vertjeanphilippe optimizingaminoacidsubstitutionmatriceswithalocalalignmentkernel
AT akutsutatsuya optimizingaminoacidsubstitutionmatriceswithalocalalignmentkernel