Cargando…

Dynamic use of multiple parameter sets in sequence alignment

The level of conservation between two homologous sequences often varies among sequence regions; functionally important domains are more conserved than the remaining regions. Thus, multiple parameter sets should be used in alignment of homologous sequences with a stringent parameter set for highly co...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Xiaoqiu, Brutlag, Douglas L.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1802605/
https://www.ncbi.nlm.nih.gov/pubmed/17182633
http://dx.doi.org/10.1093/nar/gkl1063
_version_ 1782132391794442240
author Huang, Xiaoqiu
Brutlag, Douglas L.
author_facet Huang, Xiaoqiu
Brutlag, Douglas L.
author_sort Huang, Xiaoqiu
collection PubMed
description The level of conservation between two homologous sequences often varies among sequence regions; functionally important domains are more conserved than the remaining regions. Thus, multiple parameter sets should be used in alignment of homologous sequences with a stringent parameter set for highly conserved regions and a moderate parameter set for weakly conserved regions. We describe an alignment algorithm to allow dynamic use of multiple parameter sets with different levels of stringency in computation of an optimal alignment of two sequences. The algorithm dynamically considers various candidate alignments, partitions each candidate alignment into sections, and determines the most appropriate set of parameter values for each section of the alignment. The algorithm and its local alignment version are implemented in a computer program named GAP4. The local alignment algorithm in GAP4, that in its predecessor GAP3, and an ordinary local alignment program SIM were evaluated on 257 716 pairs of homologous sequences from 100 protein families. On 168 475 of the 257 716 pairs (a rate of 65.4%), alignments from GAP4 were more statistically significant than alignments from GAP3 and SIM.
format Text
id pubmed-1802605
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-18026052007-03-01 Dynamic use of multiple parameter sets in sequence alignment Huang, Xiaoqiu Brutlag, Douglas L. Nucleic Acids Res Computational Biology The level of conservation between two homologous sequences often varies among sequence regions; functionally important domains are more conserved than the remaining regions. Thus, multiple parameter sets should be used in alignment of homologous sequences with a stringent parameter set for highly conserved regions and a moderate parameter set for weakly conserved regions. We describe an alignment algorithm to allow dynamic use of multiple parameter sets with different levels of stringency in computation of an optimal alignment of two sequences. The algorithm dynamically considers various candidate alignments, partitions each candidate alignment into sections, and determines the most appropriate set of parameter values for each section of the alignment. The algorithm and its local alignment version are implemented in a computer program named GAP4. The local alignment algorithm in GAP4, that in its predecessor GAP3, and an ordinary local alignment program SIM were evaluated on 257 716 pairs of homologous sequences from 100 protein families. On 168 475 of the 257 716 pairs (a rate of 65.4%), alignments from GAP4 were more statistically significant than alignments from GAP3 and SIM. Oxford University Press 2007-01 2006-12-19 /pmc/articles/PMC1802605/ /pubmed/17182633 http://dx.doi.org/10.1093/nar/gkl1063 Text en © 2006 The Author(s). This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Huang, Xiaoqiu
Brutlag, Douglas L.
Dynamic use of multiple parameter sets in sequence alignment
title Dynamic use of multiple parameter sets in sequence alignment
title_full Dynamic use of multiple parameter sets in sequence alignment
title_fullStr Dynamic use of multiple parameter sets in sequence alignment
title_full_unstemmed Dynamic use of multiple parameter sets in sequence alignment
title_short Dynamic use of multiple parameter sets in sequence alignment
title_sort dynamic use of multiple parameter sets in sequence alignment
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1802605/
https://www.ncbi.nlm.nih.gov/pubmed/17182633
http://dx.doi.org/10.1093/nar/gkl1063
work_keys_str_mv AT huangxiaoqiu dynamicuseofmultipleparametersetsinsequencealignment
AT brutlagdouglasl dynamicuseofmultipleparametersetsinsequencealignment