Cargando…

Using Structure to Explore the Sequence Alignment Space of Remote Homologs

Protein structure modeling by homology requires an accurate sequence alignment between the query protein and its structural template. However, sequence alignment methods based on dynamic programming (DP) are typically unable to generate accurate alignments for remote sequence homologs, thus limiting...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuziemko, Andrew, Honig, Barry, Petrey, Donald
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3188491/
https://www.ncbi.nlm.nih.gov/pubmed/21998567
http://dx.doi.org/10.1371/journal.pcbi.1002175
_version_ 1782213379716284416
author Kuziemko, Andrew
Honig, Barry
Petrey, Donald
author_facet Kuziemko, Andrew
Honig, Barry
Petrey, Donald
author_sort Kuziemko, Andrew
collection PubMed
description Protein structure modeling by homology requires an accurate sequence alignment between the query protein and its structural template. However, sequence alignment methods based on dynamic programming (DP) are typically unable to generate accurate alignments for remote sequence homologs, thus limiting the applicability of modeling methods. A central problem is that the alignment that is “optimal” in terms of the DP score does not necessarily correspond to the alignment that produces the most accurate structural model. That is, the correct alignment based on structural superposition will generally have a lower score than the optimal alignment obtained from sequence. Variations of the DP algorithm have been developed that generate alternative alignments that are “suboptimal” in terms of the DP score, but these still encounter difficulties in detecting the correct structural alignment. We present here a new alternative sequence alignment method that relies heavily on the structure of the template. By initially aligning the query sequence to individual fragments in secondary structure elements and combining high-scoring fragments that pass basic tests for “modelability”, we can generate accurate alignments within a small ensemble. Our results suggest that the set of sequences that can currently be modeled by homology can be greatly extended.
format Online
Article
Text
id pubmed-3188491
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31884912011-10-13 Using Structure to Explore the Sequence Alignment Space of Remote Homologs Kuziemko, Andrew Honig, Barry Petrey, Donald PLoS Comput Biol Research Article Protein structure modeling by homology requires an accurate sequence alignment between the query protein and its structural template. However, sequence alignment methods based on dynamic programming (DP) are typically unable to generate accurate alignments for remote sequence homologs, thus limiting the applicability of modeling methods. A central problem is that the alignment that is “optimal” in terms of the DP score does not necessarily correspond to the alignment that produces the most accurate structural model. That is, the correct alignment based on structural superposition will generally have a lower score than the optimal alignment obtained from sequence. Variations of the DP algorithm have been developed that generate alternative alignments that are “suboptimal” in terms of the DP score, but these still encounter difficulties in detecting the correct structural alignment. We present here a new alternative sequence alignment method that relies heavily on the structure of the template. By initially aligning the query sequence to individual fragments in secondary structure elements and combining high-scoring fragments that pass basic tests for “modelability”, we can generate accurate alignments within a small ensemble. Our results suggest that the set of sequences that can currently be modeled by homology can be greatly extended. Public Library of Science 2011-10-06 /pmc/articles/PMC3188491/ /pubmed/21998567 http://dx.doi.org/10.1371/journal.pcbi.1002175 Text en Kuziemko et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kuziemko, Andrew
Honig, Barry
Petrey, Donald
Using Structure to Explore the Sequence Alignment Space of Remote Homologs
title Using Structure to Explore the Sequence Alignment Space of Remote Homologs
title_full Using Structure to Explore the Sequence Alignment Space of Remote Homologs
title_fullStr Using Structure to Explore the Sequence Alignment Space of Remote Homologs
title_full_unstemmed Using Structure to Explore the Sequence Alignment Space of Remote Homologs
title_short Using Structure to Explore the Sequence Alignment Space of Remote Homologs
title_sort using structure to explore the sequence alignment space of remote homologs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3188491/
https://www.ncbi.nlm.nih.gov/pubmed/21998567
http://dx.doi.org/10.1371/journal.pcbi.1002175
work_keys_str_mv AT kuziemkoandrew usingstructuretoexplorethesequencealignmentspaceofremotehomologs
AT honigbarry usingstructuretoexplorethesequencealignmentspaceofremotehomologs
AT petreydonald usingstructuretoexplorethesequencealignmentspaceofremotehomologs