Cargando…

Protein structure database search and evolutionary classification

As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have develope...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Jinn-Moon, Tung, Chi-Hua
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1540718/
https://www.ncbi.nlm.nih.gov/pubmed/16885238
http://dx.doi.org/10.1093/nar/gkl395
_version_ 1782129176392761344
author Yang, Jinn-Moon
Tung, Chi-Hua
author_facet Yang, Jinn-Moon
Tung, Chi-Hua
author_sort Yang, Jinn-Moon
collection PubMed
description As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at ]
format Text
id pubmed-1540718
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-15407182006-08-24 Protein structure database search and evolutionary classification Yang, Jinn-Moon Tung, Chi-Hua Nucleic Acids Res Article As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at ] Oxford University Press 2006 2006-08-02 /pmc/articles/PMC1540718/ /pubmed/16885238 http://dx.doi.org/10.1093/nar/gkl395 Text en © 2006 The Author(s)
spellingShingle Article
Yang, Jinn-Moon
Tung, Chi-Hua
Protein structure database search and evolutionary classification
title Protein structure database search and evolutionary classification
title_full Protein structure database search and evolutionary classification
title_fullStr Protein structure database search and evolutionary classification
title_full_unstemmed Protein structure database search and evolutionary classification
title_short Protein structure database search and evolutionary classification
title_sort protein structure database search and evolutionary classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1540718/
https://www.ncbi.nlm.nih.gov/pubmed/16885238
http://dx.doi.org/10.1093/nar/gkl395
work_keys_str_mv AT yangjinnmoon proteinstructuredatabasesearchandevolutionaryclassification
AT tungchihua proteinstructuredatabasesearchandevolutionaryclassification