Cargando…
Protein structure database search and evolutionary classification
As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have develope...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1540718/ https://www.ncbi.nlm.nih.gov/pubmed/16885238 http://dx.doi.org/10.1093/nar/gkl395 |
_version_ | 1782129176392761344 |
---|---|
author | Yang, Jinn-Moon Tung, Chi-Hua |
author_facet | Yang, Jinn-Moon Tung, Chi-Hua |
author_sort | Yang, Jinn-Moon |
collection | PubMed |
description | As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at ] |
format | Text |
id | pubmed-1540718 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-15407182006-08-24 Protein structure database search and evolutionary classification Yang, Jinn-Moon Tung, Chi-Hua Nucleic Acids Res Article As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at ] Oxford University Press 2006 2006-08-02 /pmc/articles/PMC1540718/ /pubmed/16885238 http://dx.doi.org/10.1093/nar/gkl395 Text en © 2006 The Author(s) |
spellingShingle | Article Yang, Jinn-Moon Tung, Chi-Hua Protein structure database search and evolutionary classification |
title | Protein structure database search and evolutionary classification |
title_full | Protein structure database search and evolutionary classification |
title_fullStr | Protein structure database search and evolutionary classification |
title_full_unstemmed | Protein structure database search and evolutionary classification |
title_short | Protein structure database search and evolutionary classification |
title_sort | protein structure database search and evolutionary classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1540718/ https://www.ncbi.nlm.nih.gov/pubmed/16885238 http://dx.doi.org/10.1093/nar/gkl395 |
work_keys_str_mv | AT yangjinnmoon proteinstructuredatabasesearchandevolutionaryclassification AT tungchihua proteinstructuredatabasesearchandevolutionaryclassification |