Cargando…
ASH structure alignment package: Sensitivity and selectivity in domain classification
BACKGROUND: Structure alignment methods offer the possibility of measuring distant evolutionary relationships between proteins that are not visible by sequence-based analysis. However, the question of how structural differences and similarities ought to be quantified in this regard remains open. In...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1955748/ https://www.ncbi.nlm.nih.gov/pubmed/17407606 http://dx.doi.org/10.1186/1471-2105-8-116 |
_version_ | 1782134615296704512 |
---|---|
author | Standley, Daron M Toh, Hiroyuki Nakamura, Haruki |
author_facet | Standley, Daron M Toh, Hiroyuki Nakamura, Haruki |
author_sort | Standley, Daron M |
collection | PubMed |
description | BACKGROUND: Structure alignment methods offer the possibility of measuring distant evolutionary relationships between proteins that are not visible by sequence-based analysis. However, the question of how structural differences and similarities ought to be quantified in this regard remains open. In this study we construct a training set of sequence-unique CATH and SCOP domains, from which we develop a scoring function that can reliably identify domains with the same CATH topology and SCOP fold classification. The score is implemented in the ASH structure alignment package, for which the source code and a web service are freely available from the PDBj website . RESULTS: The new ASH score shows increased selectivity and sensitivity compared with values reported for several popular programs using the same test set of 4,298,905 structure pairs, yielding an area of .96 under the receiver operating characteristic (ROC) curve. In addition, weak sequence homologies between similar domains are revealed that could not be detected by BLAST sequence alignment. Also, a subset of domain pairs is identified that exhibit high similarity, even though their CATH and SCOP classification differs. Finally, we show that the ranking of alignment programs based solely on geometric measures depends on the choice of the quality measure. CONCLUSION: ASH shows high selectivity and sensitivity with regard to domain classification, an important step in defining distantly related protein sequence families. Moreover, the CPU cost per alignment is competitive with the fastest programs, making ASH a practical option for large-scale structure classification studies. |
format | Text |
id | pubmed-1955748 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-19557482007-08-30 ASH structure alignment package: Sensitivity and selectivity in domain classification Standley, Daron M Toh, Hiroyuki Nakamura, Haruki BMC Bioinformatics Software BACKGROUND: Structure alignment methods offer the possibility of measuring distant evolutionary relationships between proteins that are not visible by sequence-based analysis. However, the question of how structural differences and similarities ought to be quantified in this regard remains open. In this study we construct a training set of sequence-unique CATH and SCOP domains, from which we develop a scoring function that can reliably identify domains with the same CATH topology and SCOP fold classification. The score is implemented in the ASH structure alignment package, for which the source code and a web service are freely available from the PDBj website . RESULTS: The new ASH score shows increased selectivity and sensitivity compared with values reported for several popular programs using the same test set of 4,298,905 structure pairs, yielding an area of .96 under the receiver operating characteristic (ROC) curve. In addition, weak sequence homologies between similar domains are revealed that could not be detected by BLAST sequence alignment. Also, a subset of domain pairs is identified that exhibit high similarity, even though their CATH and SCOP classification differs. Finally, we show that the ranking of alignment programs based solely on geometric measures depends on the choice of the quality measure. CONCLUSION: ASH shows high selectivity and sensitivity with regard to domain classification, an important step in defining distantly related protein sequence families. Moreover, the CPU cost per alignment is competitive with the fastest programs, making ASH a practical option for large-scale structure classification studies. BioMed Central 2007-04-04 /pmc/articles/PMC1955748/ /pubmed/17407606 http://dx.doi.org/10.1186/1471-2105-8-116 Text en Copyright © 2007 Standley et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Standley, Daron M Toh, Hiroyuki Nakamura, Haruki ASH structure alignment package: Sensitivity and selectivity in domain classification |
title | ASH structure alignment package: Sensitivity and selectivity in domain classification |
title_full | ASH structure alignment package: Sensitivity and selectivity in domain classification |
title_fullStr | ASH structure alignment package: Sensitivity and selectivity in domain classification |
title_full_unstemmed | ASH structure alignment package: Sensitivity and selectivity in domain classification |
title_short | ASH structure alignment package: Sensitivity and selectivity in domain classification |
title_sort | ash structure alignment package: sensitivity and selectivity in domain classification |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1955748/ https://www.ncbi.nlm.nih.gov/pubmed/17407606 http://dx.doi.org/10.1186/1471-2105-8-116 |
work_keys_str_mv | AT standleydaronm ashstructurealignmentpackagesensitivityandselectivityindomainclassification AT tohhiroyuki ashstructurealignmentpackagesensitivityandselectivityindomainclassification AT nakamuraharuki ashstructurealignmentpackagesensitivityandselectivityindomainclassification |