Cargando…
Impact of structure space continuity on protein fold classification
Protein structure classification hierarchically clusters domain structures based on structure and/or sequence similarities and plays important roles in the study of protein structure-function relationship and protein evolution. Among many classifications, SCOP and CATH are widely viewed as the gold...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804218/ https://www.ncbi.nlm.nih.gov/pubmed/27006112 http://dx.doi.org/10.1038/srep23263 |
_version_ | 1782422985999646720 |
---|---|
author | Xu, Jinrui Zhang, Jianzhi |
author_facet | Xu, Jinrui Zhang, Jianzhi |
author_sort | Xu, Jinrui |
collection | PubMed |
description | Protein structure classification hierarchically clusters domain structures based on structure and/or sequence similarities and plays important roles in the study of protein structure-function relationship and protein evolution. Among many classifications, SCOP and CATH are widely viewed as the gold standards. Fold classification is of special interest because this is the lowest level of classification that does not depend on protein sequence similarity. The current fold classifications such as those in SCOP and CATH are controversial because they implicitly assume that folds are discrete islands in the structure space, whereas increasing evidence suggests significant similarities among folds and supports a continuous fold space. Although this problem is widely recognized, its impact on fold classification has not been quantitatively evaluated. Here we develop a likelihood method to classify a domain into the existing folds of CATH or SCOP using both query-fold structure similarities and within-fold structure heterogeneities. The new classification differs from the original classification for 3.4–12% of domains, depending on factors such as the structure similarity score and original classification scheme used. Because these factors differ for different biological purposes, our results indicate that the importance of considering structure space continuity in fold classification depends on the specific question asked. |
format | Online Article Text |
id | pubmed-4804218 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-48042182016-03-23 Impact of structure space continuity on protein fold classification Xu, Jinrui Zhang, Jianzhi Sci Rep Article Protein structure classification hierarchically clusters domain structures based on structure and/or sequence similarities and plays important roles in the study of protein structure-function relationship and protein evolution. Among many classifications, SCOP and CATH are widely viewed as the gold standards. Fold classification is of special interest because this is the lowest level of classification that does not depend on protein sequence similarity. The current fold classifications such as those in SCOP and CATH are controversial because they implicitly assume that folds are discrete islands in the structure space, whereas increasing evidence suggests significant similarities among folds and supports a continuous fold space. Although this problem is widely recognized, its impact on fold classification has not been quantitatively evaluated. Here we develop a likelihood method to classify a domain into the existing folds of CATH or SCOP using both query-fold structure similarities and within-fold structure heterogeneities. The new classification differs from the original classification for 3.4–12% of domains, depending on factors such as the structure similarity score and original classification scheme used. Because these factors differ for different biological purposes, our results indicate that the importance of considering structure space continuity in fold classification depends on the specific question asked. Nature Publishing Group 2016-03-23 /pmc/articles/PMC4804218/ /pubmed/27006112 http://dx.doi.org/10.1038/srep23263 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Xu, Jinrui Zhang, Jianzhi Impact of structure space continuity on protein fold classification |
title | Impact of structure space continuity on protein fold classification |
title_full | Impact of structure space continuity on protein fold classification |
title_fullStr | Impact of structure space continuity on protein fold classification |
title_full_unstemmed | Impact of structure space continuity on protein fold classification |
title_short | Impact of structure space continuity on protein fold classification |
title_sort | impact of structure space continuity on protein fold classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804218/ https://www.ncbi.nlm.nih.gov/pubmed/27006112 http://dx.doi.org/10.1038/srep23263 |
work_keys_str_mv | AT xujinrui impactofstructurespacecontinuityonproteinfoldclassification AT zhangjianzhi impactofstructurespacecontinuityonproteinfoldclassification |