Cargando…

Impact of structure space continuity on protein fold classification

Protein structure classification hierarchically clusters domain structures based on structure and/or sequence similarities and plays important roles in the study of protein structure-function relationship and protein evolution. Among many classifications, SCOP and CATH are widely viewed as the gold...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Jinrui, Zhang, Jianzhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804218/
https://www.ncbi.nlm.nih.gov/pubmed/27006112
http://dx.doi.org/10.1038/srep23263
_version_ 1782422985999646720
author Xu, Jinrui
Zhang, Jianzhi
author_facet Xu, Jinrui
Zhang, Jianzhi
author_sort Xu, Jinrui
collection PubMed
description Protein structure classification hierarchically clusters domain structures based on structure and/or sequence similarities and plays important roles in the study of protein structure-function relationship and protein evolution. Among many classifications, SCOP and CATH are widely viewed as the gold standards. Fold classification is of special interest because this is the lowest level of classification that does not depend on protein sequence similarity. The current fold classifications such as those in SCOP and CATH are controversial because they implicitly assume that folds are discrete islands in the structure space, whereas increasing evidence suggests significant similarities among folds and supports a continuous fold space. Although this problem is widely recognized, its impact on fold classification has not been quantitatively evaluated. Here we develop a likelihood method to classify a domain into the existing folds of CATH or SCOP using both query-fold structure similarities and within-fold structure heterogeneities. The new classification differs from the original classification for 3.4–12% of domains, depending on factors such as the structure similarity score and original classification scheme used. Because these factors differ for different biological purposes, our results indicate that the importance of considering structure space continuity in fold classification depends on the specific question asked.
format Online
Article
Text
id pubmed-4804218
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-48042182016-03-23 Impact of structure space continuity on protein fold classification Xu, Jinrui Zhang, Jianzhi Sci Rep Article Protein structure classification hierarchically clusters domain structures based on structure and/or sequence similarities and plays important roles in the study of protein structure-function relationship and protein evolution. Among many classifications, SCOP and CATH are widely viewed as the gold standards. Fold classification is of special interest because this is the lowest level of classification that does not depend on protein sequence similarity. The current fold classifications such as those in SCOP and CATH are controversial because they implicitly assume that folds are discrete islands in the structure space, whereas increasing evidence suggests significant similarities among folds and supports a continuous fold space. Although this problem is widely recognized, its impact on fold classification has not been quantitatively evaluated. Here we develop a likelihood method to classify a domain into the existing folds of CATH or SCOP using both query-fold structure similarities and within-fold structure heterogeneities. The new classification differs from the original classification for 3.4–12% of domains, depending on factors such as the structure similarity score and original classification scheme used. Because these factors differ for different biological purposes, our results indicate that the importance of considering structure space continuity in fold classification depends on the specific question asked. Nature Publishing Group 2016-03-23 /pmc/articles/PMC4804218/ /pubmed/27006112 http://dx.doi.org/10.1038/srep23263 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Xu, Jinrui
Zhang, Jianzhi
Impact of structure space continuity on protein fold classification
title Impact of structure space continuity on protein fold classification
title_full Impact of structure space continuity on protein fold classification
title_fullStr Impact of structure space continuity on protein fold classification
title_full_unstemmed Impact of structure space continuity on protein fold classification
title_short Impact of structure space continuity on protein fold classification
title_sort impact of structure space continuity on protein fold classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804218/
https://www.ncbi.nlm.nih.gov/pubmed/27006112
http://dx.doi.org/10.1038/srep23263
work_keys_str_mv AT xujinrui impactofstructurespacecontinuityonproteinfoldclassification
AT zhangjianzhi impactofstructurespacecontinuityonproteinfoldclassification