Cargando…

Effective inter-residue contact definitions for accurate protein fold recognition

BACKGROUND: Effective encoding of residue contact information is crucial for protein structure prediction since it has a unique role to capture long-range residue interactions compared to other commonly used scoring terms. The residue contact information can be incorporated in structure prediction i...

Descripción completa

Detalles Bibliográficos
Autores principales: Yuan, Chao, Chen, Hao, Kihara, Daisuke
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534397/
https://www.ncbi.nlm.nih.gov/pubmed/23140471
http://dx.doi.org/10.1186/1471-2105-13-292
_version_ 1782475330730065920
author Yuan, Chao
Chen, Hao
Kihara, Daisuke
author_facet Yuan, Chao
Chen, Hao
Kihara, Daisuke
author_sort Yuan, Chao
collection PubMed
description BACKGROUND: Effective encoding of residue contact information is crucial for protein structure prediction since it has a unique role to capture long-range residue interactions compared to other commonly used scoring terms. The residue contact information can be incorporated in structure prediction in several different ways: It can be incorporated as statistical potentials or it can be also used as constraints in ab initio structure prediction. To seek the most effective definition of residue contacts for template-based protein structure prediction, we evaluated 45 different contact definitions, varying bases of contacts and distance cutoffs, in terms of their ability to identify proteins of the same fold. RESULTS: We found that overall the residue contact pattern can distinguish protein folds best when contacts are defined for residue pairs whose Cβ atoms are at 7.0 Å or closer to each other. Lower fold recognition accuracy was observed when inaccurate threading alignments were used to identify common residue contacts between protein pairs. In the case of threading, alignment accuracy strongly influences the fraction of common contacts identified among proteins of the same fold, which eventually affects the fold recognition accuracy. The largest deterioration of the fold recognition was observed for β-class proteins when the threading methods were used because the average alignment accuracy was worst for this fold class. When results of fold recognition were examined for individual proteins, we found that the effective contact definition depends on the fold of the proteins. A larger distance cutoff is often advantageous for capturing spatial arrangement of the secondary structures which are not physically in contact. For capturing contacts between neighboring β strands, considering the distance between Cα atoms is better than the Cβ−based distance because the side-chain of interacting residues on β strands sometimes point to opposite directions. CONCLUSION: Residue contacts defined by Cβ−Cβ distance of 7.0 Å work best overall among tested to identify proteins of the same fold. We also found that effective contact definitions differ from fold to fold, suggesting that using different residue contact definition specific for each template will lead to improvement of the performance of threading.
format Online
Article
Text
id pubmed-3534397
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35343972013-01-03 Effective inter-residue contact definitions for accurate protein fold recognition Yuan, Chao Chen, Hao Kihara, Daisuke BMC Bioinformatics Research Article BACKGROUND: Effective encoding of residue contact information is crucial for protein structure prediction since it has a unique role to capture long-range residue interactions compared to other commonly used scoring terms. The residue contact information can be incorporated in structure prediction in several different ways: It can be incorporated as statistical potentials or it can be also used as constraints in ab initio structure prediction. To seek the most effective definition of residue contacts for template-based protein structure prediction, we evaluated 45 different contact definitions, varying bases of contacts and distance cutoffs, in terms of their ability to identify proteins of the same fold. RESULTS: We found that overall the residue contact pattern can distinguish protein folds best when contacts are defined for residue pairs whose Cβ atoms are at 7.0 Å or closer to each other. Lower fold recognition accuracy was observed when inaccurate threading alignments were used to identify common residue contacts between protein pairs. In the case of threading, alignment accuracy strongly influences the fraction of common contacts identified among proteins of the same fold, which eventually affects the fold recognition accuracy. The largest deterioration of the fold recognition was observed for β-class proteins when the threading methods were used because the average alignment accuracy was worst for this fold class. When results of fold recognition were examined for individual proteins, we found that the effective contact definition depends on the fold of the proteins. A larger distance cutoff is often advantageous for capturing spatial arrangement of the secondary structures which are not physically in contact. For capturing contacts between neighboring β strands, considering the distance between Cα atoms is better than the Cβ−based distance because the side-chain of interacting residues on β strands sometimes point to opposite directions. CONCLUSION: Residue contacts defined by Cβ−Cβ distance of 7.0 Å work best overall among tested to identify proteins of the same fold. We also found that effective contact definitions differ from fold to fold, suggesting that using different residue contact definition specific for each template will lead to improvement of the performance of threading. BioMed Central 2012-11-09 /pmc/articles/PMC3534397/ /pubmed/23140471 http://dx.doi.org/10.1186/1471-2105-13-292 Text en Copyright ©2012 Yuan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yuan, Chao
Chen, Hao
Kihara, Daisuke
Effective inter-residue contact definitions for accurate protein fold recognition
title Effective inter-residue contact definitions for accurate protein fold recognition
title_full Effective inter-residue contact definitions for accurate protein fold recognition
title_fullStr Effective inter-residue contact definitions for accurate protein fold recognition
title_full_unstemmed Effective inter-residue contact definitions for accurate protein fold recognition
title_short Effective inter-residue contact definitions for accurate protein fold recognition
title_sort effective inter-residue contact definitions for accurate protein fold recognition
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534397/
https://www.ncbi.nlm.nih.gov/pubmed/23140471
http://dx.doi.org/10.1186/1471-2105-13-292
work_keys_str_mv AT yuanchao effectiveinterresiduecontactdefinitionsforaccurateproteinfoldrecognition
AT chenhao effectiveinterresiduecontactdefinitionsforaccurateproteinfoldrecognition
AT kiharadaisuke effectiveinterresiduecontactdefinitionsforaccurateproteinfoldrecognition