Cargando…

Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts

BACKGROUND: Residue-residue contacts are key features for accurate de novo protein structure prediction. For the optimal utilization of these predicted contacts in folding proteins accurately, it is important to study the challenges of reconstructing protein structures using true contacts. Because c...

Descripción completa

Detalles Bibliográficos
Autores principales: Adhikari, Badri, Cheng, Jianlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5576353/
https://www.ncbi.nlm.nih.gov/pubmed/28851269
http://dx.doi.org/10.1186/s12859-017-1807-5
_version_ 1783260185735725056
author Adhikari, Badri
Cheng, Jianlin
author_facet Adhikari, Badri
Cheng, Jianlin
author_sort Adhikari, Badri
collection PubMed
description BACKGROUND: Residue-residue contacts are key features for accurate de novo protein structure prediction. For the optimal utilization of these predicted contacts in folding proteins accurately, it is important to study the challenges of reconstructing protein structures using true contacts. Because contact-guided protein modeling approach is valuable for predicting the folds of proteins that do not have structural templates, it is necessary for reconstruction studies to focus on hard-to-predict protein structures. RESULTS: Using a data set consisting of 496 structural domains released in recent CASP experiments and a dataset of 150 representative protein structures, in this work, we discuss three techniques to improve the reconstruction accuracy using true contacts – adding secondary structures, increasing contact distance thresholds, and adding non-contacts. We find that reconstruction using secondary structures and contacts can deliver accuracy higher than using full contact maps. Similarly, we demonstrate that non-contacts can improve reconstruction accuracy not only when the used non-contacts are true but also when they are predicted. On the dataset consisting of 150 proteins, we find that by simply using low ranked predicted contacts as non-contacts and adding them as additional restraints, can increase the reconstruction accuracy by 5% when the reconstructed models are evaluated using TM-score. CONCLUSIONS: Our findings suggest that secondary structures are invaluable companions of contacts for accurate reconstruction. Confirming some earlier findings, we also find that larger distance thresholds are useful for folding many protein structures which cannot be folded using the standard definition of contacts. Our findings also suggest that for more accurate reconstruction using predicted contacts it is useful to predict contacts at higher distance thresholds (beyond 8 Å) and predict non-contacts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1807-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5576353
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55763532017-08-31 Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts Adhikari, Badri Cheng, Jianlin BMC Bioinformatics Research Article BACKGROUND: Residue-residue contacts are key features for accurate de novo protein structure prediction. For the optimal utilization of these predicted contacts in folding proteins accurately, it is important to study the challenges of reconstructing protein structures using true contacts. Because contact-guided protein modeling approach is valuable for predicting the folds of proteins that do not have structural templates, it is necessary for reconstruction studies to focus on hard-to-predict protein structures. RESULTS: Using a data set consisting of 496 structural domains released in recent CASP experiments and a dataset of 150 representative protein structures, in this work, we discuss three techniques to improve the reconstruction accuracy using true contacts – adding secondary structures, increasing contact distance thresholds, and adding non-contacts. We find that reconstruction using secondary structures and contacts can deliver accuracy higher than using full contact maps. Similarly, we demonstrate that non-contacts can improve reconstruction accuracy not only when the used non-contacts are true but also when they are predicted. On the dataset consisting of 150 proteins, we find that by simply using low ranked predicted contacts as non-contacts and adding them as additional restraints, can increase the reconstruction accuracy by 5% when the reconstructed models are evaluated using TM-score. CONCLUSIONS: Our findings suggest that secondary structures are invaluable companions of contacts for accurate reconstruction. Confirming some earlier findings, we also find that larger distance thresholds are useful for folding many protein structures which cannot be folded using the standard definition of contacts. Our findings also suggest that for more accurate reconstruction using predicted contacts it is useful to predict contacts at higher distance thresholds (beyond 8 Å) and predict non-contacts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1807-5) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-29 /pmc/articles/PMC5576353/ /pubmed/28851269 http://dx.doi.org/10.1186/s12859-017-1807-5 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Adhikari, Badri
Cheng, Jianlin
Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts
title Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts
title_full Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts
title_fullStr Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts
title_full_unstemmed Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts
title_short Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts
title_sort improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5576353/
https://www.ncbi.nlm.nih.gov/pubmed/28851269
http://dx.doi.org/10.1186/s12859-017-1807-5
work_keys_str_mv AT adhikaribadri improvedproteinstructurereconstructionusingsecondarystructurescontactsathigherdistancethresholdsandnoncontacts
AT chengjianlin improvedproteinstructurereconstructionusingsecondarystructurescontactsathigherdistancethresholdsandnoncontacts