Cargando…

Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks

BACKGROUND: Prediction of protein structures from their sequences is still one of the open grand challenges of computational biology. Some approaches to protein structure prediction, especially ab initio ones, rely to some extent on the prediction of residue contact maps. Residue contact map predict...

Descripción completa

Detalles Bibliográficos
Autores principales: Walsh, Ian, Baù, Davide, Martin, Alberto JM, Mooney, Catherine, Vullo, Alessandro, Pollastri, Gianluca
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654788/
https://www.ncbi.nlm.nih.gov/pubmed/19183478
http://dx.doi.org/10.1186/1472-6807-9-5
_version_ 1782165402416054272
author Walsh, Ian
Baù, Davide
Martin, Alberto JM
Mooney, Catherine
Vullo, Alessandro
Pollastri, Gianluca
author_facet Walsh, Ian
Baù, Davide
Martin, Alberto JM
Mooney, Catherine
Vullo, Alessandro
Pollastri, Gianluca
author_sort Walsh, Ian
collection PubMed
description BACKGROUND: Prediction of protein structures from their sequences is still one of the open grand challenges of computational biology. Some approaches to protein structure prediction, especially ab initio ones, rely to some extent on the prediction of residue contact maps. Residue contact map predictions have been assessed at the CASP competition for several years now. Although it has been shown that exact contact maps generally yield correct three-dimensional structures, this is true only at a relatively low resolution (3–4 Å from the native structure). Another known weakness of contact maps is that they are generally predicted ab initio, that is not exploiting information about potential homologues of known structure. RESULTS: We introduce a new class of distance restraints for protein structures: multi-class distance maps. We show that C(α )trace reconstructions based on 4-class native maps are significantly better than those from residue contact maps. We then build two predictors of 4-class maps based on recursive neural networks: one ab initio, or relying on the sequence and on evolutionary information; one template-based, or in which homology information to known structures is provided as a further input. We show that virtually any level of sequence similarity to structural templates (down to less than 10%) yields more accurate 4-class maps than the ab initio predictor. We show that template-based predictions by recursive neural networks are consistently better than the best template and than a number of combinations of the best available templates. We also extract binary residue contact maps at an 8 Å threshold (as per CASP assessment) from the 4-class predictors and show that the template-based version is also more accurate than the best template and consistently better than the ab initio one, down to very low levels of sequence identity to structural templates. Furthermore, we test both ab-initio and template-based 8 Å predictions on the CASP7 targets using a pre-CASP7 PDB, and find that both predictors are state-of-the-art, with the template-based one far outperforming the best CASP7 systems if templates with sequence identity to the query of 10% or better are available. Although this is not the main focus of this paper we also report on reconstructions of C(α )traces based on both ab initio and template-based 4-class map predictions, showing that the latter are generally more accurate even when homology is dubious. CONCLUSION: Accurate predictions of multi-class maps may provide valuable constraints for improved ab initio and template-based prediction of protein structures, naturally incorporate multiple templates, and yield state-of-the-art binary maps. Predictions of protein structures and 8 Å contact maps based on the multi-class distance map predictors described in this paper are freely available to academic users at the url .
format Text
id pubmed-2654788
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26547882009-03-13 Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks Walsh, Ian Baù, Davide Martin, Alberto JM Mooney, Catherine Vullo, Alessandro Pollastri, Gianluca BMC Struct Biol Research Article BACKGROUND: Prediction of protein structures from their sequences is still one of the open grand challenges of computational biology. Some approaches to protein structure prediction, especially ab initio ones, rely to some extent on the prediction of residue contact maps. Residue contact map predictions have been assessed at the CASP competition for several years now. Although it has been shown that exact contact maps generally yield correct three-dimensional structures, this is true only at a relatively low resolution (3–4 Å from the native structure). Another known weakness of contact maps is that they are generally predicted ab initio, that is not exploiting information about potential homologues of known structure. RESULTS: We introduce a new class of distance restraints for protein structures: multi-class distance maps. We show that C(α )trace reconstructions based on 4-class native maps are significantly better than those from residue contact maps. We then build two predictors of 4-class maps based on recursive neural networks: one ab initio, or relying on the sequence and on evolutionary information; one template-based, or in which homology information to known structures is provided as a further input. We show that virtually any level of sequence similarity to structural templates (down to less than 10%) yields more accurate 4-class maps than the ab initio predictor. We show that template-based predictions by recursive neural networks are consistently better than the best template and than a number of combinations of the best available templates. We also extract binary residue contact maps at an 8 Å threshold (as per CASP assessment) from the 4-class predictors and show that the template-based version is also more accurate than the best template and consistently better than the ab initio one, down to very low levels of sequence identity to structural templates. Furthermore, we test both ab-initio and template-based 8 Å predictions on the CASP7 targets using a pre-CASP7 PDB, and find that both predictors are state-of-the-art, with the template-based one far outperforming the best CASP7 systems if templates with sequence identity to the query of 10% or better are available. Although this is not the main focus of this paper we also report on reconstructions of C(α )traces based on both ab initio and template-based 4-class map predictions, showing that the latter are generally more accurate even when homology is dubious. CONCLUSION: Accurate predictions of multi-class maps may provide valuable constraints for improved ab initio and template-based prediction of protein structures, naturally incorporate multiple templates, and yield state-of-the-art binary maps. Predictions of protein structures and 8 Å contact maps based on the multi-class distance map predictors described in this paper are freely available to academic users at the url . BioMed Central 2009-01-30 /pmc/articles/PMC2654788/ /pubmed/19183478 http://dx.doi.org/10.1186/1472-6807-9-5 Text en Copyright © 2009 Walsh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Walsh, Ian
Baù, Davide
Martin, Alberto JM
Mooney, Catherine
Vullo, Alessandro
Pollastri, Gianluca
Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks
title Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks
title_full Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks
title_fullStr Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks
title_full_unstemmed Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks
title_short Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks
title_sort ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654788/
https://www.ncbi.nlm.nih.gov/pubmed/19183478
http://dx.doi.org/10.1186/1472-6807-9-5
work_keys_str_mv AT walshian abinitioandtemplatebasedpredictionofmulticlassdistancemapsbytwodimensionalrecursiveneuralnetworks
AT baudavide abinitioandtemplatebasedpredictionofmulticlassdistancemapsbytwodimensionalrecursiveneuralnetworks
AT martinalbertojm abinitioandtemplatebasedpredictionofmulticlassdistancemapsbytwodimensionalrecursiveneuralnetworks
AT mooneycatherine abinitioandtemplatebasedpredictionofmulticlassdistancemapsbytwodimensionalrecursiveneuralnetworks
AT vulloalessandro abinitioandtemplatebasedpredictionofmulticlassdistancemapsbytwodimensionalrecursiveneuralnetworks
AT pollastrigianluca abinitioandtemplatebasedpredictionofmulticlassdistancemapsbytwodimensionalrecursiveneuralnetworks