Cargando…

Improved residue contact prediction using support vector machines and a large feature set

BACKGROUND: Predicting protein residue-residue contacts is an important 2D prediction task. It is useful for ab initio structure prediction and understanding protein folding. In spite of steady progress over the past decade, contact prediction remains still largely unsolved. RESULTS: Here we develop...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Jianlin, Baldi, Pierre
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1852326/
https://www.ncbi.nlm.nih.gov/pubmed/17407573
http://dx.doi.org/10.1186/1471-2105-8-113
_version_ 1782133038290829312
author Cheng, Jianlin
Baldi, Pierre
author_facet Cheng, Jianlin
Baldi, Pierre
author_sort Cheng, Jianlin
collection PubMed
description BACKGROUND: Predicting protein residue-residue contacts is an important 2D prediction task. It is useful for ab initio structure prediction and understanding protein folding. In spite of steady progress over the past decade, contact prediction remains still largely unsolved. RESULTS: Here we develop a new contact map predictor (SVMcon) that uses support vector machines to predict medium- and long-range contacts. SVMcon integrates profiles, secondary structure, relative solvent accessibility, contact potentials, and other useful features. On the same test data set, SVMcon's accuracy is 4% higher than the latest version of the CMAPpro contact map predictor. SVMcon recently participated in the seventh edition of the Critical Assessment of Techniques for Protein Structure Prediction (CASP7) experiment and was evaluated along with seven other contact map predictors. SVMcon was ranked as one of the top predictors, yielding the second best coverage and accuracy for contacts with sequence separation >= 12 on 13 de novo domains. CONCLUSION: We describe SVMcon, a new contact map predictor that uses SVMs and a large set of informative features. SVMcon yields good performance on medium- to long-range contact predictions and can be modularly incorporated into a structure prediction pipeline.
format Text
id pubmed-1852326
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18523262007-04-17 Improved residue contact prediction using support vector machines and a large feature set Cheng, Jianlin Baldi, Pierre BMC Bioinformatics Research Article BACKGROUND: Predicting protein residue-residue contacts is an important 2D prediction task. It is useful for ab initio structure prediction and understanding protein folding. In spite of steady progress over the past decade, contact prediction remains still largely unsolved. RESULTS: Here we develop a new contact map predictor (SVMcon) that uses support vector machines to predict medium- and long-range contacts. SVMcon integrates profiles, secondary structure, relative solvent accessibility, contact potentials, and other useful features. On the same test data set, SVMcon's accuracy is 4% higher than the latest version of the CMAPpro contact map predictor. SVMcon recently participated in the seventh edition of the Critical Assessment of Techniques for Protein Structure Prediction (CASP7) experiment and was evaluated along with seven other contact map predictors. SVMcon was ranked as one of the top predictors, yielding the second best coverage and accuracy for contacts with sequence separation >= 12 on 13 de novo domains. CONCLUSION: We describe SVMcon, a new contact map predictor that uses SVMs and a large set of informative features. SVMcon yields good performance on medium- to long-range contact predictions and can be modularly incorporated into a structure prediction pipeline. BioMed Central 2007-04-02 /pmc/articles/PMC1852326/ /pubmed/17407573 http://dx.doi.org/10.1186/1471-2105-8-113 Text en Copyright © 2007 Cheng and Baldi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Cheng, Jianlin
Baldi, Pierre
Improved residue contact prediction using support vector machines and a large feature set
title Improved residue contact prediction using support vector machines and a large feature set
title_full Improved residue contact prediction using support vector machines and a large feature set
title_fullStr Improved residue contact prediction using support vector machines and a large feature set
title_full_unstemmed Improved residue contact prediction using support vector machines and a large feature set
title_short Improved residue contact prediction using support vector machines and a large feature set
title_sort improved residue contact prediction using support vector machines and a large feature set
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1852326/
https://www.ncbi.nlm.nih.gov/pubmed/17407573
http://dx.doi.org/10.1186/1471-2105-8-113
work_keys_str_mv AT chengjianlin improvedresiduecontactpredictionusingsupportvectormachinesandalargefeatureset
AT baldipierre improvedresiduecontactpredictionusingsupportvectormachinesandalargefeatureset