Cargando…

Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations

BACKGROUND: Recently, information derived by correlated mutations in proteins has regained relevance for predicting protein contacts. This is due to new forms of mutual information analysis that have been proven to be more suitable to highlight direct coupling between pairs of residues in protein st...

Descripción completa

Detalles Bibliográficos
Autores principales: Savojardo, Castrense, Fariselli, Piero, Martelli, Pier Luigi, Casadio, Rita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548674/
https://www.ncbi.nlm.nih.gov/pubmed/23368835
http://dx.doi.org/10.1186/1471-2105-14-S1-S10
_version_ 1782256343879516160
author Savojardo, Castrense
Fariselli, Piero
Martelli, Pier Luigi
Casadio, Rita
author_facet Savojardo, Castrense
Fariselli, Piero
Martelli, Pier Luigi
Casadio, Rita
author_sort Savojardo, Castrense
collection PubMed
description BACKGROUND: Recently, information derived by correlated mutations in proteins has regained relevance for predicting protein contacts. This is due to new forms of mutual information analysis that have been proven to be more suitable to highlight direct coupling between pairs of residues in protein structures and to the large number of protein chains that are currently available for statistical validation. It was previously discussed that disulfide bond topology in proteins is also constrained by correlated mutations. RESULTS: In this paper we exploit information derived from a corrected mutual information analysis and from the inverse of the covariance matrix to address the problem of the prediction of the topology of disulfide bonds in Eukaryotes. Recently, we have shown that Support Vector Regression (SVR) can improve the prediction for the disulfide connectivity patterns. Here we show that the inclusion of the correlated mutation information increases of 5 percentage points the SVR performance (from 54% to 59%). When this approach is used in combination with a method previously developed by us and scoring at the state of art in predicting both location and topology of disulfide bonds in Eukaryotes (DisLocate), the per-protein accuracy is 38%, 2 percentage points higher than that previously obtained. CONCLUSIONS: In this paper we show that the inclusion of information derived from correlated mutations can improve the performance of the state of the art methods for predicting disulfide connectivity patterns in Eukaryotic proteins. Our analysis also provides support to the notion that improving methods to extract evolutionary information from multiple sequence alignments greatly contributes to the scoring performance of predictors suited to detect relevant features from protein chains.
format Online
Article
Text
id pubmed-3548674
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35486742013-02-04 Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations Savojardo, Castrense Fariselli, Piero Martelli, Pier Luigi Casadio, Rita BMC Bioinformatics Research BACKGROUND: Recently, information derived by correlated mutations in proteins has regained relevance for predicting protein contacts. This is due to new forms of mutual information analysis that have been proven to be more suitable to highlight direct coupling between pairs of residues in protein structures and to the large number of protein chains that are currently available for statistical validation. It was previously discussed that disulfide bond topology in proteins is also constrained by correlated mutations. RESULTS: In this paper we exploit information derived from a corrected mutual information analysis and from the inverse of the covariance matrix to address the problem of the prediction of the topology of disulfide bonds in Eukaryotes. Recently, we have shown that Support Vector Regression (SVR) can improve the prediction for the disulfide connectivity patterns. Here we show that the inclusion of the correlated mutation information increases of 5 percentage points the SVR performance (from 54% to 59%). When this approach is used in combination with a method previously developed by us and scoring at the state of art in predicting both location and topology of disulfide bonds in Eukaryotes (DisLocate), the per-protein accuracy is 38%, 2 percentage points higher than that previously obtained. CONCLUSIONS: In this paper we show that the inclusion of information derived from correlated mutations can improve the performance of the state of the art methods for predicting disulfide connectivity patterns in Eukaryotic proteins. Our analysis also provides support to the notion that improving methods to extract evolutionary information from multiple sequence alignments greatly contributes to the scoring performance of predictors suited to detect relevant features from protein chains. BioMed Central 2013-01-14 /pmc/articles/PMC3548674/ /pubmed/23368835 http://dx.doi.org/10.1186/1471-2105-14-S1-S10 Text en Copyright ©2013 Savojardo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Savojardo, Castrense
Fariselli, Piero
Martelli, Pier Luigi
Casadio, Rita
Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations
title Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations
title_full Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations
title_fullStr Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations
title_full_unstemmed Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations
title_short Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations
title_sort prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548674/
https://www.ncbi.nlm.nih.gov/pubmed/23368835
http://dx.doi.org/10.1186/1471-2105-14-S1-S10
work_keys_str_mv AT savojardocastrense predictionofdisulfideconnectivityinproteinswithmachinelearningmethodsandcorrelatedmutations
AT farisellipiero predictionofdisulfideconnectivityinproteinswithmachinelearningmethodsandcorrelatedmutations
AT martellipierluigi predictionofdisulfideconnectivityinproteinswithmachinelearningmethodsandcorrelatedmutations
AT casadiorita predictionofdisulfideconnectivityinproteinswithmachinelearningmethodsandcorrelatedmutations