Cargando…

A machine learning approach for predicting methionine oxidation sites

BACKGROUND: The oxidation of protein-bound methionine to form methionine sulfoxide, has traditionally been regarded as an oxidative damage. However, recent evidences support the view of this reversible reaction as a regulatory post-translational modification. The perception that methionine sulfoxida...

Descripción completa

Detalles Bibliográficos
Autores principales: Aledo, Juan C., Cantón, Francisco R., Veredas, Francisco J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5622526/
https://www.ncbi.nlm.nih.gov/pubmed/28962549
http://dx.doi.org/10.1186/s12859-017-1848-9
_version_ 1783267928248942592
author Aledo, Juan C.
Cantón, Francisco R.
Veredas, Francisco J.
author_facet Aledo, Juan C.
Cantón, Francisco R.
Veredas, Francisco J.
author_sort Aledo, Juan C.
collection PubMed
description BACKGROUND: The oxidation of protein-bound methionine to form methionine sulfoxide, has traditionally been regarded as an oxidative damage. However, recent evidences support the view of this reversible reaction as a regulatory post-translational modification. The perception that methionine sulfoxidation may provide a mechanism to the redox regulation of a wide range of cellular processes, has stimulated some proteomic studies. However, these experimental approaches are expensive and time-consuming. Therefore, computational methods designed to predict methionine oxidation sites are an attractive alternative. As a first approach to this matter, we have developed models based on random forests, support vector machines and neural networks, aimed at accurate prediction of sites of methionine oxidation. RESULTS: Starting from published proteomic data regarding oxidized methionines, we created a hand-curated dataset formed by 113 unique polypeptides of known structure, containing 975 methionyl residues, 122 of which were oxidation-prone (positive dataset) and 853 were oxidation-resistant (negative dataset). We use a machine learning approach to generate predictive models from these datasets. Among the multiple features used in the classification task, some of them contributed substantially to the performance of the predictive models. Thus, (i) the solvent accessible area of the methionine residue, (ii) the number of residues between the analyzed methionine and the next methionine found towards the N-terminus and (iii) the spatial distance between the atom of sulfur from the analyzed methionine and the closest aromatic residue, were among the most relevant features. Compared to the other classifiers we also evaluated, random forests provided the best performance, with accuracy, sensitivity and specificity of 0.7468±0.0567, 0.6817±0.0982 and 0.7557±0.0721, respectively (mean ± standard deviation). CONCLUSIONS: We present the first predictive models aimed to computationally detect methionine sites that may become oxidized in vivo in response to oxidative signals. These models provide insights into the structural context in which a methionine residue become either oxidation-resistant or oxidation-prone. Furthermore, these models should be useful in prioritizing methinonyl residues for further studies to determine their potential as regulatory post-translational modification sites.
format Online
Article
Text
id pubmed-5622526
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56225262017-10-11 A machine learning approach for predicting methionine oxidation sites Aledo, Juan C. Cantón, Francisco R. Veredas, Francisco J. BMC Bioinformatics Research Article BACKGROUND: The oxidation of protein-bound methionine to form methionine sulfoxide, has traditionally been regarded as an oxidative damage. However, recent evidences support the view of this reversible reaction as a regulatory post-translational modification. The perception that methionine sulfoxidation may provide a mechanism to the redox regulation of a wide range of cellular processes, has stimulated some proteomic studies. However, these experimental approaches are expensive and time-consuming. Therefore, computational methods designed to predict methionine oxidation sites are an attractive alternative. As a first approach to this matter, we have developed models based on random forests, support vector machines and neural networks, aimed at accurate prediction of sites of methionine oxidation. RESULTS: Starting from published proteomic data regarding oxidized methionines, we created a hand-curated dataset formed by 113 unique polypeptides of known structure, containing 975 methionyl residues, 122 of which were oxidation-prone (positive dataset) and 853 were oxidation-resistant (negative dataset). We use a machine learning approach to generate predictive models from these datasets. Among the multiple features used in the classification task, some of them contributed substantially to the performance of the predictive models. Thus, (i) the solvent accessible area of the methionine residue, (ii) the number of residues between the analyzed methionine and the next methionine found towards the N-terminus and (iii) the spatial distance between the atom of sulfur from the analyzed methionine and the closest aromatic residue, were among the most relevant features. Compared to the other classifiers we also evaluated, random forests provided the best performance, with accuracy, sensitivity and specificity of 0.7468±0.0567, 0.6817±0.0982 and 0.7557±0.0721, respectively (mean ± standard deviation). CONCLUSIONS: We present the first predictive models aimed to computationally detect methionine sites that may become oxidized in vivo in response to oxidative signals. These models provide insights into the structural context in which a methionine residue become either oxidation-resistant or oxidation-prone. Furthermore, these models should be useful in prioritizing methinonyl residues for further studies to determine their potential as regulatory post-translational modification sites. BioMed Central 2017-09-29 /pmc/articles/PMC5622526/ /pubmed/28962549 http://dx.doi.org/10.1186/s12859-017-1848-9 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Aledo, Juan C.
Cantón, Francisco R.
Veredas, Francisco J.
A machine learning approach for predicting methionine oxidation sites
title A machine learning approach for predicting methionine oxidation sites
title_full A machine learning approach for predicting methionine oxidation sites
title_fullStr A machine learning approach for predicting methionine oxidation sites
title_full_unstemmed A machine learning approach for predicting methionine oxidation sites
title_short A machine learning approach for predicting methionine oxidation sites
title_sort machine learning approach for predicting methionine oxidation sites
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5622526/
https://www.ncbi.nlm.nih.gov/pubmed/28962549
http://dx.doi.org/10.1186/s12859-017-1848-9
work_keys_str_mv AT aledojuanc amachinelearningapproachforpredictingmethionineoxidationsites
AT cantonfranciscor amachinelearningapproachforpredictingmethionineoxidationsites
AT veredasfranciscoj amachinelearningapproachforpredictingmethionineoxidationsites
AT aledojuanc machinelearningapproachforpredictingmethionineoxidationsites
AT cantonfranciscor machinelearningapproachforpredictingmethionineoxidationsites
AT veredasfranciscoj machinelearningapproachforpredictingmethionineoxidationsites