Cargando…
A machine learning approach for predicting methionine oxidation sites
BACKGROUND: The oxidation of protein-bound methionine to form methionine sulfoxide, has traditionally been regarded as an oxidative damage. However, recent evidences support the view of this reversible reaction as a regulatory post-translational modification. The perception that methionine sulfoxida...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5622526/ https://www.ncbi.nlm.nih.gov/pubmed/28962549 http://dx.doi.org/10.1186/s12859-017-1848-9 |
_version_ | 1783267928248942592 |
---|---|
author | Aledo, Juan C. Cantón, Francisco R. Veredas, Francisco J. |
author_facet | Aledo, Juan C. Cantón, Francisco R. Veredas, Francisco J. |
author_sort | Aledo, Juan C. |
collection | PubMed |
description | BACKGROUND: The oxidation of protein-bound methionine to form methionine sulfoxide, has traditionally been regarded as an oxidative damage. However, recent evidences support the view of this reversible reaction as a regulatory post-translational modification. The perception that methionine sulfoxidation may provide a mechanism to the redox regulation of a wide range of cellular processes, has stimulated some proteomic studies. However, these experimental approaches are expensive and time-consuming. Therefore, computational methods designed to predict methionine oxidation sites are an attractive alternative. As a first approach to this matter, we have developed models based on random forests, support vector machines and neural networks, aimed at accurate prediction of sites of methionine oxidation. RESULTS: Starting from published proteomic data regarding oxidized methionines, we created a hand-curated dataset formed by 113 unique polypeptides of known structure, containing 975 methionyl residues, 122 of which were oxidation-prone (positive dataset) and 853 were oxidation-resistant (negative dataset). We use a machine learning approach to generate predictive models from these datasets. Among the multiple features used in the classification task, some of them contributed substantially to the performance of the predictive models. Thus, (i) the solvent accessible area of the methionine residue, (ii) the number of residues between the analyzed methionine and the next methionine found towards the N-terminus and (iii) the spatial distance between the atom of sulfur from the analyzed methionine and the closest aromatic residue, were among the most relevant features. Compared to the other classifiers we also evaluated, random forests provided the best performance, with accuracy, sensitivity and specificity of 0.7468±0.0567, 0.6817±0.0982 and 0.7557±0.0721, respectively (mean ± standard deviation). CONCLUSIONS: We present the first predictive models aimed to computationally detect methionine sites that may become oxidized in vivo in response to oxidative signals. These models provide insights into the structural context in which a methionine residue become either oxidation-resistant or oxidation-prone. Furthermore, these models should be useful in prioritizing methinonyl residues for further studies to determine their potential as regulatory post-translational modification sites. |
format | Online Article Text |
id | pubmed-5622526 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-56225262017-10-11 A machine learning approach for predicting methionine oxidation sites Aledo, Juan C. Cantón, Francisco R. Veredas, Francisco J. BMC Bioinformatics Research Article BACKGROUND: The oxidation of protein-bound methionine to form methionine sulfoxide, has traditionally been regarded as an oxidative damage. However, recent evidences support the view of this reversible reaction as a regulatory post-translational modification. The perception that methionine sulfoxidation may provide a mechanism to the redox regulation of a wide range of cellular processes, has stimulated some proteomic studies. However, these experimental approaches are expensive and time-consuming. Therefore, computational methods designed to predict methionine oxidation sites are an attractive alternative. As a first approach to this matter, we have developed models based on random forests, support vector machines and neural networks, aimed at accurate prediction of sites of methionine oxidation. RESULTS: Starting from published proteomic data regarding oxidized methionines, we created a hand-curated dataset formed by 113 unique polypeptides of known structure, containing 975 methionyl residues, 122 of which were oxidation-prone (positive dataset) and 853 were oxidation-resistant (negative dataset). We use a machine learning approach to generate predictive models from these datasets. Among the multiple features used in the classification task, some of them contributed substantially to the performance of the predictive models. Thus, (i) the solvent accessible area of the methionine residue, (ii) the number of residues between the analyzed methionine and the next methionine found towards the N-terminus and (iii) the spatial distance between the atom of sulfur from the analyzed methionine and the closest aromatic residue, were among the most relevant features. Compared to the other classifiers we also evaluated, random forests provided the best performance, with accuracy, sensitivity and specificity of 0.7468±0.0567, 0.6817±0.0982 and 0.7557±0.0721, respectively (mean ± standard deviation). CONCLUSIONS: We present the first predictive models aimed to computationally detect methionine sites that may become oxidized in vivo in response to oxidative signals. These models provide insights into the structural context in which a methionine residue become either oxidation-resistant or oxidation-prone. Furthermore, these models should be useful in prioritizing methinonyl residues for further studies to determine their potential as regulatory post-translational modification sites. BioMed Central 2017-09-29 /pmc/articles/PMC5622526/ /pubmed/28962549 http://dx.doi.org/10.1186/s12859-017-1848-9 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Aledo, Juan C. Cantón, Francisco R. Veredas, Francisco J. A machine learning approach for predicting methionine oxidation sites |
title | A machine learning approach for predicting methionine oxidation sites |
title_full | A machine learning approach for predicting methionine oxidation sites |
title_fullStr | A machine learning approach for predicting methionine oxidation sites |
title_full_unstemmed | A machine learning approach for predicting methionine oxidation sites |
title_short | A machine learning approach for predicting methionine oxidation sites |
title_sort | machine learning approach for predicting methionine oxidation sites |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5622526/ https://www.ncbi.nlm.nih.gov/pubmed/28962549 http://dx.doi.org/10.1186/s12859-017-1848-9 |
work_keys_str_mv | AT aledojuanc amachinelearningapproachforpredictingmethionineoxidationsites AT cantonfranciscor amachinelearningapproachforpredictingmethionineoxidationsites AT veredasfranciscoj amachinelearningapproachforpredictingmethionineoxidationsites AT aledojuanc machinelearningapproachforpredictingmethionineoxidationsites AT cantonfranciscor machinelearningapproachforpredictingmethionineoxidationsites AT veredasfranciscoj machinelearningapproachforpredictingmethionineoxidationsites |