Cargando…
O-GlcNAcylation Prediction: An Unattained Objective
BACKGROUND: O-GlcNAcylation is an essential post-translational modification (PTM) in mammalian cells. It consists in the addition of a N-acetylglucosamine (GlcNAc) residue onto serines or threonines by an O-GlcNAc transferase (OGT). Inhibition of OGT is lethal, and misregulation of this PTM can lead...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Dove
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8197665/ https://www.ncbi.nlm.nih.gov/pubmed/34135600 http://dx.doi.org/10.2147/AABC.S294867 |
_version_ | 1783706966491660288 |
---|---|
author | Mauri, Theo Menu-Bouaouiche, Laurence Bardor, Muriel Lefebvre, Tony Lensink, Marc F Brysbaert, Guillaume |
author_facet | Mauri, Theo Menu-Bouaouiche, Laurence Bardor, Muriel Lefebvre, Tony Lensink, Marc F Brysbaert, Guillaume |
author_sort | Mauri, Theo |
collection | PubMed |
description | BACKGROUND: O-GlcNAcylation is an essential post-translational modification (PTM) in mammalian cells. It consists in the addition of a N-acetylglucosamine (GlcNAc) residue onto serines or threonines by an O-GlcNAc transferase (OGT). Inhibition of OGT is lethal, and misregulation of this PTM can lead to diverse pathologies including diabetes, Alzheimer’s disease and cancers. Knowing the location of O-GlcNAcylation sites and the ability to accurately predict them is therefore of prime importance to a better understanding of this process and its related pathologies. PURPOSE: Here, we present an evaluation of the current predictors of O-GlcNAcylation sites based on a newly built dataset and an investigation to improve predictions. METHODS: Several datasets of experimentally proven O-GlcNAcylated sites were combined, and the resulting meta-dataset was used to evaluate three prediction tools. We further defined a set of new features following the analysis of the primary to tertiary structures of experimentally proven O-GlcNAcylated sites in order to improve predictions by the use of different types of machine learning techniques. RESULTS: Our results show the failure of currently available algorithms to predict O-GlcNAcylated sites with a precision exceeding 9%. Our efforts to improve the precision with new features using machine learning techniques do succeed for equal proportions of O-GlcNAcylated and non-O-GlcNAcylated sites but fail like the other tools for real-life proportions where ~1.4% of S/T are O-GlcNAcylated. CONCLUSION: Present-day algorithms for O-GlcNAcylation prediction narrowly outperform random prediction. The inclusion of additional features, in combination with machine learning algorithms, does not enhance these predictions, emphasizing a pressing need for further development. We hypothesize that the improvement of prediction algorithms requires characterization of OGT’s partners. |
format | Online Article Text |
id | pubmed-8197665 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Dove |
record_format | MEDLINE/PubMed |
spelling | pubmed-81976652021-06-15 O-GlcNAcylation Prediction: An Unattained Objective Mauri, Theo Menu-Bouaouiche, Laurence Bardor, Muriel Lefebvre, Tony Lensink, Marc F Brysbaert, Guillaume Adv Appl Bioinform Chem Original Research BACKGROUND: O-GlcNAcylation is an essential post-translational modification (PTM) in mammalian cells. It consists in the addition of a N-acetylglucosamine (GlcNAc) residue onto serines or threonines by an O-GlcNAc transferase (OGT). Inhibition of OGT is lethal, and misregulation of this PTM can lead to diverse pathologies including diabetes, Alzheimer’s disease and cancers. Knowing the location of O-GlcNAcylation sites and the ability to accurately predict them is therefore of prime importance to a better understanding of this process and its related pathologies. PURPOSE: Here, we present an evaluation of the current predictors of O-GlcNAcylation sites based on a newly built dataset and an investigation to improve predictions. METHODS: Several datasets of experimentally proven O-GlcNAcylated sites were combined, and the resulting meta-dataset was used to evaluate three prediction tools. We further defined a set of new features following the analysis of the primary to tertiary structures of experimentally proven O-GlcNAcylated sites in order to improve predictions by the use of different types of machine learning techniques. RESULTS: Our results show the failure of currently available algorithms to predict O-GlcNAcylated sites with a precision exceeding 9%. Our efforts to improve the precision with new features using machine learning techniques do succeed for equal proportions of O-GlcNAcylated and non-O-GlcNAcylated sites but fail like the other tools for real-life proportions where ~1.4% of S/T are O-GlcNAcylated. CONCLUSION: Present-day algorithms for O-GlcNAcylation prediction narrowly outperform random prediction. The inclusion of additional features, in combination with machine learning algorithms, does not enhance these predictions, emphasizing a pressing need for further development. We hypothesize that the improvement of prediction algorithms requires characterization of OGT’s partners. Dove 2021-06-08 /pmc/articles/PMC8197665/ /pubmed/34135600 http://dx.doi.org/10.2147/AABC.S294867 Text en © 2021 Mauri et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php). |
spellingShingle | Original Research Mauri, Theo Menu-Bouaouiche, Laurence Bardor, Muriel Lefebvre, Tony Lensink, Marc F Brysbaert, Guillaume O-GlcNAcylation Prediction: An Unattained Objective |
title | O-GlcNAcylation Prediction: An Unattained Objective |
title_full | O-GlcNAcylation Prediction: An Unattained Objective |
title_fullStr | O-GlcNAcylation Prediction: An Unattained Objective |
title_full_unstemmed | O-GlcNAcylation Prediction: An Unattained Objective |
title_short | O-GlcNAcylation Prediction: An Unattained Objective |
title_sort | o-glcnacylation prediction: an unattained objective |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8197665/ https://www.ncbi.nlm.nih.gov/pubmed/34135600 http://dx.doi.org/10.2147/AABC.S294867 |
work_keys_str_mv | AT mauritheo oglcnacylationpredictionanunattainedobjective AT menubouaouichelaurence oglcnacylationpredictionanunattainedobjective AT bardormuriel oglcnacylationpredictionanunattainedobjective AT lefebvretony oglcnacylationpredictionanunattainedobjective AT lensinkmarcf oglcnacylationpredictionanunattainedobjective AT brysbaertguillaume oglcnacylationpredictionanunattainedobjective |