Cargando…

O-GlcNAcylation Prediction: An Unattained Objective

BACKGROUND: O-GlcNAcylation is an essential post-translational modification (PTM) in mammalian cells. It consists in the addition of a N-acetylglucosamine (GlcNAc) residue onto serines or threonines by an O-GlcNAc transferase (OGT). Inhibition of OGT is lethal, and misregulation of this PTM can lead...

Descripción completa

Detalles Bibliográficos
Autores principales: Mauri, Theo, Menu-Bouaouiche, Laurence, Bardor, Muriel, Lefebvre, Tony, Lensink, Marc F, Brysbaert, Guillaume
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8197665/
https://www.ncbi.nlm.nih.gov/pubmed/34135600
http://dx.doi.org/10.2147/AABC.S294867
_version_ 1783706966491660288
author Mauri, Theo
Menu-Bouaouiche, Laurence
Bardor, Muriel
Lefebvre, Tony
Lensink, Marc F
Brysbaert, Guillaume
author_facet Mauri, Theo
Menu-Bouaouiche, Laurence
Bardor, Muriel
Lefebvre, Tony
Lensink, Marc F
Brysbaert, Guillaume
author_sort Mauri, Theo
collection PubMed
description BACKGROUND: O-GlcNAcylation is an essential post-translational modification (PTM) in mammalian cells. It consists in the addition of a N-acetylglucosamine (GlcNAc) residue onto serines or threonines by an O-GlcNAc transferase (OGT). Inhibition of OGT is lethal, and misregulation of this PTM can lead to diverse pathologies including diabetes, Alzheimer’s disease and cancers. Knowing the location of O-GlcNAcylation sites and the ability to accurately predict them is therefore of prime importance to a better understanding of this process and its related pathologies. PURPOSE: Here, we present an evaluation of the current predictors of O-GlcNAcylation sites based on a newly built dataset and an investigation to improve predictions. METHODS: Several datasets of experimentally proven O-GlcNAcylated sites were combined, and the resulting meta-dataset was used to evaluate three prediction tools. We further defined a set of new features following the analysis of the primary to tertiary structures of experimentally proven O-GlcNAcylated sites in order to improve predictions by the use of different types of machine learning techniques. RESULTS: Our results show the failure of currently available algorithms to predict O-GlcNAcylated sites with a precision exceeding 9%. Our efforts to improve the precision with new features using machine learning techniques do succeed for equal proportions of O-GlcNAcylated and non-O-GlcNAcylated sites but fail like the other tools for real-life proportions where ~1.4% of S/T are O-GlcNAcylated. CONCLUSION: Present-day algorithms for O-GlcNAcylation prediction narrowly outperform random prediction. The inclusion of additional features, in combination with machine learning algorithms, does not enhance these predictions, emphasizing a pressing need for further development. We hypothesize that the improvement of prediction algorithms requires characterization of OGT’s partners.
format Online
Article
Text
id pubmed-8197665
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-81976652021-06-15 O-GlcNAcylation Prediction: An Unattained Objective Mauri, Theo Menu-Bouaouiche, Laurence Bardor, Muriel Lefebvre, Tony Lensink, Marc F Brysbaert, Guillaume Adv Appl Bioinform Chem Original Research BACKGROUND: O-GlcNAcylation is an essential post-translational modification (PTM) in mammalian cells. It consists in the addition of a N-acetylglucosamine (GlcNAc) residue onto serines or threonines by an O-GlcNAc transferase (OGT). Inhibition of OGT is lethal, and misregulation of this PTM can lead to diverse pathologies including diabetes, Alzheimer’s disease and cancers. Knowing the location of O-GlcNAcylation sites and the ability to accurately predict them is therefore of prime importance to a better understanding of this process and its related pathologies. PURPOSE: Here, we present an evaluation of the current predictors of O-GlcNAcylation sites based on a newly built dataset and an investigation to improve predictions. METHODS: Several datasets of experimentally proven O-GlcNAcylated sites were combined, and the resulting meta-dataset was used to evaluate three prediction tools. We further defined a set of new features following the analysis of the primary to tertiary structures of experimentally proven O-GlcNAcylated sites in order to improve predictions by the use of different types of machine learning techniques. RESULTS: Our results show the failure of currently available algorithms to predict O-GlcNAcylated sites with a precision exceeding 9%. Our efforts to improve the precision with new features using machine learning techniques do succeed for equal proportions of O-GlcNAcylated and non-O-GlcNAcylated sites but fail like the other tools for real-life proportions where ~1.4% of S/T are O-GlcNAcylated. CONCLUSION: Present-day algorithms for O-GlcNAcylation prediction narrowly outperform random prediction. The inclusion of additional features, in combination with machine learning algorithms, does not enhance these predictions, emphasizing a pressing need for further development. We hypothesize that the improvement of prediction algorithms requires characterization of OGT’s partners. Dove 2021-06-08 /pmc/articles/PMC8197665/ /pubmed/34135600 http://dx.doi.org/10.2147/AABC.S294867 Text en © 2021 Mauri et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Mauri, Theo
Menu-Bouaouiche, Laurence
Bardor, Muriel
Lefebvre, Tony
Lensink, Marc F
Brysbaert, Guillaume
O-GlcNAcylation Prediction: An Unattained Objective
title O-GlcNAcylation Prediction: An Unattained Objective
title_full O-GlcNAcylation Prediction: An Unattained Objective
title_fullStr O-GlcNAcylation Prediction: An Unattained Objective
title_full_unstemmed O-GlcNAcylation Prediction: An Unattained Objective
title_short O-GlcNAcylation Prediction: An Unattained Objective
title_sort o-glcnacylation prediction: an unattained objective
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8197665/
https://www.ncbi.nlm.nih.gov/pubmed/34135600
http://dx.doi.org/10.2147/AABC.S294867
work_keys_str_mv AT mauritheo oglcnacylationpredictionanunattainedobjective
AT menubouaouichelaurence oglcnacylationpredictionanunattainedobjective
AT bardormuriel oglcnacylationpredictionanunattainedobjective
AT lefebvretony oglcnacylationpredictionanunattainedobjective
AT lensinkmarcf oglcnacylationpredictionanunattainedobjective
AT brysbaertguillaume oglcnacylationpredictionanunattainedobjective