Cargando…

RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes

MOTIVATION: Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically syn-thesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Hackett, William Edwin, Chang, Deborah, Carvalho, Luis, Zaia, Joseph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10312533/
https://www.ncbi.nlm.nih.gov/pubmed/37398011
http://dx.doi.org/10.1101/2023.05.30.542895
_version_ 1785066945625718784
author Hackett, William Edwin
Chang, Deborah
Carvalho, Luis
Zaia, Joseph
author_facet Hackett, William Edwin
Chang, Deborah
Carvalho, Luis
Zaia, Joseph
author_sort Hackett, William Edwin
collection PubMed
description MOTIVATION: Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically syn-thesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene products. While glycoproteins account for approximately half of all proteins, their macro- and micro-heterogeneity requires specialized proteomics data analysis methods as a given glycosite can be divided into several glycosylated forms, each of which must be quantified. Sampling of heterogeneous glycopeptides is limited by mass spectrometer speed and sensitivity, resulting in missing values. In conjunction with the low sample size inherent to glycoproteomics, this necessitated specialized statistical metrics to identify if observed changes in glycopeptide abundances are biologically significant or due to data quality limitations. RESULTS: We developed an R package, Relative Assessment of m/z Identifications by Similarity (RAMZIS), that uses similarity metrics to guide biomedical researchers to a more rigorous interpretation of glycoproteomics data. RAMZIS uses contextual similarity to assess the quality of mass spectral data and generates graphical output that demonstrates the likelihood of finding biologically significant differences in glycosylation abundance dataset. Investigators can assess dataset quality, holistically differentiate glycosites, and identify which glycopeptides are responsible for glycosylation pattern expression change. Herein RAMZIS approach is validated by theoretical cases and by a proof-of-concept application. RAMZIS enables comparison between datasets too stochastic, small, or sparse for interpolation while acknowledging these issues in its assessment. Using our tool, researchers will be able to rigor-ously define the role of glycosylation and the changes that occur during biological processes.
format Online
Article
Text
id pubmed-10312533
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-103125332023-07-01 RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes Hackett, William Edwin Chang, Deborah Carvalho, Luis Zaia, Joseph bioRxiv Article MOTIVATION: Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically syn-thesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene products. While glycoproteins account for approximately half of all proteins, their macro- and micro-heterogeneity requires specialized proteomics data analysis methods as a given glycosite can be divided into several glycosylated forms, each of which must be quantified. Sampling of heterogeneous glycopeptides is limited by mass spectrometer speed and sensitivity, resulting in missing values. In conjunction with the low sample size inherent to glycoproteomics, this necessitated specialized statistical metrics to identify if observed changes in glycopeptide abundances are biologically significant or due to data quality limitations. RESULTS: We developed an R package, Relative Assessment of m/z Identifications by Similarity (RAMZIS), that uses similarity metrics to guide biomedical researchers to a more rigorous interpretation of glycoproteomics data. RAMZIS uses contextual similarity to assess the quality of mass spectral data and generates graphical output that demonstrates the likelihood of finding biologically significant differences in glycosylation abundance dataset. Investigators can assess dataset quality, holistically differentiate glycosites, and identify which glycopeptides are responsible for glycosylation pattern expression change. Herein RAMZIS approach is validated by theoretical cases and by a proof-of-concept application. RAMZIS enables comparison between datasets too stochastic, small, or sparse for interpolation while acknowledging these issues in its assessment. Using our tool, researchers will be able to rigor-ously define the role of glycosylation and the changes that occur during biological processes. Cold Spring Harbor Laboratory 2023-06-01 /pmc/articles/PMC10312533/ /pubmed/37398011 http://dx.doi.org/10.1101/2023.05.30.542895 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Hackett, William Edwin
Chang, Deborah
Carvalho, Luis
Zaia, Joseph
RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes
title RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes
title_full RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes
title_fullStr RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes
title_full_unstemmed RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes
title_short RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes
title_sort ramzis: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10312533/
https://www.ncbi.nlm.nih.gov/pubmed/37398011
http://dx.doi.org/10.1101/2023.05.30.542895
work_keys_str_mv AT hackettwilliamedwin ramzisabioinformatictoolkitforrigorousassessmentofthealterationstoglycoproteinstructurethatoccurduringbiologicalprocesses
AT changdeborah ramzisabioinformatictoolkitforrigorousassessmentofthealterationstoglycoproteinstructurethatoccurduringbiologicalprocesses
AT carvalholuis ramzisabioinformatictoolkitforrigorousassessmentofthealterationstoglycoproteinstructurethatoccurduringbiologicalprocesses
AT zaiajoseph ramzisabioinformatictoolkitforrigorousassessmentofthealterationstoglycoproteinstructurethatoccurduringbiologicalprocesses