Cargando…
RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes
MOTIVATION: Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically syn-thesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene pr...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10312533/ https://www.ncbi.nlm.nih.gov/pubmed/37398011 http://dx.doi.org/10.1101/2023.05.30.542895 |
_version_ | 1785066945625718784 |
---|---|
author | Hackett, William Edwin Chang, Deborah Carvalho, Luis Zaia, Joseph |
author_facet | Hackett, William Edwin Chang, Deborah Carvalho, Luis Zaia, Joseph |
author_sort | Hackett, William Edwin |
collection | PubMed |
description | MOTIVATION: Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically syn-thesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene products. While glycoproteins account for approximately half of all proteins, their macro- and micro-heterogeneity requires specialized proteomics data analysis methods as a given glycosite can be divided into several glycosylated forms, each of which must be quantified. Sampling of heterogeneous glycopeptides is limited by mass spectrometer speed and sensitivity, resulting in missing values. In conjunction with the low sample size inherent to glycoproteomics, this necessitated specialized statistical metrics to identify if observed changes in glycopeptide abundances are biologically significant or due to data quality limitations. RESULTS: We developed an R package, Relative Assessment of m/z Identifications by Similarity (RAMZIS), that uses similarity metrics to guide biomedical researchers to a more rigorous interpretation of glycoproteomics data. RAMZIS uses contextual similarity to assess the quality of mass spectral data and generates graphical output that demonstrates the likelihood of finding biologically significant differences in glycosylation abundance dataset. Investigators can assess dataset quality, holistically differentiate glycosites, and identify which glycopeptides are responsible for glycosylation pattern expression change. Herein RAMZIS approach is validated by theoretical cases and by a proof-of-concept application. RAMZIS enables comparison between datasets too stochastic, small, or sparse for interpolation while acknowledging these issues in its assessment. Using our tool, researchers will be able to rigor-ously define the role of glycosylation and the changes that occur during biological processes. |
format | Online Article Text |
id | pubmed-10312533 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-103125332023-07-01 RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes Hackett, William Edwin Chang, Deborah Carvalho, Luis Zaia, Joseph bioRxiv Article MOTIVATION: Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically syn-thesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene products. While glycoproteins account for approximately half of all proteins, their macro- and micro-heterogeneity requires specialized proteomics data analysis methods as a given glycosite can be divided into several glycosylated forms, each of which must be quantified. Sampling of heterogeneous glycopeptides is limited by mass spectrometer speed and sensitivity, resulting in missing values. In conjunction with the low sample size inherent to glycoproteomics, this necessitated specialized statistical metrics to identify if observed changes in glycopeptide abundances are biologically significant or due to data quality limitations. RESULTS: We developed an R package, Relative Assessment of m/z Identifications by Similarity (RAMZIS), that uses similarity metrics to guide biomedical researchers to a more rigorous interpretation of glycoproteomics data. RAMZIS uses contextual similarity to assess the quality of mass spectral data and generates graphical output that demonstrates the likelihood of finding biologically significant differences in glycosylation abundance dataset. Investigators can assess dataset quality, holistically differentiate glycosites, and identify which glycopeptides are responsible for glycosylation pattern expression change. Herein RAMZIS approach is validated by theoretical cases and by a proof-of-concept application. RAMZIS enables comparison between datasets too stochastic, small, or sparse for interpolation while acknowledging these issues in its assessment. Using our tool, researchers will be able to rigor-ously define the role of glycosylation and the changes that occur during biological processes. Cold Spring Harbor Laboratory 2023-06-01 /pmc/articles/PMC10312533/ /pubmed/37398011 http://dx.doi.org/10.1101/2023.05.30.542895 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Hackett, William Edwin Chang, Deborah Carvalho, Luis Zaia, Joseph RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes |
title | RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes |
title_full | RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes |
title_fullStr | RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes |
title_full_unstemmed | RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes |
title_short | RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes |
title_sort | ramzis: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10312533/ https://www.ncbi.nlm.nih.gov/pubmed/37398011 http://dx.doi.org/10.1101/2023.05.30.542895 |
work_keys_str_mv | AT hackettwilliamedwin ramzisabioinformatictoolkitforrigorousassessmentofthealterationstoglycoproteinstructurethatoccurduringbiologicalprocesses AT changdeborah ramzisabioinformatictoolkitforrigorousassessmentofthealterationstoglycoproteinstructurethatoccurduringbiologicalprocesses AT carvalholuis ramzisabioinformatictoolkitforrigorousassessmentofthealterationstoglycoproteinstructurethatoccurduringbiologicalprocesses AT zaiajoseph ramzisabioinformatictoolkitforrigorousassessmentofthealterationstoglycoproteinstructurethatoccurduringbiologicalprocesses |