Cargando…

Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry

Protein glycosylation is known to be involved in biological progresses such as cell recognition, growth, differentiation, and apoptosis. Fucosylation of glycoproteins plays an important role for structural stability and function of N-linked glycoproteins. Although many of biological and clinical stu...

Descripción completa

Detalles Bibliográficos
Autores principales: Hwang, Heeyoun, Jeong, Hoi Keun, Lee, Hyun Kyoung, Park, Gun Wook, Lee, Ju Yeon, Lee, Soo Youn, Kang, Young-Mook, An, Hyun Joo, Kang, Jeong Gu, Ko, Jeong-Heon, Kim, Jin Young, Yoo, Jong Shin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6962204/
https://www.ncbi.nlm.nih.gov/pubmed/31941975
http://dx.doi.org/10.1038/s41598-019-57274-1
_version_ 1783488116687896576
author Hwang, Heeyoun
Jeong, Hoi Keun
Lee, Hyun Kyoung
Park, Gun Wook
Lee, Ju Yeon
Lee, Soo Youn
Kang, Young-Mook
An, Hyun Joo
Kang, Jeong Gu
Ko, Jeong-Heon
Kim, Jin Young
Yoo, Jong Shin
author_facet Hwang, Heeyoun
Jeong, Hoi Keun
Lee, Hyun Kyoung
Park, Gun Wook
Lee, Ju Yeon
Lee, Soo Youn
Kang, Young-Mook
An, Hyun Joo
Kang, Jeong Gu
Ko, Jeong-Heon
Kim, Jin Young
Yoo, Jong Shin
author_sort Hwang, Heeyoun
collection PubMed
description Protein glycosylation is known to be involved in biological progresses such as cell recognition, growth, differentiation, and apoptosis. Fucosylation of glycoproteins plays an important role for structural stability and function of N-linked glycoproteins. Although many of biological and clinical studies of protein fucosylation by fucosyltransferases has been reported, structural classification of fucosylated N-glycoproteins such as core or outer isoforms remains a challenge. Here, we report for the first time the classification of N-glycopeptides as core- and outer-fucosylated types using tandem mass spectrometry (MS/MS) and machine learning algorithms such as the deep neural network (DNN) and support vector machine (SVM). Training and test sets of more than 800 MS/MS spectra of N-glycopeptides from the immunoglobulin gamma and alpha 1-acid-glycoprotein standards were selected for classification of the fucosylation types using supervised learning models. The best-performing model had an accuracy of more than 99% against manual characterization and area under the curve values greater than 0.99, which were calculated by probability scores from target and decoy datasets. Finally, this model was applied to classify fucosylated N-glycoproteins from human plasma. A total of 82N-glycopeptides, with 54 core-, 24 outer-, and 4 dual-fucosylation types derived from 54 glycoproteins, were commonly classified as the same type in both the DNN and SVM. Specifically, outer fucosylation was dominant in tri- and tetra-antennary N-glycopeptides, while core fucosylation was dominant in the mono-, bi-antennary and hybrid types of N-glycoproteins in human plasma. Thus, the machine learning methods can be combined with MS/MS to distinguish between different isoforms of fucosylated N-glycopeptides.
format Online
Article
Text
id pubmed-6962204
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-69622042020-01-23 Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry Hwang, Heeyoun Jeong, Hoi Keun Lee, Hyun Kyoung Park, Gun Wook Lee, Ju Yeon Lee, Soo Youn Kang, Young-Mook An, Hyun Joo Kang, Jeong Gu Ko, Jeong-Heon Kim, Jin Young Yoo, Jong Shin Sci Rep Article Protein glycosylation is known to be involved in biological progresses such as cell recognition, growth, differentiation, and apoptosis. Fucosylation of glycoproteins plays an important role for structural stability and function of N-linked glycoproteins. Although many of biological and clinical studies of protein fucosylation by fucosyltransferases has been reported, structural classification of fucosylated N-glycoproteins such as core or outer isoforms remains a challenge. Here, we report for the first time the classification of N-glycopeptides as core- and outer-fucosylated types using tandem mass spectrometry (MS/MS) and machine learning algorithms such as the deep neural network (DNN) and support vector machine (SVM). Training and test sets of more than 800 MS/MS spectra of N-glycopeptides from the immunoglobulin gamma and alpha 1-acid-glycoprotein standards were selected for classification of the fucosylation types using supervised learning models. The best-performing model had an accuracy of more than 99% against manual characterization and area under the curve values greater than 0.99, which were calculated by probability scores from target and decoy datasets. Finally, this model was applied to classify fucosylated N-glycoproteins from human plasma. A total of 82N-glycopeptides, with 54 core-, 24 outer-, and 4 dual-fucosylation types derived from 54 glycoproteins, were commonly classified as the same type in both the DNN and SVM. Specifically, outer fucosylation was dominant in tri- and tetra-antennary N-glycopeptides, while core fucosylation was dominant in the mono-, bi-antennary and hybrid types of N-glycoproteins in human plasma. Thus, the machine learning methods can be combined with MS/MS to distinguish between different isoforms of fucosylated N-glycopeptides. Nature Publishing Group UK 2020-01-15 /pmc/articles/PMC6962204/ /pubmed/31941975 http://dx.doi.org/10.1038/s41598-019-57274-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Hwang, Heeyoun
Jeong, Hoi Keun
Lee, Hyun Kyoung
Park, Gun Wook
Lee, Ju Yeon
Lee, Soo Youn
Kang, Young-Mook
An, Hyun Joo
Kang, Jeong Gu
Ko, Jeong-Heon
Kim, Jin Young
Yoo, Jong Shin
Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry
title Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry
title_full Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry
title_fullStr Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry
title_full_unstemmed Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry
title_short Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry
title_sort machine learning classifies core and outer fucosylation of n-glycoproteins using mass spectrometry
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6962204/
https://www.ncbi.nlm.nih.gov/pubmed/31941975
http://dx.doi.org/10.1038/s41598-019-57274-1
work_keys_str_mv AT hwangheeyoun machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT jeonghoikeun machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT leehyunkyoung machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT parkgunwook machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT leejuyeon machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT leesooyoun machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT kangyoungmook machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT anhyunjoo machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT kangjeonggu machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT kojeongheon machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT kimjinyoung machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry
AT yoojongshin machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry