Cargando…
Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry
Protein glycosylation is known to be involved in biological progresses such as cell recognition, growth, differentiation, and apoptosis. Fucosylation of glycoproteins plays an important role for structural stability and function of N-linked glycoproteins. Although many of biological and clinical stu...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6962204/ https://www.ncbi.nlm.nih.gov/pubmed/31941975 http://dx.doi.org/10.1038/s41598-019-57274-1 |
_version_ | 1783488116687896576 |
---|---|
author | Hwang, Heeyoun Jeong, Hoi Keun Lee, Hyun Kyoung Park, Gun Wook Lee, Ju Yeon Lee, Soo Youn Kang, Young-Mook An, Hyun Joo Kang, Jeong Gu Ko, Jeong-Heon Kim, Jin Young Yoo, Jong Shin |
author_facet | Hwang, Heeyoun Jeong, Hoi Keun Lee, Hyun Kyoung Park, Gun Wook Lee, Ju Yeon Lee, Soo Youn Kang, Young-Mook An, Hyun Joo Kang, Jeong Gu Ko, Jeong-Heon Kim, Jin Young Yoo, Jong Shin |
author_sort | Hwang, Heeyoun |
collection | PubMed |
description | Protein glycosylation is known to be involved in biological progresses such as cell recognition, growth, differentiation, and apoptosis. Fucosylation of glycoproteins plays an important role for structural stability and function of N-linked glycoproteins. Although many of biological and clinical studies of protein fucosylation by fucosyltransferases has been reported, structural classification of fucosylated N-glycoproteins such as core or outer isoforms remains a challenge. Here, we report for the first time the classification of N-glycopeptides as core- and outer-fucosylated types using tandem mass spectrometry (MS/MS) and machine learning algorithms such as the deep neural network (DNN) and support vector machine (SVM). Training and test sets of more than 800 MS/MS spectra of N-glycopeptides from the immunoglobulin gamma and alpha 1-acid-glycoprotein standards were selected for classification of the fucosylation types using supervised learning models. The best-performing model had an accuracy of more than 99% against manual characterization and area under the curve values greater than 0.99, which were calculated by probability scores from target and decoy datasets. Finally, this model was applied to classify fucosylated N-glycoproteins from human plasma. A total of 82N-glycopeptides, with 54 core-, 24 outer-, and 4 dual-fucosylation types derived from 54 glycoproteins, were commonly classified as the same type in both the DNN and SVM. Specifically, outer fucosylation was dominant in tri- and tetra-antennary N-glycopeptides, while core fucosylation was dominant in the mono-, bi-antennary and hybrid types of N-glycoproteins in human plasma. Thus, the machine learning methods can be combined with MS/MS to distinguish between different isoforms of fucosylated N-glycopeptides. |
format | Online Article Text |
id | pubmed-6962204 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-69622042020-01-23 Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry Hwang, Heeyoun Jeong, Hoi Keun Lee, Hyun Kyoung Park, Gun Wook Lee, Ju Yeon Lee, Soo Youn Kang, Young-Mook An, Hyun Joo Kang, Jeong Gu Ko, Jeong-Heon Kim, Jin Young Yoo, Jong Shin Sci Rep Article Protein glycosylation is known to be involved in biological progresses such as cell recognition, growth, differentiation, and apoptosis. Fucosylation of glycoproteins plays an important role for structural stability and function of N-linked glycoproteins. Although many of biological and clinical studies of protein fucosylation by fucosyltransferases has been reported, structural classification of fucosylated N-glycoproteins such as core or outer isoforms remains a challenge. Here, we report for the first time the classification of N-glycopeptides as core- and outer-fucosylated types using tandem mass spectrometry (MS/MS) and machine learning algorithms such as the deep neural network (DNN) and support vector machine (SVM). Training and test sets of more than 800 MS/MS spectra of N-glycopeptides from the immunoglobulin gamma and alpha 1-acid-glycoprotein standards were selected for classification of the fucosylation types using supervised learning models. The best-performing model had an accuracy of more than 99% against manual characterization and area under the curve values greater than 0.99, which were calculated by probability scores from target and decoy datasets. Finally, this model was applied to classify fucosylated N-glycoproteins from human plasma. A total of 82N-glycopeptides, with 54 core-, 24 outer-, and 4 dual-fucosylation types derived from 54 glycoproteins, were commonly classified as the same type in both the DNN and SVM. Specifically, outer fucosylation was dominant in tri- and tetra-antennary N-glycopeptides, while core fucosylation was dominant in the mono-, bi-antennary and hybrid types of N-glycoproteins in human plasma. Thus, the machine learning methods can be combined with MS/MS to distinguish between different isoforms of fucosylated N-glycopeptides. Nature Publishing Group UK 2020-01-15 /pmc/articles/PMC6962204/ /pubmed/31941975 http://dx.doi.org/10.1038/s41598-019-57274-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Hwang, Heeyoun Jeong, Hoi Keun Lee, Hyun Kyoung Park, Gun Wook Lee, Ju Yeon Lee, Soo Youn Kang, Young-Mook An, Hyun Joo Kang, Jeong Gu Ko, Jeong-Heon Kim, Jin Young Yoo, Jong Shin Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry |
title | Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry |
title_full | Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry |
title_fullStr | Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry |
title_full_unstemmed | Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry |
title_short | Machine Learning Classifies Core and Outer Fucosylation of N-Glycoproteins Using Mass Spectrometry |
title_sort | machine learning classifies core and outer fucosylation of n-glycoproteins using mass spectrometry |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6962204/ https://www.ncbi.nlm.nih.gov/pubmed/31941975 http://dx.doi.org/10.1038/s41598-019-57274-1 |
work_keys_str_mv | AT hwangheeyoun machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT jeonghoikeun machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT leehyunkyoung machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT parkgunwook machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT leejuyeon machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT leesooyoun machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT kangyoungmook machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT anhyunjoo machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT kangjeonggu machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT kojeongheon machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT kimjinyoung machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry AT yoojongshin machinelearningclassifiescoreandouterfucosylationofnglycoproteinsusingmassspectrometry |