Cargando…

Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin

BACKGROUND: In addition to causing the pandemic influenza outbreaks of 1918 and 2009, subtype H1N1 influenza A viruses (IAVs) have caused seasonal epidemics since 1977. Antigenic property of influenza viruses are determined by both protein sequence and N-linked glycosylation of influenza glycoprotei...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Lei, Chang, Deborah, Han, Lei, Zhang, Xiaojian, Zaia, Joseph, Wan, Xiu-Feng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7216668/
https://www.ncbi.nlm.nih.gov/pubmed/32393178
http://dx.doi.org/10.1186/s12859-020-3527-5
_version_ 1783532459591204864
author Li, Lei
Chang, Deborah
Han, Lei
Zhang, Xiaojian
Zaia, Joseph
Wan, Xiu-Feng
author_facet Li, Lei
Chang, Deborah
Han, Lei
Zhang, Xiaojian
Zaia, Joseph
Wan, Xiu-Feng
author_sort Li, Lei
collection PubMed
description BACKGROUND: In addition to causing the pandemic influenza outbreaks of 1918 and 2009, subtype H1N1 influenza A viruses (IAVs) have caused seasonal epidemics since 1977. Antigenic property of influenza viruses are determined by both protein sequence and N-linked glycosylation of influenza glycoproteins, especially hemagglutinin (HA). The currently available computational methods are only considered features in protein sequence but not N-linked glycosylation. RESULTS: A multi-task learning sparse group least absolute shrinkage and selection operator (LASSO) (MTL-SGL) regression method was developed and applied to derive two types of predominant features including protein sequence and N-linked glycosylation in hemagglutinin (HA) affecting variations in serologic data for human and swine H1N1 IAVs. Results suggested that mutations and changes in N-linked glycosylation sites are associated with the rise of antigenic variants of H1N1 IAVs. Furthermore, the implicated mutations are predominantly located at five reported antibody-binding sites, and within or close to the HA receptor binding site. All of the three N-linked glycosylation sites (i.e. sequons NCSV at HA 54, NHTV at HA 125, and NLSK at HA 160) identified by MTL-SGL to determine antigenic changes were experimentally validated in the H1N1 antigenic variants using mass spectrometry analyses. Compared with conventional sparse learning methods, MTL-SGL achieved a lower prediction error and higher accuracy, indicating that grouped features and MTL in the MTL-SGL method are not only able to handle serologic data generated from multiple reagents, supplies, and protocols, but also perform better in genetic sequence-based antigenic quantification. CONCLUSIONS: In summary, the results of this study suggest that mutations and variations in N-glycosylation in HA caused antigenic variations in H1N1 IAVs and that the sequence-based antigenicity predictive model will be useful in understanding antigenic evolution of IAVs.
format Online
Article
Text
id pubmed-7216668
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72166682020-05-15 Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin Li, Lei Chang, Deborah Han, Lei Zhang, Xiaojian Zaia, Joseph Wan, Xiu-Feng BMC Bioinformatics Methodology Article BACKGROUND: In addition to causing the pandemic influenza outbreaks of 1918 and 2009, subtype H1N1 influenza A viruses (IAVs) have caused seasonal epidemics since 1977. Antigenic property of influenza viruses are determined by both protein sequence and N-linked glycosylation of influenza glycoproteins, especially hemagglutinin (HA). The currently available computational methods are only considered features in protein sequence but not N-linked glycosylation. RESULTS: A multi-task learning sparse group least absolute shrinkage and selection operator (LASSO) (MTL-SGL) regression method was developed and applied to derive two types of predominant features including protein sequence and N-linked glycosylation in hemagglutinin (HA) affecting variations in serologic data for human and swine H1N1 IAVs. Results suggested that mutations and changes in N-linked glycosylation sites are associated with the rise of antigenic variants of H1N1 IAVs. Furthermore, the implicated mutations are predominantly located at five reported antibody-binding sites, and within or close to the HA receptor binding site. All of the three N-linked glycosylation sites (i.e. sequons NCSV at HA 54, NHTV at HA 125, and NLSK at HA 160) identified by MTL-SGL to determine antigenic changes were experimentally validated in the H1N1 antigenic variants using mass spectrometry analyses. Compared with conventional sparse learning methods, MTL-SGL achieved a lower prediction error and higher accuracy, indicating that grouped features and MTL in the MTL-SGL method are not only able to handle serologic data generated from multiple reagents, supplies, and protocols, but also perform better in genetic sequence-based antigenic quantification. CONCLUSIONS: In summary, the results of this study suggest that mutations and variations in N-glycosylation in HA caused antigenic variations in H1N1 IAVs and that the sequence-based antigenicity predictive model will be useful in understanding antigenic evolution of IAVs. BioMed Central 2020-05-11 /pmc/articles/PMC7216668/ /pubmed/32393178 http://dx.doi.org/10.1186/s12859-020-3527-5 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Li, Lei
Chang, Deborah
Han, Lei
Zhang, Xiaojian
Zaia, Joseph
Wan, Xiu-Feng
Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin
title Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin
title_full Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin
title_fullStr Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin
title_full_unstemmed Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin
title_short Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin
title_sort multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza a(h1n1) virus using mutations and variations in glycosylation of hemagglutinin
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7216668/
https://www.ncbi.nlm.nih.gov/pubmed/32393178
http://dx.doi.org/10.1186/s12859-020-3527-5
work_keys_str_mv AT lilei multitasklearningsparsegrouplassoamethodforquantifyingantigenicityofinfluenzaah1n1virususingmutationsandvariationsinglycosylationofhemagglutinin
AT changdeborah multitasklearningsparsegrouplassoamethodforquantifyingantigenicityofinfluenzaah1n1virususingmutationsandvariationsinglycosylationofhemagglutinin
AT hanlei multitasklearningsparsegrouplassoamethodforquantifyingantigenicityofinfluenzaah1n1virususingmutationsandvariationsinglycosylationofhemagglutinin
AT zhangxiaojian multitasklearningsparsegrouplassoamethodforquantifyingantigenicityofinfluenzaah1n1virususingmutationsandvariationsinglycosylationofhemagglutinin
AT zaiajoseph multitasklearningsparsegrouplassoamethodforquantifyingantigenicityofinfluenzaah1n1virususingmutationsandvariationsinglycosylationofhemagglutinin
AT wanxiufeng multitasklearningsparsegrouplassoamethodforquantifyingantigenicityofinfluenzaah1n1virususingmutationsandvariationsinglycosylationofhemagglutinin