Cargando…

Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning

To date, many experiments have revealed that the functional balance between hemagglutinin (HA) and neuraminidase (NA) plays a crucial role in viral mobility, production, and transmission. However, whether and how HA and NA maintain balance at the sequence level needs further investigation. Here, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, He, Zang, Yongjian, Zhao, Yizhen, Hao, Dongxiao, Kang, Ying, Zhang, Jianwen, Zhang, Zichen, Zhang, Lei, Yang, Zhiwei, Zhang, Shengli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8950662/
https://www.ncbi.nlm.nih.gov/pubmed/35336876
http://dx.doi.org/10.3390/v14030469
_version_ 1784675196314058752
author Wang, He
Zang, Yongjian
Zhao, Yizhen
Hao, Dongxiao
Kang, Ying
Zhang, Jianwen
Zhang, Zichen
Zhang, Lei
Yang, Zhiwei
Zhang, Shengli
author_facet Wang, He
Zang, Yongjian
Zhao, Yizhen
Hao, Dongxiao
Kang, Ying
Zhang, Jianwen
Zhang, Zichen
Zhang, Lei
Yang, Zhiwei
Zhang, Shengli
author_sort Wang, He
collection PubMed
description To date, many experiments have revealed that the functional balance between hemagglutinin (HA) and neuraminidase (NA) plays a crucial role in viral mobility, production, and transmission. However, whether and how HA and NA maintain balance at the sequence level needs further investigation. Here, we applied principal component analysis and hierarchical clustering analysis on thousands of HA and NA sequences of A/H1N1 and A/H3N2. We discovered significant coevolution between HA and NA at the sequence level, which is closely related to the type of host species and virus epidemic years. Furthermore, we propose a sequence-to-sequence transformer model (S2STM), which mainly consists of an encoder and a decoder that adopts a multi-head attention mechanism for establishing the mapping relationship between HA and NA sequences. The training results reveal that the S2STM can effectively realize the “translation” from HA to NA or vice versa, thereby building a relationship network between them. Our work combines unsupervised and supervised machine learning methods to identify the sequence matching between HA and NA, which will advance our understanding of IAVs’ evolution and also provide a novel idea for sequence analysis methods.
format Online
Article
Text
id pubmed-8950662
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89506622022-03-26 Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning Wang, He Zang, Yongjian Zhao, Yizhen Hao, Dongxiao Kang, Ying Zhang, Jianwen Zhang, Zichen Zhang, Lei Yang, Zhiwei Zhang, Shengli Viruses Article To date, many experiments have revealed that the functional balance between hemagglutinin (HA) and neuraminidase (NA) plays a crucial role in viral mobility, production, and transmission. However, whether and how HA and NA maintain balance at the sequence level needs further investigation. Here, we applied principal component analysis and hierarchical clustering analysis on thousands of HA and NA sequences of A/H1N1 and A/H3N2. We discovered significant coevolution between HA and NA at the sequence level, which is closely related to the type of host species and virus epidemic years. Furthermore, we propose a sequence-to-sequence transformer model (S2STM), which mainly consists of an encoder and a decoder that adopts a multi-head attention mechanism for establishing the mapping relationship between HA and NA sequences. The training results reveal that the S2STM can effectively realize the “translation” from HA to NA or vice versa, thereby building a relationship network between them. Our work combines unsupervised and supervised machine learning methods to identify the sequence matching between HA and NA, which will advance our understanding of IAVs’ evolution and also provide a novel idea for sequence analysis methods. MDPI 2022-02-23 /pmc/articles/PMC8950662/ /pubmed/35336876 http://dx.doi.org/10.3390/v14030469 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, He
Zang, Yongjian
Zhao, Yizhen
Hao, Dongxiao
Kang, Ying
Zhang, Jianwen
Zhang, Zichen
Zhang, Lei
Yang, Zhiwei
Zhang, Shengli
Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning
title Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning
title_full Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning
title_fullStr Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning
title_full_unstemmed Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning
title_short Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning
title_sort sequence matching between hemagglutinin and neuraminidase through sequence analysis using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8950662/
https://www.ncbi.nlm.nih.gov/pubmed/35336876
http://dx.doi.org/10.3390/v14030469
work_keys_str_mv AT wanghe sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning
AT zangyongjian sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning
AT zhaoyizhen sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning
AT haodongxiao sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning
AT kangying sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning
AT zhangjianwen sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning
AT zhangzichen sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning
AT zhanglei sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning
AT yangzhiwei sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning
AT zhangshengli sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning