Cargando…
Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning
To date, many experiments have revealed that the functional balance between hemagglutinin (HA) and neuraminidase (NA) plays a crucial role in viral mobility, production, and transmission. However, whether and how HA and NA maintain balance at the sequence level needs further investigation. Here, we...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8950662/ https://www.ncbi.nlm.nih.gov/pubmed/35336876 http://dx.doi.org/10.3390/v14030469 |
_version_ | 1784675196314058752 |
---|---|
author | Wang, He Zang, Yongjian Zhao, Yizhen Hao, Dongxiao Kang, Ying Zhang, Jianwen Zhang, Zichen Zhang, Lei Yang, Zhiwei Zhang, Shengli |
author_facet | Wang, He Zang, Yongjian Zhao, Yizhen Hao, Dongxiao Kang, Ying Zhang, Jianwen Zhang, Zichen Zhang, Lei Yang, Zhiwei Zhang, Shengli |
author_sort | Wang, He |
collection | PubMed |
description | To date, many experiments have revealed that the functional balance between hemagglutinin (HA) and neuraminidase (NA) plays a crucial role in viral mobility, production, and transmission. However, whether and how HA and NA maintain balance at the sequence level needs further investigation. Here, we applied principal component analysis and hierarchical clustering analysis on thousands of HA and NA sequences of A/H1N1 and A/H3N2. We discovered significant coevolution between HA and NA at the sequence level, which is closely related to the type of host species and virus epidemic years. Furthermore, we propose a sequence-to-sequence transformer model (S2STM), which mainly consists of an encoder and a decoder that adopts a multi-head attention mechanism for establishing the mapping relationship between HA and NA sequences. The training results reveal that the S2STM can effectively realize the “translation” from HA to NA or vice versa, thereby building a relationship network between them. Our work combines unsupervised and supervised machine learning methods to identify the sequence matching between HA and NA, which will advance our understanding of IAVs’ evolution and also provide a novel idea for sequence analysis methods. |
format | Online Article Text |
id | pubmed-8950662 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-89506622022-03-26 Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning Wang, He Zang, Yongjian Zhao, Yizhen Hao, Dongxiao Kang, Ying Zhang, Jianwen Zhang, Zichen Zhang, Lei Yang, Zhiwei Zhang, Shengli Viruses Article To date, many experiments have revealed that the functional balance between hemagglutinin (HA) and neuraminidase (NA) plays a crucial role in viral mobility, production, and transmission. However, whether and how HA and NA maintain balance at the sequence level needs further investigation. Here, we applied principal component analysis and hierarchical clustering analysis on thousands of HA and NA sequences of A/H1N1 and A/H3N2. We discovered significant coevolution between HA and NA at the sequence level, which is closely related to the type of host species and virus epidemic years. Furthermore, we propose a sequence-to-sequence transformer model (S2STM), which mainly consists of an encoder and a decoder that adopts a multi-head attention mechanism for establishing the mapping relationship between HA and NA sequences. The training results reveal that the S2STM can effectively realize the “translation” from HA to NA or vice versa, thereby building a relationship network between them. Our work combines unsupervised and supervised machine learning methods to identify the sequence matching between HA and NA, which will advance our understanding of IAVs’ evolution and also provide a novel idea for sequence analysis methods. MDPI 2022-02-23 /pmc/articles/PMC8950662/ /pubmed/35336876 http://dx.doi.org/10.3390/v14030469 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, He Zang, Yongjian Zhao, Yizhen Hao, Dongxiao Kang, Ying Zhang, Jianwen Zhang, Zichen Zhang, Lei Yang, Zhiwei Zhang, Shengli Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning |
title | Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning |
title_full | Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning |
title_fullStr | Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning |
title_full_unstemmed | Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning |
title_short | Sequence Matching between Hemagglutinin and Neuraminidase through Sequence Analysis Using Machine Learning |
title_sort | sequence matching between hemagglutinin and neuraminidase through sequence analysis using machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8950662/ https://www.ncbi.nlm.nih.gov/pubmed/35336876 http://dx.doi.org/10.3390/v14030469 |
work_keys_str_mv | AT wanghe sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning AT zangyongjian sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning AT zhaoyizhen sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning AT haodongxiao sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning AT kangying sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning AT zhangjianwen sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning AT zhangzichen sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning AT zhanglei sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning AT yangzhiwei sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning AT zhangshengli sequencematchingbetweenhemagglutininandneuraminidasethroughsequenceanalysisusingmachinelearning |