Cargando…

TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution

The widespread of SARS-CoV-2 presents a significant threat to human society, as well as public health and economic development. Extensive efforts have been undertaken to battle against the pandemic, whereas effective approaches such as vaccination would be weakened by the continuous mutations, leadi...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Binbin, Zhou, Hang, Zhang, Xue, Xu, Xiaobin, Chai, Yi, Zheng, Zengwei, Kot, Alex Chichung, Zhou, Zhan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9747230/
https://www.ncbi.nlm.nih.gov/pubmed/36535209
http://dx.doi.org/10.1016/j.compbiomed.2022.106264
_version_ 1784849549412532224
author Zhou, Binbin
Zhou, Hang
Zhang, Xue
Xu, Xiaobin
Chai, Yi
Zheng, Zengwei
Kot, Alex Chichung
Zhou, Zhan
author_facet Zhou, Binbin
Zhou, Hang
Zhang, Xue
Xu, Xiaobin
Chai, Yi
Zheng, Zengwei
Kot, Alex Chichung
Zhou, Zhan
author_sort Zhou, Binbin
collection PubMed
description The widespread of SARS-CoV-2 presents a significant threat to human society, as well as public health and economic development. Extensive efforts have been undertaken to battle against the pandemic, whereas effective approaches such as vaccination would be weakened by the continuous mutations, leading to considerable attention being attracted to the mutation prediction. However, most previous studies lack attention to phylogenetics. In this paper, we propose a novel and effective model TEMPO for predicting the mutation of SARS-CoV-2 evolution. Specifically, we design a phylogenetic tree-based sampling method to generate sequence evolution data. Then, a transformer-based model is presented for the site mutation prediction after learning the high-level representation of these sequence data. We conduct experiments to verify the effectiveness of TEMPO, leveraging a large-scale SARS-CoV- 2 dataset. Experimental results show that TEMPO is effective for mutation prediction of SARS- CoV-2 evolution and outperforms several state-of-the-art baseline methods. We further perform mutation prediction experiments of other infectious viruses, to explore the feasibility and robustness of TEMPO, and experimental results verify its superiority. The codes and datasets are freely available at https://github.com/ZJUDataIntelligence/TEMPO.
format Online
Article
Text
id pubmed-9747230
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-97472302022-12-14 TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution Zhou, Binbin Zhou, Hang Zhang, Xue Xu, Xiaobin Chai, Yi Zheng, Zengwei Kot, Alex Chichung Zhou, Zhan Comput Biol Med Article The widespread of SARS-CoV-2 presents a significant threat to human society, as well as public health and economic development. Extensive efforts have been undertaken to battle against the pandemic, whereas effective approaches such as vaccination would be weakened by the continuous mutations, leading to considerable attention being attracted to the mutation prediction. However, most previous studies lack attention to phylogenetics. In this paper, we propose a novel and effective model TEMPO for predicting the mutation of SARS-CoV-2 evolution. Specifically, we design a phylogenetic tree-based sampling method to generate sequence evolution data. Then, a transformer-based model is presented for the site mutation prediction after learning the high-level representation of these sequence data. We conduct experiments to verify the effectiveness of TEMPO, leveraging a large-scale SARS-CoV- 2 dataset. Experimental results show that TEMPO is effective for mutation prediction of SARS- CoV-2 evolution and outperforms several state-of-the-art baseline methods. We further perform mutation prediction experiments of other infectious viruses, to explore the feasibility and robustness of TEMPO, and experimental results verify its superiority. The codes and datasets are freely available at https://github.com/ZJUDataIntelligence/TEMPO. Elsevier Ltd. 2023-01 2022-12-14 /pmc/articles/PMC9747230/ /pubmed/36535209 http://dx.doi.org/10.1016/j.compbiomed.2022.106264 Text en © 2022 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Zhou, Binbin
Zhou, Hang
Zhang, Xue
Xu, Xiaobin
Chai, Yi
Zheng, Zengwei
Kot, Alex Chichung
Zhou, Zhan
TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution
title TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution
title_full TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution
title_fullStr TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution
title_full_unstemmed TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution
title_short TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution
title_sort tempo: a transformer-based mutation prediction framework for sars-cov-2 evolution
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9747230/
https://www.ncbi.nlm.nih.gov/pubmed/36535209
http://dx.doi.org/10.1016/j.compbiomed.2022.106264
work_keys_str_mv AT zhoubinbin tempoatransformerbasedmutationpredictionframeworkforsarscov2evolution
AT zhouhang tempoatransformerbasedmutationpredictionframeworkforsarscov2evolution
AT zhangxue tempoatransformerbasedmutationpredictionframeworkforsarscov2evolution
AT xuxiaobin tempoatransformerbasedmutationpredictionframeworkforsarscov2evolution
AT chaiyi tempoatransformerbasedmutationpredictionframeworkforsarscov2evolution
AT zhengzengwei tempoatransformerbasedmutationpredictionframeworkforsarscov2evolution
AT kotalexchichung tempoatransformerbasedmutationpredictionframeworkforsarscov2evolution
AT zhouzhan tempoatransformerbasedmutationpredictionframeworkforsarscov2evolution