Cargando…

DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism

The rapid spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) around the world affects the normal lives of people all over the world. The computational methods can be used to accurately identify SARS-CoV-2 phosphorylation sites. In this paper, a new prediction model of SARS-CoV-2...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Minghui, Yan, Lu, Jia, Jihua, Lai, Jiali, Zhou, Hongyan, Yu, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10140648/
https://www.ncbi.nlm.nih.gov/pubmed/37120990
http://dx.doi.org/10.1016/j.compbiomed.2023.106935
_version_ 1785033207405608960
author Wang, Minghui
Yan, Lu
Jia, Jihua
Lai, Jiali
Zhou, Hongyan
Yu, Bin
author_facet Wang, Minghui
Yan, Lu
Jia, Jihua
Lai, Jiali
Zhou, Hongyan
Yu, Bin
author_sort Wang, Minghui
collection PubMed
description The rapid spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) around the world affects the normal lives of people all over the world. The computational methods can be used to accurately identify SARS-CoV-2 phosphorylation sites. In this paper, a new prediction model of SARS-CoV-2 phosphorylation sites, called DE-MHAIPs, is proposed. First, we use six feature extraction methods to extract protein sequence information from different perspectives. For the first time, we use a differential evolution (DE) algorithm to learn individual feature weights and fuse multi-information in a weighted combination. Next, Group LASSO is used to select a subset of good features. Then, the important protein information is given higher weight through multi-head attention. After that, the processed data is fed into long short-term memory network (LSTM) to further enhance model's ability to learn features. Finally, the data from LSTM are input into fully connected neural network (FCN) to predict SARS-CoV-2 phosphorylation sites. The AUC values of the S/T and Y datasets under 5-fold cross-validation reach 91.98% and 98.32%, respectively. The AUC values of the two datasets on the independent test set reach 91.72% and 97.78%, respectively. The experimental results show that the DE-MHAIPs method exhibits excellent predictive ability compared with other methods.
format Online
Article
Text
id pubmed-10140648
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-101406482023-04-28 DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism Wang, Minghui Yan, Lu Jia, Jihua Lai, Jiali Zhou, Hongyan Yu, Bin Comput Biol Med Article The rapid spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) around the world affects the normal lives of people all over the world. The computational methods can be used to accurately identify SARS-CoV-2 phosphorylation sites. In this paper, a new prediction model of SARS-CoV-2 phosphorylation sites, called DE-MHAIPs, is proposed. First, we use six feature extraction methods to extract protein sequence information from different perspectives. For the first time, we use a differential evolution (DE) algorithm to learn individual feature weights and fuse multi-information in a weighted combination. Next, Group LASSO is used to select a subset of good features. Then, the important protein information is given higher weight through multi-head attention. After that, the processed data is fed into long short-term memory network (LSTM) to further enhance model's ability to learn features. Finally, the data from LSTM are input into fully connected neural network (FCN) to predict SARS-CoV-2 phosphorylation sites. The AUC values of the S/T and Y datasets under 5-fold cross-validation reach 91.98% and 98.32%, respectively. The AUC values of the two datasets on the independent test set reach 91.72% and 97.78%, respectively. The experimental results show that the DE-MHAIPs method exhibits excellent predictive ability compared with other methods. Elsevier Ltd. 2023-06 2023-04-14 /pmc/articles/PMC10140648/ /pubmed/37120990 http://dx.doi.org/10.1016/j.compbiomed.2023.106935 Text en © 2023 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Wang, Minghui
Yan, Lu
Jia, Jihua
Lai, Jiali
Zhou, Hongyan
Yu, Bin
DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism
title DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism
title_full DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism
title_fullStr DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism
title_full_unstemmed DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism
title_short DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism
title_sort de-mhaips: identification of sars-cov-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10140648/
https://www.ncbi.nlm.nih.gov/pubmed/37120990
http://dx.doi.org/10.1016/j.compbiomed.2023.106935
work_keys_str_mv AT wangminghui demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism
AT yanlu demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism
AT jiajihua demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism
AT laijiali demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism
AT zhouhongyan demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism
AT yubin demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism