Cargando…
DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism
The rapid spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) around the world affects the normal lives of people all over the world. The computational methods can be used to accurately identify SARS-CoV-2 phosphorylation sites. In this paper, a new prediction model of SARS-CoV-2...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10140648/ https://www.ncbi.nlm.nih.gov/pubmed/37120990 http://dx.doi.org/10.1016/j.compbiomed.2023.106935 |
_version_ | 1785033207405608960 |
---|---|
author | Wang, Minghui Yan, Lu Jia, Jihua Lai, Jiali Zhou, Hongyan Yu, Bin |
author_facet | Wang, Minghui Yan, Lu Jia, Jihua Lai, Jiali Zhou, Hongyan Yu, Bin |
author_sort | Wang, Minghui |
collection | PubMed |
description | The rapid spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) around the world affects the normal lives of people all over the world. The computational methods can be used to accurately identify SARS-CoV-2 phosphorylation sites. In this paper, a new prediction model of SARS-CoV-2 phosphorylation sites, called DE-MHAIPs, is proposed. First, we use six feature extraction methods to extract protein sequence information from different perspectives. For the first time, we use a differential evolution (DE) algorithm to learn individual feature weights and fuse multi-information in a weighted combination. Next, Group LASSO is used to select a subset of good features. Then, the important protein information is given higher weight through multi-head attention. After that, the processed data is fed into long short-term memory network (LSTM) to further enhance model's ability to learn features. Finally, the data from LSTM are input into fully connected neural network (FCN) to predict SARS-CoV-2 phosphorylation sites. The AUC values of the S/T and Y datasets under 5-fold cross-validation reach 91.98% and 98.32%, respectively. The AUC values of the two datasets on the independent test set reach 91.72% and 97.78%, respectively. The experimental results show that the DE-MHAIPs method exhibits excellent predictive ability compared with other methods. |
format | Online Article Text |
id | pubmed-10140648 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-101406482023-04-28 DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism Wang, Minghui Yan, Lu Jia, Jihua Lai, Jiali Zhou, Hongyan Yu, Bin Comput Biol Med Article The rapid spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) around the world affects the normal lives of people all over the world. The computational methods can be used to accurately identify SARS-CoV-2 phosphorylation sites. In this paper, a new prediction model of SARS-CoV-2 phosphorylation sites, called DE-MHAIPs, is proposed. First, we use six feature extraction methods to extract protein sequence information from different perspectives. For the first time, we use a differential evolution (DE) algorithm to learn individual feature weights and fuse multi-information in a weighted combination. Next, Group LASSO is used to select a subset of good features. Then, the important protein information is given higher weight through multi-head attention. After that, the processed data is fed into long short-term memory network (LSTM) to further enhance model's ability to learn features. Finally, the data from LSTM are input into fully connected neural network (FCN) to predict SARS-CoV-2 phosphorylation sites. The AUC values of the S/T and Y datasets under 5-fold cross-validation reach 91.98% and 98.32%, respectively. The AUC values of the two datasets on the independent test set reach 91.72% and 97.78%, respectively. The experimental results show that the DE-MHAIPs method exhibits excellent predictive ability compared with other methods. Elsevier Ltd. 2023-06 2023-04-14 /pmc/articles/PMC10140648/ /pubmed/37120990 http://dx.doi.org/10.1016/j.compbiomed.2023.106935 Text en © 2023 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Wang, Minghui Yan, Lu Jia, Jihua Lai, Jiali Zhou, Hongyan Yu, Bin DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism |
title | DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism |
title_full | DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism |
title_fullStr | DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism |
title_full_unstemmed | DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism |
title_short | DE-MHAIPs: Identification of SARS-CoV-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism |
title_sort | de-mhaips: identification of sars-cov-2 phosphorylation sites based on differential evolution multi-feature learning and multi-head attention mechanism |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10140648/ https://www.ncbi.nlm.nih.gov/pubmed/37120990 http://dx.doi.org/10.1016/j.compbiomed.2023.106935 |
work_keys_str_mv | AT wangminghui demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism AT yanlu demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism AT jiajihua demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism AT laijiali demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism AT zhouhongyan demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism AT yubin demhaipsidentificationofsarscov2phosphorylationsitesbasedondifferentialevolutionmultifeaturelearningandmultiheadattentionmechanism |