Cargando…
FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction
As a novel lactate-derived post-translational modification (PTM), lysine lactylation (Kla) is involved in diverse biological processes, and participates in human tumorigenesis. Identification of Kla substrates with their exact sites is crucial for revealing the molecular mechanisms of lactylation. I...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8385177/ https://www.ncbi.nlm.nih.gov/pubmed/34471495 http://dx.doi.org/10.1016/j.csbj.2021.08.013 |
_version_ | 1783742038704914432 |
---|---|
author | Jiang, Peiran Ning, Wanshan Shi, Yunshu Liu, Chuan Mo, Saijun Zhou, Haoran Liu, Kangdong Guo, Yaping |
author_facet | Jiang, Peiran Ning, Wanshan Shi, Yunshu Liu, Chuan Mo, Saijun Zhou, Haoran Liu, Kangdong Guo, Yaping |
author_sort | Jiang, Peiran |
collection | PubMed |
description | As a novel lactate-derived post-translational modification (PTM), lysine lactylation (Kla) is involved in diverse biological processes, and participates in human tumorigenesis. Identification of Kla substrates with their exact sites is crucial for revealing the molecular mechanisms of lactylation. In contrast with labor-intensive and time-consuming experimental approaches, computational prediction of Kla could provide convenience and increased speed, but is still lacking. In this work, although current identified Kla sites are limited, we constructed the first Kla benchmark dataset and developed a few-shot learning-based architecture approach to leverage the power of small datasets and reduce the impact of imbalance and overfitting. A maximum 11.7% (0.745 versus 0.667) increase of area under the curve (AUC) value was achieved in contrast to conventional machine learning methods. We conducted a comprehensive survey of the performance by combining 8 sequence-based features and 3 structure-based features and tailored a multi-feature hybrid system for synergistic combination. This system achieved >16.2% improvement of the AUC value (0.889 versus 0.765) compared with single feature-based models for the prediction of Kla sites in silico. Taken few-shot learning and hybrid system together, we present our newly designed predictor named FSL-Kla, which is not only a cutting-edge tool for Kla site profile but also could generate candidates for further experimental approaches. The webserver of FSL-Kla is freely accessible for academic research at http://kla.zbiolab.cn/. |
format | Online Article Text |
id | pubmed-8385177 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-83851772021-08-31 FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction Jiang, Peiran Ning, Wanshan Shi, Yunshu Liu, Chuan Mo, Saijun Zhou, Haoran Liu, Kangdong Guo, Yaping Comput Struct Biotechnol J Research Article As a novel lactate-derived post-translational modification (PTM), lysine lactylation (Kla) is involved in diverse biological processes, and participates in human tumorigenesis. Identification of Kla substrates with their exact sites is crucial for revealing the molecular mechanisms of lactylation. In contrast with labor-intensive and time-consuming experimental approaches, computational prediction of Kla could provide convenience and increased speed, but is still lacking. In this work, although current identified Kla sites are limited, we constructed the first Kla benchmark dataset and developed a few-shot learning-based architecture approach to leverage the power of small datasets and reduce the impact of imbalance and overfitting. A maximum 11.7% (0.745 versus 0.667) increase of area under the curve (AUC) value was achieved in contrast to conventional machine learning methods. We conducted a comprehensive survey of the performance by combining 8 sequence-based features and 3 structure-based features and tailored a multi-feature hybrid system for synergistic combination. This system achieved >16.2% improvement of the AUC value (0.889 versus 0.765) compared with single feature-based models for the prediction of Kla sites in silico. Taken few-shot learning and hybrid system together, we present our newly designed predictor named FSL-Kla, which is not only a cutting-edge tool for Kla site profile but also could generate candidates for further experimental approaches. The webserver of FSL-Kla is freely accessible for academic research at http://kla.zbiolab.cn/. Research Network of Computational and Structural Biotechnology 2021-08-10 /pmc/articles/PMC8385177/ /pubmed/34471495 http://dx.doi.org/10.1016/j.csbj.2021.08.013 Text en © 2021 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Jiang, Peiran Ning, Wanshan Shi, Yunshu Liu, Chuan Mo, Saijun Zhou, Haoran Liu, Kangdong Guo, Yaping FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction |
title | FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction |
title_full | FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction |
title_fullStr | FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction |
title_full_unstemmed | FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction |
title_short | FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction |
title_sort | fsl-kla: a few-shot learning-based multi-feature hybrid system for lactylation site prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8385177/ https://www.ncbi.nlm.nih.gov/pubmed/34471495 http://dx.doi.org/10.1016/j.csbj.2021.08.013 |
work_keys_str_mv | AT jiangpeiran fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction AT ningwanshan fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction AT shiyunshu fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction AT liuchuan fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction AT mosaijun fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction AT zhouhaoran fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction AT liukangdong fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction AT guoyaping fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction |