Cargando…

FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction

As a novel lactate-derived post-translational modification (PTM), lysine lactylation (Kla) is involved in diverse biological processes, and participates in human tumorigenesis. Identification of Kla substrates with their exact sites is crucial for revealing the molecular mechanisms of lactylation. I...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Peiran, Ning, Wanshan, Shi, Yunshu, Liu, Chuan, Mo, Saijun, Zhou, Haoran, Liu, Kangdong, Guo, Yaping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8385177/
https://www.ncbi.nlm.nih.gov/pubmed/34471495
http://dx.doi.org/10.1016/j.csbj.2021.08.013
_version_ 1783742038704914432
author Jiang, Peiran
Ning, Wanshan
Shi, Yunshu
Liu, Chuan
Mo, Saijun
Zhou, Haoran
Liu, Kangdong
Guo, Yaping
author_facet Jiang, Peiran
Ning, Wanshan
Shi, Yunshu
Liu, Chuan
Mo, Saijun
Zhou, Haoran
Liu, Kangdong
Guo, Yaping
author_sort Jiang, Peiran
collection PubMed
description As a novel lactate-derived post-translational modification (PTM), lysine lactylation (Kla) is involved in diverse biological processes, and participates in human tumorigenesis. Identification of Kla substrates with their exact sites is crucial for revealing the molecular mechanisms of lactylation. In contrast with labor-intensive and time-consuming experimental approaches, computational prediction of Kla could provide convenience and increased speed, but is still lacking. In this work, although current identified Kla sites are limited, we constructed the first Kla benchmark dataset and developed a few-shot learning-based architecture approach to leverage the power of small datasets and reduce the impact of imbalance and overfitting. A maximum 11.7% (0.745 versus 0.667) increase of area under the curve (AUC) value was achieved in contrast to conventional machine learning methods. We conducted a comprehensive survey of the performance by combining 8 sequence-based features and 3 structure-based features and tailored a multi-feature hybrid system for synergistic combination. This system achieved >16.2% improvement of the AUC value (0.889 versus 0.765) compared with single feature-based models for the prediction of Kla sites in silico. Taken few-shot learning and hybrid system together, we present our newly designed predictor named FSL-Kla, which is not only a cutting-edge tool for Kla site profile but also could generate candidates for further experimental approaches. The webserver of FSL-Kla is freely accessible for academic research at http://kla.zbiolab.cn/.
format Online
Article
Text
id pubmed-8385177
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-83851772021-08-31 FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction Jiang, Peiran Ning, Wanshan Shi, Yunshu Liu, Chuan Mo, Saijun Zhou, Haoran Liu, Kangdong Guo, Yaping Comput Struct Biotechnol J Research Article As a novel lactate-derived post-translational modification (PTM), lysine lactylation (Kla) is involved in diverse biological processes, and participates in human tumorigenesis. Identification of Kla substrates with their exact sites is crucial for revealing the molecular mechanisms of lactylation. In contrast with labor-intensive and time-consuming experimental approaches, computational prediction of Kla could provide convenience and increased speed, but is still lacking. In this work, although current identified Kla sites are limited, we constructed the first Kla benchmark dataset and developed a few-shot learning-based architecture approach to leverage the power of small datasets and reduce the impact of imbalance and overfitting. A maximum 11.7% (0.745 versus 0.667) increase of area under the curve (AUC) value was achieved in contrast to conventional machine learning methods. We conducted a comprehensive survey of the performance by combining 8 sequence-based features and 3 structure-based features and tailored a multi-feature hybrid system for synergistic combination. This system achieved >16.2% improvement of the AUC value (0.889 versus 0.765) compared with single feature-based models for the prediction of Kla sites in silico. Taken few-shot learning and hybrid system together, we present our newly designed predictor named FSL-Kla, which is not only a cutting-edge tool for Kla site profile but also could generate candidates for further experimental approaches. The webserver of FSL-Kla is freely accessible for academic research at http://kla.zbiolab.cn/. Research Network of Computational and Structural Biotechnology 2021-08-10 /pmc/articles/PMC8385177/ /pubmed/34471495 http://dx.doi.org/10.1016/j.csbj.2021.08.013 Text en © 2021 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Jiang, Peiran
Ning, Wanshan
Shi, Yunshu
Liu, Chuan
Mo, Saijun
Zhou, Haoran
Liu, Kangdong
Guo, Yaping
FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction
title FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction
title_full FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction
title_fullStr FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction
title_full_unstemmed FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction
title_short FSL-Kla: A few-shot learning-based multi-feature hybrid system for lactylation site prediction
title_sort fsl-kla: a few-shot learning-based multi-feature hybrid system for lactylation site prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8385177/
https://www.ncbi.nlm.nih.gov/pubmed/34471495
http://dx.doi.org/10.1016/j.csbj.2021.08.013
work_keys_str_mv AT jiangpeiran fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction
AT ningwanshan fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction
AT shiyunshu fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction
AT liuchuan fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction
AT mosaijun fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction
AT zhouhaoran fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction
AT liukangdong fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction
AT guoyaping fslklaafewshotlearningbasedmultifeaturehybridsystemforlactylationsiteprediction