Cargando…

Deep learning-based multi-functional therapeutic peptides prediction with a multi-label focal dice loss function

MOTIVATION: With the great number of peptide sequences produced in the postgenomic era, it is highly desirable to identify the various functions of therapeutic peptides quickly. Furthermore, it is a great challenge to predict accurate multi-functional therapeutic peptides (MFTP) via sequence-based c...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Henghui, Yan, Wenhui, Wang, Lihua, Liu, Jie, Bin, Yannan, Xia, Junfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10234765/
https://www.ncbi.nlm.nih.gov/pubmed/37216900
http://dx.doi.org/10.1093/bioinformatics/btad334
_version_ 1785052567456186368
author Fan, Henghui
Yan, Wenhui
Wang, Lihua
Liu, Jie
Bin, Yannan
Xia, Junfeng
author_facet Fan, Henghui
Yan, Wenhui
Wang, Lihua
Liu, Jie
Bin, Yannan
Xia, Junfeng
author_sort Fan, Henghui
collection PubMed
description MOTIVATION: With the great number of peptide sequences produced in the postgenomic era, it is highly desirable to identify the various functions of therapeutic peptides quickly. Furthermore, it is a great challenge to predict accurate multi-functional therapeutic peptides (MFTP) via sequence-based computational tools. RESULTS: Here, we propose a novel multi-label-based method, named ETFC, to predict 21 categories of therapeutic peptides. The method utilizes a deep learning-based model architecture, which consists of four blocks: embedding, text convolutional neural network, feed-forward network, and classification blocks. This method also adopts an imbalanced learning strategy with a novel multi-label focal dice loss function. multi-label focal dice loss is applied in the ETFC method to solve the inherent imbalance problem in the multi-label dataset and achieve competitive performance. The experimental results state that the ETFC method is significantly better than the existing methods for MFTP prediction. With the established framework, we use the teacher–student-based knowledge distillation to obtain the attention weight from the self-attention mechanism in the MFTP prediction and quantify their contributions toward each of the investigated activities. AVAILABILITY AND IMPLEMENTATION: The source code and dataset are available via: https://github.com/xialab-ahu/ETFC.
format Online
Article
Text
id pubmed-10234765
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102347652023-06-02 Deep learning-based multi-functional therapeutic peptides prediction with a multi-label focal dice loss function Fan, Henghui Yan, Wenhui Wang, Lihua Liu, Jie Bin, Yannan Xia, Junfeng Bioinformatics Original Paper MOTIVATION: With the great number of peptide sequences produced in the postgenomic era, it is highly desirable to identify the various functions of therapeutic peptides quickly. Furthermore, it is a great challenge to predict accurate multi-functional therapeutic peptides (MFTP) via sequence-based computational tools. RESULTS: Here, we propose a novel multi-label-based method, named ETFC, to predict 21 categories of therapeutic peptides. The method utilizes a deep learning-based model architecture, which consists of four blocks: embedding, text convolutional neural network, feed-forward network, and classification blocks. This method also adopts an imbalanced learning strategy with a novel multi-label focal dice loss function. multi-label focal dice loss is applied in the ETFC method to solve the inherent imbalance problem in the multi-label dataset and achieve competitive performance. The experimental results state that the ETFC method is significantly better than the existing methods for MFTP prediction. With the established framework, we use the teacher–student-based knowledge distillation to obtain the attention weight from the self-attention mechanism in the MFTP prediction and quantify their contributions toward each of the investigated activities. AVAILABILITY AND IMPLEMENTATION: The source code and dataset are available via: https://github.com/xialab-ahu/ETFC. Oxford University Press 2023-05-22 /pmc/articles/PMC10234765/ /pubmed/37216900 http://dx.doi.org/10.1093/bioinformatics/btad334 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Fan, Henghui
Yan, Wenhui
Wang, Lihua
Liu, Jie
Bin, Yannan
Xia, Junfeng
Deep learning-based multi-functional therapeutic peptides prediction with a multi-label focal dice loss function
title Deep learning-based multi-functional therapeutic peptides prediction with a multi-label focal dice loss function
title_full Deep learning-based multi-functional therapeutic peptides prediction with a multi-label focal dice loss function
title_fullStr Deep learning-based multi-functional therapeutic peptides prediction with a multi-label focal dice loss function
title_full_unstemmed Deep learning-based multi-functional therapeutic peptides prediction with a multi-label focal dice loss function
title_short Deep learning-based multi-functional therapeutic peptides prediction with a multi-label focal dice loss function
title_sort deep learning-based multi-functional therapeutic peptides prediction with a multi-label focal dice loss function
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10234765/
https://www.ncbi.nlm.nih.gov/pubmed/37216900
http://dx.doi.org/10.1093/bioinformatics/btad334
work_keys_str_mv AT fanhenghui deeplearningbasedmultifunctionaltherapeuticpeptidespredictionwithamultilabelfocaldicelossfunction
AT yanwenhui deeplearningbasedmultifunctionaltherapeuticpeptidespredictionwithamultilabelfocaldicelossfunction
AT wanglihua deeplearningbasedmultifunctionaltherapeuticpeptidespredictionwithamultilabelfocaldicelossfunction
AT liujie deeplearningbasedmultifunctionaltherapeuticpeptidespredictionwithamultilabelfocaldicelossfunction
AT binyannan deeplearningbasedmultifunctionaltherapeuticpeptidespredictionwithamultilabelfocaldicelossfunction
AT xiajunfeng deeplearningbasedmultifunctionaltherapeuticpeptidespredictionwithamultilabelfocaldicelossfunction