Cargando…

Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method

Peptide toxins generally have extreme pharmacological activities and provide a rich source for the discovery of drug leads. However, determining the optimal activity of a new peptide can be a long and expensive process. In this study, peptide toxins were retrieved from Uniprot; three positive-unlabe...

Descripción completa

Detalles Bibliográficos
Autores principales: Chu, Yanyan, Zhang, Huanhuan, Zhang, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9696491/
https://www.ncbi.nlm.nih.gov/pubmed/36422985
http://dx.doi.org/10.3390/toxins14110811
_version_ 1784838321767186432
author Chu, Yanyan
Zhang, Huanhuan
Zhang, Lei
author_facet Chu, Yanyan
Zhang, Huanhuan
Zhang, Lei
author_sort Chu, Yanyan
collection PubMed
description Peptide toxins generally have extreme pharmacological activities and provide a rich source for the discovery of drug leads. However, determining the optimal activity of a new peptide can be a long and expensive process. In this study, peptide toxins were retrieved from Uniprot; three positive-unlabeled (PU) learning schemes, adaptive basis classifier, two-step method, and PU bagging were adopted to develop models for predicting the biological function of new peptide toxins. All three schemes were embedded with 14 machine learning classifiers. The prediction results of the adaptive base classifier and the two-step method were highly consistent. The models with top comprehensive performances were further optimized by feature selection and hyperparameter tuning, and the models were validated by making predictions for 61 three-finger toxins or the external HemoPI dataset. Biological functions that can be identified by these models include cardiotoxicity, vasoactivity, lipid binding, hemolysis, neurotoxicity, postsynaptic neurotoxicity, hypotension, and cytolysis, with relatively weak predictions for hemostasis and presynaptic neurotoxicity. These models are discovery-prediction tools for active peptide toxins and are expected to accelerate the development of peptide toxins as drugs.
format Online
Article
Text
id pubmed-9696491
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96964912022-11-26 Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method Chu, Yanyan Zhang, Huanhuan Zhang, Lei Toxins (Basel) Article Peptide toxins generally have extreme pharmacological activities and provide a rich source for the discovery of drug leads. However, determining the optimal activity of a new peptide can be a long and expensive process. In this study, peptide toxins were retrieved from Uniprot; three positive-unlabeled (PU) learning schemes, adaptive basis classifier, two-step method, and PU bagging were adopted to develop models for predicting the biological function of new peptide toxins. All three schemes were embedded with 14 machine learning classifiers. The prediction results of the adaptive base classifier and the two-step method were highly consistent. The models with top comprehensive performances were further optimized by feature selection and hyperparameter tuning, and the models were validated by making predictions for 61 three-finger toxins or the external HemoPI dataset. Biological functions that can be identified by these models include cardiotoxicity, vasoactivity, lipid binding, hemolysis, neurotoxicity, postsynaptic neurotoxicity, hypotension, and cytolysis, with relatively weak predictions for hemostasis and presynaptic neurotoxicity. These models are discovery-prediction tools for active peptide toxins and are expected to accelerate the development of peptide toxins as drugs. MDPI 2022-11-21 /pmc/articles/PMC9696491/ /pubmed/36422985 http://dx.doi.org/10.3390/toxins14110811 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chu, Yanyan
Zhang, Huanhuan
Zhang, Lei
Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method
title Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method
title_full Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method
title_fullStr Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method
title_full_unstemmed Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method
title_short Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method
title_sort function prediction of peptide toxins with sequence-based multi-tasking pu learning method
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9696491/
https://www.ncbi.nlm.nih.gov/pubmed/36422985
http://dx.doi.org/10.3390/toxins14110811
work_keys_str_mv AT chuyanyan functionpredictionofpeptidetoxinswithsequencebasedmultitaskingpulearningmethod
AT zhanghuanhuan functionpredictionofpeptidetoxinswithsequencebasedmultitaskingpulearningmethod
AT zhanglei functionpredictionofpeptidetoxinswithsequencebasedmultitaskingpulearningmethod