Cargando…
Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method
Peptide toxins generally have extreme pharmacological activities and provide a rich source for the discovery of drug leads. However, determining the optimal activity of a new peptide can be a long and expensive process. In this study, peptide toxins were retrieved from Uniprot; three positive-unlabe...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9696491/ https://www.ncbi.nlm.nih.gov/pubmed/36422985 http://dx.doi.org/10.3390/toxins14110811 |
_version_ | 1784838321767186432 |
---|---|
author | Chu, Yanyan Zhang, Huanhuan Zhang, Lei |
author_facet | Chu, Yanyan Zhang, Huanhuan Zhang, Lei |
author_sort | Chu, Yanyan |
collection | PubMed |
description | Peptide toxins generally have extreme pharmacological activities and provide a rich source for the discovery of drug leads. However, determining the optimal activity of a new peptide can be a long and expensive process. In this study, peptide toxins were retrieved from Uniprot; three positive-unlabeled (PU) learning schemes, adaptive basis classifier, two-step method, and PU bagging were adopted to develop models for predicting the biological function of new peptide toxins. All three schemes were embedded with 14 machine learning classifiers. The prediction results of the adaptive base classifier and the two-step method were highly consistent. The models with top comprehensive performances were further optimized by feature selection and hyperparameter tuning, and the models were validated by making predictions for 61 three-finger toxins or the external HemoPI dataset. Biological functions that can be identified by these models include cardiotoxicity, vasoactivity, lipid binding, hemolysis, neurotoxicity, postsynaptic neurotoxicity, hypotension, and cytolysis, with relatively weak predictions for hemostasis and presynaptic neurotoxicity. These models are discovery-prediction tools for active peptide toxins and are expected to accelerate the development of peptide toxins as drugs. |
format | Online Article Text |
id | pubmed-9696491 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96964912022-11-26 Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method Chu, Yanyan Zhang, Huanhuan Zhang, Lei Toxins (Basel) Article Peptide toxins generally have extreme pharmacological activities and provide a rich source for the discovery of drug leads. However, determining the optimal activity of a new peptide can be a long and expensive process. In this study, peptide toxins were retrieved from Uniprot; three positive-unlabeled (PU) learning schemes, adaptive basis classifier, two-step method, and PU bagging were adopted to develop models for predicting the biological function of new peptide toxins. All three schemes were embedded with 14 machine learning classifiers. The prediction results of the adaptive base classifier and the two-step method were highly consistent. The models with top comprehensive performances were further optimized by feature selection and hyperparameter tuning, and the models were validated by making predictions for 61 three-finger toxins or the external HemoPI dataset. Biological functions that can be identified by these models include cardiotoxicity, vasoactivity, lipid binding, hemolysis, neurotoxicity, postsynaptic neurotoxicity, hypotension, and cytolysis, with relatively weak predictions for hemostasis and presynaptic neurotoxicity. These models are discovery-prediction tools for active peptide toxins and are expected to accelerate the development of peptide toxins as drugs. MDPI 2022-11-21 /pmc/articles/PMC9696491/ /pubmed/36422985 http://dx.doi.org/10.3390/toxins14110811 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Chu, Yanyan Zhang, Huanhuan Zhang, Lei Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method |
title | Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method |
title_full | Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method |
title_fullStr | Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method |
title_full_unstemmed | Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method |
title_short | Function Prediction of Peptide Toxins with Sequence-Based Multi-Tasking PU Learning Method |
title_sort | function prediction of peptide toxins with sequence-based multi-tasking pu learning method |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9696491/ https://www.ncbi.nlm.nih.gov/pubmed/36422985 http://dx.doi.org/10.3390/toxins14110811 |
work_keys_str_mv | AT chuyanyan functionpredictionofpeptidetoxinswithsequencebasedmultitaskingpulearningmethod AT zhanghuanhuan functionpredictionofpeptidetoxinswithsequencebasedmultitaskingpulearningmethod AT zhanglei functionpredictionofpeptidetoxinswithsequencebasedmultitaskingpulearningmethod |