Cargando…

Krein support vector machine classification of antimicrobial peptides

Antimicrobial peptides (AMPs) represent a potential solution to the growing problem of antimicrobial resistance, yet their identification through wet-lab experiments is a costly and time-consuming process. Accurate computational predictions would allow rapid in silico screening of candidate AMPs, th...

Descripción completa

Detalles Bibliográficos
Autores principales: Redshaw, Joseph, Ting, Darren S. J., Brown, Alex, Hirst, Jonathan D., Gärtner, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: RSC 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10087059/
https://www.ncbi.nlm.nih.gov/pubmed/37065679
http://dx.doi.org/10.1039/d3dd00004d
_version_ 1785022264588107776
author Redshaw, Joseph
Ting, Darren S. J.
Brown, Alex
Hirst, Jonathan D.
Gärtner, Thomas
author_facet Redshaw, Joseph
Ting, Darren S. J.
Brown, Alex
Hirst, Jonathan D.
Gärtner, Thomas
author_sort Redshaw, Joseph
collection PubMed
description Antimicrobial peptides (AMPs) represent a potential solution to the growing problem of antimicrobial resistance, yet their identification through wet-lab experiments is a costly and time-consuming process. Accurate computational predictions would allow rapid in silico screening of candidate AMPs, thereby accelerating the discovery process. Kernel methods are a class of machine learning algorithms that utilise a kernel function to transform input data into a new representation. When appropriately normalised, the kernel function can be regarded as a notion of similarity between instances. However, many expressive notions of similarity are not valid kernel functions, meaning they cannot be used with standard kernel methods such as the support-vector machine (SVM). The Kreĭn-SVM represents generalisation of the standard SVM that admits a much larger class of similarity functions. In this study, we propose and develop Kreĭn-SVM models for AMP classification and prediction by employing the Levenshtein distance and local alignment score as sequence similarity functions. Utilising two datasets from the literature, each containing more than 3000 peptides, we train models to predict general antimicrobial activity. Our best models achieve an AUC of 0.967 and 0.863 on the test sets of each respective dataset, outperforming the in-house and literature baselines in both cases. We also curate a dataset of experimentally validated peptides, measured against Staphylococcus aureus and Pseudomonas aeruginosa, in order to evaluate the applicability of our methodology in predicting microbe-specific activity. In this case, our best models achieve an AUC of 0.982 and 0.891, respectively. Models to predict both general and microbe-specific activities are made available as web applications.
format Online
Article
Text
id pubmed-10087059
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher RSC
record_format MEDLINE/PubMed
spelling pubmed-100870592023-04-12 Krein support vector machine classification of antimicrobial peptides Redshaw, Joseph Ting, Darren S. J. Brown, Alex Hirst, Jonathan D. Gärtner, Thomas Digit Discov Chemistry Antimicrobial peptides (AMPs) represent a potential solution to the growing problem of antimicrobial resistance, yet their identification through wet-lab experiments is a costly and time-consuming process. Accurate computational predictions would allow rapid in silico screening of candidate AMPs, thereby accelerating the discovery process. Kernel methods are a class of machine learning algorithms that utilise a kernel function to transform input data into a new representation. When appropriately normalised, the kernel function can be regarded as a notion of similarity between instances. However, many expressive notions of similarity are not valid kernel functions, meaning they cannot be used with standard kernel methods such as the support-vector machine (SVM). The Kreĭn-SVM represents generalisation of the standard SVM that admits a much larger class of similarity functions. In this study, we propose and develop Kreĭn-SVM models for AMP classification and prediction by employing the Levenshtein distance and local alignment score as sequence similarity functions. Utilising two datasets from the literature, each containing more than 3000 peptides, we train models to predict general antimicrobial activity. Our best models achieve an AUC of 0.967 and 0.863 on the test sets of each respective dataset, outperforming the in-house and literature baselines in both cases. We also curate a dataset of experimentally validated peptides, measured against Staphylococcus aureus and Pseudomonas aeruginosa, in order to evaluate the applicability of our methodology in predicting microbe-specific activity. In this case, our best models achieve an AUC of 0.982 and 0.891, respectively. Models to predict both general and microbe-specific activities are made available as web applications. RSC 2023-02-27 /pmc/articles/PMC10087059/ /pubmed/37065679 http://dx.doi.org/10.1039/d3dd00004d Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Redshaw, Joseph
Ting, Darren S. J.
Brown, Alex
Hirst, Jonathan D.
Gärtner, Thomas
Krein support vector machine classification of antimicrobial peptides
title Krein support vector machine classification of antimicrobial peptides
title_full Krein support vector machine classification of antimicrobial peptides
title_fullStr Krein support vector machine classification of antimicrobial peptides
title_full_unstemmed Krein support vector machine classification of antimicrobial peptides
title_short Krein support vector machine classification of antimicrobial peptides
title_sort krein support vector machine classification of antimicrobial peptides
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10087059/
https://www.ncbi.nlm.nih.gov/pubmed/37065679
http://dx.doi.org/10.1039/d3dd00004d
work_keys_str_mv AT redshawjoseph kreinsupportvectormachineclassificationofantimicrobialpeptides
AT tingdarrensj kreinsupportvectormachineclassificationofantimicrobialpeptides
AT brownalex kreinsupportvectormachineclassificationofantimicrobialpeptides
AT hirstjonathand kreinsupportvectormachineclassificationofantimicrobialpeptides
AT gartnerthomas kreinsupportvectormachineclassificationofantimicrobialpeptides