Cargando…

Machine learning-guided discovery and design of non-hemolytic peptides

Reducing hurdles to clinical trials without compromising the therapeutic promises of peptide candidates becomes an essential step in peptide-based drug design. Machine-learning models are cost-effective and time-saving strategies used to predict biological activities from primary sequences. Their li...

Descripción completa

Detalles Bibliográficos
Autores principales: Plisson, Fabien, Ramírez-Sánchez, Obed, Martínez-Hernández, Cristina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7538962/
https://www.ncbi.nlm.nih.gov/pubmed/33024236
http://dx.doi.org/10.1038/s41598-020-73644-6
_version_ 1783590966951673856
author Plisson, Fabien
Ramírez-Sánchez, Obed
Martínez-Hernández, Cristina
author_facet Plisson, Fabien
Ramírez-Sánchez, Obed
Martínez-Hernández, Cristina
author_sort Plisson, Fabien
collection PubMed
description Reducing hurdles to clinical trials without compromising the therapeutic promises of peptide candidates becomes an essential step in peptide-based drug design. Machine-learning models are cost-effective and time-saving strategies used to predict biological activities from primary sequences. Their limitations lie in the diversity of peptide sequences and biological information within these models. Additional outlier detection methods are needed to set the boundaries for reliable predictions; the applicability domain. Antimicrobial peptides (AMPs) constitute an extensive library of peptides offering promising avenues against antibiotic-resistant infections. Most AMPs present in clinical trials are administrated topically due to their hemolytic toxicity. Here we developed machine learning models and outlier detection methods that ensure robust predictions for the discovery of AMPs and the design of novel peptides with reduced hemolytic activity. Our best models, gradient boosting classifiers, predicted the hemolytic nature from any peptide sequence with 95–97% accuracy. Nearly 70% of AMPs were predicted as hemolytic peptides. Applying multivariate outlier detection models, we found that 273 AMPs (~ 9%) could not be predicted reliably. Our combined approach led to the discovery of 34 high-confidence non-hemolytic natural AMPs, the de novo design of 507 non-hemolytic peptides, and the guidelines for non-hemolytic peptide design.
format Online
Article
Text
id pubmed-7538962
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-75389622020-10-08 Machine learning-guided discovery and design of non-hemolytic peptides Plisson, Fabien Ramírez-Sánchez, Obed Martínez-Hernández, Cristina Sci Rep Article Reducing hurdles to clinical trials without compromising the therapeutic promises of peptide candidates becomes an essential step in peptide-based drug design. Machine-learning models are cost-effective and time-saving strategies used to predict biological activities from primary sequences. Their limitations lie in the diversity of peptide sequences and biological information within these models. Additional outlier detection methods are needed to set the boundaries for reliable predictions; the applicability domain. Antimicrobial peptides (AMPs) constitute an extensive library of peptides offering promising avenues against antibiotic-resistant infections. Most AMPs present in clinical trials are administrated topically due to their hemolytic toxicity. Here we developed machine learning models and outlier detection methods that ensure robust predictions for the discovery of AMPs and the design of novel peptides with reduced hemolytic activity. Our best models, gradient boosting classifiers, predicted the hemolytic nature from any peptide sequence with 95–97% accuracy. Nearly 70% of AMPs were predicted as hemolytic peptides. Applying multivariate outlier detection models, we found that 273 AMPs (~ 9%) could not be predicted reliably. Our combined approach led to the discovery of 34 high-confidence non-hemolytic natural AMPs, the de novo design of 507 non-hemolytic peptides, and the guidelines for non-hemolytic peptide design. Nature Publishing Group UK 2020-10-06 /pmc/articles/PMC7538962/ /pubmed/33024236 http://dx.doi.org/10.1038/s41598-020-73644-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Plisson, Fabien
Ramírez-Sánchez, Obed
Martínez-Hernández, Cristina
Machine learning-guided discovery and design of non-hemolytic peptides
title Machine learning-guided discovery and design of non-hemolytic peptides
title_full Machine learning-guided discovery and design of non-hemolytic peptides
title_fullStr Machine learning-guided discovery and design of non-hemolytic peptides
title_full_unstemmed Machine learning-guided discovery and design of non-hemolytic peptides
title_short Machine learning-guided discovery and design of non-hemolytic peptides
title_sort machine learning-guided discovery and design of non-hemolytic peptides
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7538962/
https://www.ncbi.nlm.nih.gov/pubmed/33024236
http://dx.doi.org/10.1038/s41598-020-73644-6
work_keys_str_mv AT plissonfabien machinelearningguideddiscoveryanddesignofnonhemolyticpeptides
AT ramirezsanchezobed machinelearningguideddiscoveryanddesignofnonhemolyticpeptides
AT martinezhernandezcristina machinelearningguideddiscoveryanddesignofnonhemolyticpeptides