Cargando…
Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques
BACKGROUND: Cyclic peptide-based drug discovery is attracting increasing interest owing to its potential to avoid target protein depletion. In drug discovery, it is important to maintain the biostability of a drug within the proper range. Plasma protein binding (PPB) is the most important index of b...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311893/ https://www.ncbi.nlm.nih.gov/pubmed/30598072 http://dx.doi.org/10.1186/s12859-018-2529-z |
_version_ | 1783383694490206208 |
---|---|
author | Tajimi, Takashi Wakui, Naoki Yanagisawa, Keisuke Yoshikawa, Yasushi Ohue, Masahito Akiyama, Yutaka |
author_facet | Tajimi, Takashi Wakui, Naoki Yanagisawa, Keisuke Yoshikawa, Yasushi Ohue, Masahito Akiyama, Yutaka |
author_sort | Tajimi, Takashi |
collection | PubMed |
description | BACKGROUND: Cyclic peptide-based drug discovery is attracting increasing interest owing to its potential to avoid target protein depletion. In drug discovery, it is important to maintain the biostability of a drug within the proper range. Plasma protein binding (PPB) is the most important index of biostability, and developing a computational method to predict PPB of drug candidate compounds contributes to the acceleration of drug discovery research. PPB prediction of small molecule drug compounds using machine learning has been conducted thus far; however, no study has investigated cyclic peptides because experimental information of cyclic peptides is scarce. RESULTS: First, we adopted sparse modeling and small molecule information to construct a PPB prediction model for cyclic peptides. As cyclic peptide data are limited, applying multidimensional nonlinear models involves concerns regarding overfitting. However, models constructed by sparse modeling can avoid overfitting, offering high generalization performance and interpretability. More than 1000 PPB data of small molecules are available, and we used them to construct a prediction models with two enumeration methods: enumerating lasso solutions (ELS) and forward beam search (FBS). The accuracies of the prediction models constructed by ELS and FBS were equal to or better than those of conventional non-linear models (MAE = 0.167–0.174) on cross-validation of a small molecule compound dataset. Moreover, we showed that the prediction accuracies for cyclic peptides were close to those for small molecule compounds (MAE = 0.194–0.288). Such high accuracy could not be obtained by a simple method of learning from cyclic peptide data directly by lasso regression (MAE = 0.286–0.671) or ridge regression (MAE = 0.244–0.354). CONCLUSION: In this study, we proposed a machine learning techniques that uses low-dimensional sparse modeling to predict the PPB value of cyclic peptides computationally. The low-dimensional sparse model not only exhibits excellent generalization performance but also improves interpretation of the prediction model. This can provide common an noteworthy knowledge for future cyclic peptide drug discovery studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2529-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6311893 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63118932019-01-07 Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques Tajimi, Takashi Wakui, Naoki Yanagisawa, Keisuke Yoshikawa, Yasushi Ohue, Masahito Akiyama, Yutaka BMC Bioinformatics Research BACKGROUND: Cyclic peptide-based drug discovery is attracting increasing interest owing to its potential to avoid target protein depletion. In drug discovery, it is important to maintain the biostability of a drug within the proper range. Plasma protein binding (PPB) is the most important index of biostability, and developing a computational method to predict PPB of drug candidate compounds contributes to the acceleration of drug discovery research. PPB prediction of small molecule drug compounds using machine learning has been conducted thus far; however, no study has investigated cyclic peptides because experimental information of cyclic peptides is scarce. RESULTS: First, we adopted sparse modeling and small molecule information to construct a PPB prediction model for cyclic peptides. As cyclic peptide data are limited, applying multidimensional nonlinear models involves concerns regarding overfitting. However, models constructed by sparse modeling can avoid overfitting, offering high generalization performance and interpretability. More than 1000 PPB data of small molecules are available, and we used them to construct a prediction models with two enumeration methods: enumerating lasso solutions (ELS) and forward beam search (FBS). The accuracies of the prediction models constructed by ELS and FBS were equal to or better than those of conventional non-linear models (MAE = 0.167–0.174) on cross-validation of a small molecule compound dataset. Moreover, we showed that the prediction accuracies for cyclic peptides were close to those for small molecule compounds (MAE = 0.194–0.288). Such high accuracy could not be obtained by a simple method of learning from cyclic peptide data directly by lasso regression (MAE = 0.286–0.671) or ridge regression (MAE = 0.244–0.354). CONCLUSION: In this study, we proposed a machine learning techniques that uses low-dimensional sparse modeling to predict the PPB value of cyclic peptides computationally. The low-dimensional sparse model not only exhibits excellent generalization performance but also improves interpretation of the prediction model. This can provide common an noteworthy knowledge for future cyclic peptide drug discovery studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2529-z) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-31 /pmc/articles/PMC6311893/ /pubmed/30598072 http://dx.doi.org/10.1186/s12859-018-2529-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Tajimi, Takashi Wakui, Naoki Yanagisawa, Keisuke Yoshikawa, Yasushi Ohue, Masahito Akiyama, Yutaka Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques |
title | Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques |
title_full | Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques |
title_fullStr | Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques |
title_full_unstemmed | Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques |
title_short | Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques |
title_sort | computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311893/ https://www.ncbi.nlm.nih.gov/pubmed/30598072 http://dx.doi.org/10.1186/s12859-018-2529-z |
work_keys_str_mv | AT tajimitakashi computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques AT wakuinaoki computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques AT yanagisawakeisuke computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques AT yoshikawayasushi computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques AT ohuemasahito computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques AT akiyamayutaka computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques |